Alert for WiredTiger Cache

Hi,

Can you please create an alert for WiredTiger metrics, such as used cache?

We had several cluster instances go over 5% dirty data in the cache and would like to be notified when that happens.

Regards,
Sergei

This is needed in order to determine whether

7 votes

Sergei shared this idea · Apr 28, 2020

Sergei commented · January 19, 2022 10:14 AM

Hi,

Jumping back in after I thought that this link was no longer present.

To elaborate more about the use case:
Currently, Atlas does not show WiredTiger cache usage as a percentage, only the absolute amount used in GB.
We have already had a few incidents of poor instance performance caused by critical (>90%) levels of cache utilization.
Displaying this metric would help us catch these issues before they snowball and affect our system. In addition, if the cause is natural growth in data and workload, we would know that it is time to upgrade the cluster to its next tier.

Sergei
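For readers who want the percentage view in the meantime, here is a minimal sketch of the calculation, assuming you already have the output of the `serverStatus` command as a dictionary. The three `wiredTiger.cache` field names are the ones `serverStatus` reports; the function name and the sample values are hypothetical and only illustrate the arithmetic.

```python
def cache_percentages(server_status: dict) -> dict:
    """Return used and dirty cache fill as percentages of the configured maximum."""
    cache = server_status["wiredTiger"]["cache"]
    max_bytes = cache["maximum bytes configured"]
    used_bytes = cache["bytes currently in the cache"]
    dirty_bytes = cache["tracked dirty bytes in the cache"]
    return {
        "used_pct": 100.0 * used_bytes / max_bytes,
        "dirty_pct": 100.0 * dirty_bytes / max_bytes,
    }


# Hypothetical sample: 8 GiB cache, 7 GiB used, 512 MiB dirty.
sample = {
    "wiredTiger": {
        "cache": {
            "maximum bytes configured": 8 * 1024**3,
            "bytes currently in the cache": 7 * 1024**3,
            "tracked dirty bytes in the cache": 512 * 1024**2,
        }
    }
}
result = cache_percentages(sample)
print(result)
```

Comparing `used_pct` and `dirty_pct` against fixed percentage limits would then work regardless of the instance size.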

Errol commented · June 29, 2020 5:47 PM

Used bytes exceeding 80% of the WT cache, or dirty bytes consistently exceeding 5%, are signs that the system is undersized. I have to set this explicitly for many of our customers, which means performing the calculation and configuring the alert per cluster within each project; there is no way to define it as a global alert within Ops Manager. Moreover, in cases where the WT cache is not the standard (RAM - 1 GB) / 2, there is no easy way to do this calculation.
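The per-cluster calculation described above can be sketched roughly as follows. It assumes the documented WiredTiger default of 50% of (RAM - 1 GB) with a 256 MB floor, treats GB as GiB, and uses the 80% and 5% figures from this comment as default thresholds. The function names are hypothetical, and a cluster with a custom cache size would need its configured value passed in instead.

```python
GIB = 1024**3
MIB = 1024**2


def default_wt_cache_bytes(ram_bytes: int) -> int:
    """Default WiredTiger cache size: larger of 50% of (RAM - 1 GiB) and 256 MiB."""
    return max((ram_bytes - GIB) // 2, 256 * MIB)


def alert_thresholds(cache_bytes: int, used_pct: int = 80, dirty_pct: int = 5) -> dict:
    """Convert percentage thresholds into the absolute byte values an alert needs."""
    return {
        "used_bytes": cache_bytes * used_pct // 100,
        "dirty_bytes": cache_bytes * dirty_pct // 100,
    }


# Example: an instance with 16 GiB of RAM and the default cache size.
cache_16 = default_wt_cache_bytes(16 * GIB)
thresholds_16 = alert_thresholds(cache_16)
print(cache_16, thresholds_16)
```

Because the output depends on RAM, these byte values have to be recomputed whenever the instance size changes, which is the maintenance burden a percentage-based alert would remove.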

Andrew Davidson (Admin; VP, Cloud Products, MongoDB) commented · April 29, 2020 6:53 PM

Hi Sergei,

It looks like your last sentence was cut off: can you provide more detail? It's super helpful to understand how you would use this information and why it's important to you so we can think about the problem space holistically.

Thanks
-Andrew

Eric commented · April 28, 2020 11:09 AM

Note that current cache alerts are based on absolute quantity (bytes, GB, etc.) and not %, whereas performance deals in percentages (cache util 80%, dirty fill 5%). Beyond requiring a calculation, the current need to specify an absolute quantity means that the alert must be manually updated any time the instance size changes.
