Add metrics to monitor CPU credits for burstable performance Atlas clusters

Add metrics to Atlas for tracking burstable CPU credit spend for M10 and M20 cluster tier instances. Additional add support for creating alerts based on these metrics.

16 votes

Ryan Shockley shared this idea · Jun 10, 2022 · Report… · Admin →

An error occurred while saving the comment

Justas commented · July 25, 2025 3:07 AM · Report

In our case the M20 CPU "Steal %" metric jumped from 0% to 80% within a few minutes effectively causing a denial of service. Being able to view how often we use credits and how many are left, ideally even set alerts would go a long way.

Submitting...
Tom commented · February 23, 2023 7:46 AM · Report

We use M10/M20 instances (AWS backend), which accumulate CPU credits over time. The problem is that when Atlas nodes run out of CPU credits, performance goes down and we see a lot of CPU steals. This is a common issue with AWS EC2 instances. We need a way to monitor this CPU credit balance on Atlas so that we can plan in advance (before the problem happens).

Submitting...
Rafael Orta commented · June 13, 2022 11:45 AM · Report

The goal here is to identify when the CPU and Network credits are or are getting close to exhausted and the instance is going to be throttled down to its base.

Submitting...

How can we improve the platform?

Add metrics to monitor CPU credits for burstable performance Atlas clusters

Feedback

Atlas: Monitoring and Metrics

Feedback and Knowledge Base

Searching…

Give feedback

Add metrics to monitor CPU credits for burstable performance Atlas clusters

We're glad you're here

We're glad you're here

We're glad you're here

We're glad you're here

We're glad you're here

Atlas: Monitoring and Metrics

Categories

Searching…

Give feedback