An option to protect the "Pause" action on clusters with 2FA would help to reduce the chance of accidental pausing of production systems.
Yes, we did experience an accidental pause. It was unpaused immediately, but the cluster took about 5 minutes to come back online, and we had an outage during that time.
Anything that could be done to increase the checks needed to pause production clusters would be helpful.
Out of curiosity, did you experience an accidental pause of a production system? One thing we do for termination is require the user to type in the cluster name. That could be something we'd do here, but we hadn't seen it as necessary since Pause is a reversible operation, so we instead protect it with an informational modal and a large red button. Knowing that this has not been enough is a valuable data point.
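
For anyone wondering what a type-the-cluster-name check could look like if applied to Pause, here is a minimal sketch. It is not the product's actual implementation: the function names `name_matches` and `pause_cluster` and the returned status strings are hypothetical, chosen only to illustrate the idea.

```python
def name_matches(cluster_name: str, typed_name: str) -> bool:
    """Return True only when the typed text matches the cluster name.

    Hypothetical helper: surrounding whitespace is ignored, but the
    comparison stays case-sensitive so a near-miss does not pass.
    """
    return typed_name.strip() == cluster_name


def pause_cluster(cluster_name: str, typed_name: str) -> str:
    """Pause the cluster only if the user retyped its name correctly.

    Hypothetical wrapper: a real system would call its own pause
    operation where this sketch returns "paused".
    """
    if not name_matches(cluster_name, typed_name):
        return "aborted"
    return "paused"
```

With this check, a stray click followed by an empty or mistyped confirmation returns "aborted" instead of pausing the cluster.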