pause
An option to protect the "Pause" action on clusters with 2FA would help to reduce the chance of accidental pausing of production systems.
-
Kaz commented
Hi Andrew,
Yes we did experience an accidental pause, and it was immediately unpaused, but it took about 5 minutes to come back online, during which time we had an outage.
Anything that could be done to increase the checks needed to pause production clusters would be helpful
-
Hi Kaz,
Out of curiosity, did you experience an accidental pause of a production system? One thing we do for termination is require the user to type in the cluster name--that could be something we'd do here but hadn't seen it as quite necessary since Pause is a reversible operation and so we instead protect it with an informational modal and large red button. Knowing that this has not been enough is a valuable data point.
-Andrew