From the course: DevOps Foundations: Effective Postmortems

Unlock the full course today

Join today to access over 22,600 courses taught by industry experts or purchase this course individually.

Incident metrics

Incident metrics

From the course: DevOps Foundations: Effective Postmortems

Start my 1-month free trial

Incident metrics

- [Instructor] Metrics. Everyone loves metrics. They're what separate us from the animals. Let's talk about incident metrics. But first, a warning. It's very easy to misuse or weaponize incident metrics and get a false sense of having information because you have a number. Be careful how you use them. A well-documented example of metrics going wrong is that if you make a days since last incident metric and make a big deal about it and tie it to group or individual incentives, the very next thing that's going to happen is that minor incidents will go unreported. The classic incident metrics you'll see cited are time to detect or TTD, how long it took to find out about an outage, time to resolve or TTR, how long it took to fix an outage, incident frequency, how many incidents you have, sometimes expressed as time between failures or TBF. And then people love to average them and get the mean values of MTTD, MTTR, and MTBF.…

Contents