From the course: DevOps Foundations: Incident Management

Unlock the full course today

Join today to access over 22,600 courses taught by industry experts or purchase this course individually.

Your incident toolchain

Your incident toolchain

From the course: DevOps Foundations: Incident Management

Start my 1-month free trial

Your incident toolchain

- [Instructor] There's no tool that will fix incidents for you. There are unexpected failure modes, and therefore, the domain of experts that need to put thought into the resolution. However, these experts will need a set of tools to help them detect, investigate, coordinate response to, and communicate about incidents. Before going into any specific tools, the most important attributes that all these tools can have is that they are commonly accessible by everyone participating in incidents; their use and role is defined in the process; and, all responders are trained in their operation. It doesn't matter how good a tool is in isolation. If many incident participants can't access it, don't understand how it works, or use it for the wrong thing, it will fail you. Let's look at tools in the first three areas of incident response: detection, escalation, and communication. Monitoring systems and making sure those systems are omitting actionable events is too large of a subject to cover…

Contents