Games We Play to Improve on Incident Response

22. Září 2021

Řečníci

O prezentaci

Incident Response is a core competency for many teams, but how can teams practice and improve? Inefficient incident response can be costly to a company. It causes lost revenue and destroys customer trust. We will discuss mainstream games, a conceptual frameworks for creating team specific drills, and finally introduce an innovative research topic - outage simulation. An outage simulator gives on-call teams a tool for practicing incident response. Core incident response skills are: severity triage, communication, delegation, and system familiarity. Drilling on these increases these skills, knowledge, efficiency, team cohesion and resilience. DevOps/SRE Managers and ICs will learn why games such as “Keep Talking and Nobody Explodes” are played by many SRE teams. They will learn in detail how to create their own fire-drills from existing runbook entries. An overview of chaos testing and gamedays will be mentioned to provide the broader context. We will cover the pros and cons of each of these methods. Lastly, we will present the concept of an incident response simulator as an open research topic. The hypothesis will be that an incident simulator is a good trade-off between non-domain specific games and full-blown Gamedays.

Organizátor

Kategorie

O organizátorovi (DevOpsDays Houston)

Devopsdays is a worldwide series of technical conferences covering topics of software development, IT infrastructure operations, and the intersection between them.

Uložení prezentace

Měla by být tato prezentace uložena po dobu 1000 let?

Jak ukládáme prezentace

Pro uložení prezentace do věčného trezoru hlasovalo 0 diváků, což je 0.0 %

Sdílení

Doporučená videa

Prezentace na podobné téma, kategorii nebo přednášejícího

Zajímají Vás podobná videa? Sledujte DevOpsDays Houston