Complex systems are interconnected. When an incident occurs, it isn’t triggered by one single event, but rather a series of events leading up to that one failure. Yet teams employ root cause analysis (RCA) as the primary means to identify one ultimate, sufficient cause.
Robert Blumen, Lead DevOps Engineer at Salesforce, discusses the idea that it’s not the single root cause, but rather the series of events that should be more closely examined. He reviews why humans are cognitively drawn to RCA, research and examples on event analysis, and argues that it’s the “how” — not the “why” — we should explore when systems fail.
Learning Objectives:
- Learn how to identify issues without Root Cause Analysis
- Discover “how” to explore systems when they fail
"The PagerDuty Operations Cloud is critical for TUI. This is what is actually going to help us grow as a business when it comes to making sure that we provide quality services for our customers."
- Yasin Quareshy, Head of Technology at TUI