SREDAY

Site Reliability, DevOps and Cloud

March 27-28, 2025 London, UK

2 Days
50+ Speakers
6 Tracks
200 Attendees

Incident Groundhog Day

Stuart Rimell
Uptime Labs

Learning how to respond effectively to incidents is hard. One of the reasons is that we never see the same incident twice. While we can learn vital lessons during and after an incident, we can’t hop into a time machine and apply those lessons to the same incident to discover their impact. What if we could experience the same incident over and over again? What might we learn? This talk describes a ‘staged world’ experiment in which 20 incident managers separately experienced the same simulated incident affecting a fictitious e-commerce company. We discuss what we noticed that differentiated some incident responders from others, and some surprising things that we expected to see, but didn’t.

Stuart Rimell is a transformation leader at IG Group, driving operational change, enterprise architecture, and agile adoption. He leads IG’s largest efficiency program, optimizing client services. Previously, he built enterprise architecture, agile frameworks, and real-time trading platforms. He also advises startups on product management, specializing in fintech and high-performance systems.
