In our world where everything is code, reliability extends beyond clean and reliable code running on the right infrastructure. It requires a robust sociotechnical system, the dynamic interplay between social and technical components. Our North Star is an engineering culture, built on shared beliefs, practices and behaviours that shape how we operate, solve problems, collaborate, innovate and continuously learn.
But how do we achieve this engineering culture, where innovation and success are driven by collaboration, trust, autonomy, and passion? What is our code of conduct and how does it propel us forward?
In this talk, we will take you through our 5-year journey in Reliability Advocacy. We started out as a group of 10 enthusiastic engineers from different platform and enablement teams who wanted to share knowledge of our reliability product offering and practices across the organization. We are now a household name, run our annual Reliability Event conference which are attended by 300+ engineers every year and our reliability trainings are part of the curriculum for all new joiners and we've guided hundreds of engineers on their path to defining SLIs and SLOs. In part, thanks to our reliability advocates, reliability is now one of our engineering pillars for years to come shaping our organization in the process, raising our general availability rating from 99.54% to 99,87% in the process.
We will share what worked for us, what didn’t, and how we gradually embedded ourselves into our large, regulated, and risk-averse organization with over 15.000 engineers. We will reveal the code that helped us shape our SRE practices and make this transformation possible. In doing so, we will share a 5-step plan to start a reliability advocacy function within your organization.
Stephan Mousset is the Global Product Manager for the Performance & Resilience Engineering Platform at ING, where he also serves as Lead Reliability Advocate. With nearly two decades of experience at ING, Stephan has led major efforts in performance testing automation, SLI/SLO adoption, and reliability engineering at scale.
He is an active voice in the DevOps and SRE communities — co-organizing DevOpsDays Amsterdam, Site Reliability Engineering NL, and ING’s internal Reliability Event.
Stephan has shared his insights at multiple conferences, including: - SLOconf 2022: ING’s global SLO rollout and lessons learned - SRE NL Meetup at ING (2023): Building a platform for SLO-driven performance engineering - DevOpsDays Berlin 2024 (Ignite): "Unleashing DevOps Magic in Performance Testing & Analysis Automation"
His talks focus on practical, platform-enabled approaches to making reliability and resilience part of everyday engineering — through automation, observability, and culture.