32 Rue Blanche
75009 Paris, France
In modern distributed systems, the volume and fragmentation of production data can easily and frequently overwhelm human operators. This talk introduces a LangChain agent built to autonomously investigate production issues by orchestrating multiple tools across organizations’ stacks. We'll walk through how to build a modular, composable multi-tool agent, dig deeper into real-world reliability challenges, and share strategies and lessons learned in making these agents production-ready.
Tomaz is the Founding Engineer at Ewake.ai, working actively on advancing Ewake’s mission of building an AI teammate that brings peace of mind to engineering teams by investigating issues reactively and watching production proactively.
Prior to that, Tomaz worked as a Software Engineer for Doctolib, Benie Saúde, and Abbiamo in Brazil.
From product to community, weak teams block success. This talk explores how team unity gets destroyed via bad habits, poor leadership, and more. With humor and practical advice, learn to spot the dysfunction, laugh at it, and build a stronger, trusted team.
Currently providing technical evangelism for NGINX, Dave McAllister works with DevOps, developers and architects to understand the advantages of modern microservice architectures and orchestration to solve large-scale distributed systems challenges, especially with open source and its innovation. Dave has been a champion for open systems and open source from the early days of Linux, through open distributed file systems like XFS, GFS, and GlusterFS to today's world of clouds and containers. He often speaks on topics such as the real-world issues associated with emerging software architectures and practices, on open source software and on creating new technology companies. Dave has spoken on technical topics like distributed request tracing, modern monitoring practices, open sources projects from both corporations and foundation views, and on how open source innovations powers todays world. Dave was named as one of the top ten pioneers in open source by Computer Business Review, having cut his teeth on Linux and compilers before the phrase "open source" was coined. Well versed in trivia, he won a Golden Penguin in 2002. When he's not talking, you can find him hiking with his trusty camera, trying to keep up with his wife.
Hannah Ouellette is an experienced community leader and strategist with a strong background in developer relations, event production, and technical community building. Currently serving as Senior Community Manager at NGINX, she brings over three years of experience fostering engagement and scaling global community programs. Prior to this, she led technical community efforts at Postman, where she empowered developers through events, content, and meaningful connections, while also championing DEI initiatives.
With earlier roles at Kong Inc., DroneDeploy, and Take 3 Presents, Hannah has worn many hats—from marketing and demand generation to immersive event production and community policy. Her career reflects a passion for creating inclusive, vibrant spaces where people connect, grow, and contribute meaningfully.
Observability is the cornerstone of reliable systems. It lets teams identify and resolve issues before they impact a broader group of users. Yet building an ideal observability stack is far from easy. It demands time and effort, instrumenting every app, service, and component that emits telemetry. Many teams default to “Store’em All - just in case”, logs that no one reads, traces that no one queries, metrics that never inform action. The result? Costs escalate, operational clarity fades, and ROI on observability tends to plateau or even decline. So, shouldn’t we be asking ourselves: are we really investing in observability, or just paying for some distributed noise?
The issue isn’t lack of telemetry; it’s unchecked volume without purpose. This talk explores the telemetry pipeline as a strategy to take back control. At the OpenTelemetry Collector level, we can filter, transform, sample, redact sensitive data, and route telemetry with intent. The goal is to extract clear business value from every signal and every dollar spent. By aligning observability with outcomes, we get an adaptive, efficient, and cost-aware setup. Whether you’re just starting out or operating at scale, this talk will show how to turn observability into a strategic asset instead of a liability.
Yash is a software engineer and researcher with a deep interest in distributed systems. His focus is on observability and performance, areas where he constantly seeks new insights. As an active advocate of OpenTelemetry, Yash contributes to both the project and the wider community. Outside of tech, he’s an avid explorer, whether in the kitchen experimenting with new recipes or traveling the world to taste diverse cuisines.
OpenTelemetry Semantic conventions cover many layers of your stack but fall flat when it comes to business logic. But this doesn’t have to be the case! The OpenTelemetry Weaver project gives you the tools to build your own semantic conventions. With auto generated instrumentation libraries and documentation, developers no longer have to worry about whether an attribute is called customer_id, customerID, accountNumber or something completely different - it’s all in the schema! Built-in support for policy validation also ensures your alerts never break because a metric has been renamed.
Dominik started his journey in technology as an SRE, working on projects ranging from warehouse logistics and photobook designers to analyzing satellite imagery. During this time, he discovered his passion for developer tooling and making sure developers can focus on what they do best - build great software! Now he is working as a Developer Experience Engineer at Grafana Labs, building tools to see clearly in the ever-changing world of software.
AI is making its way into platform engineering—not just as a workload, but as a smart automation layer for how platforms are built, operated, and optimized. Promises of intelligent autoscaling, self-tuning systems, and AI-assisted remediation are everywhere. But how do these claims hold up in real-world Kubernetes environments?
In this deeply technical session, the latest generation of AI-powered features and patterns will be benchmarked and stress-tested in the context of platform operations. From scaling decisions to observability-driven automation and adaptive infrastructure behavior, the focus will be on how these systems perform under load, how they handle edge cases, and what the operational overhead truly looks like.
Attendees will walk away with a clear-eyed view of the strengths and limitations of AI-driven platforms, grounded in data—not just demos.
Because if your infrastructure says “I’ll be back,” it’s worth knowing what it’s planning.
Annie Talvasto is an award-winning international technology speaker and leader. She has spoken at over 100+ tech events worldwide, including KubeCon + CloudNativeCon and Microsoft Build & Ignite. She has been recognized with the CNCF Ambassador, Azure & AI Platform MVP awards. She has co-organized the Kubernetes & CNCF Finland meetup since 2017. In the past, she has also served as a track chair for KubeCon + CloudNativeCon, Program Chair for Secure AI Summit (powered by Cloud Native) and has been hosting Cloud Native Live, a weekly livestream by CNCF, since 2021.
How many times were you woken up during the night to either spend more time than you would like trying to figure out what exactly broke, or just bash your keyboard in frustration once you figure out it was actually a false positive? What if there was a better way? I mean, AI is everywhere nowadays, what if we used it to solve these problems? Let's talk about AIOps and, through real-world case studies and industry research, see why it enables companies to reduce their MTTR by 62%, cut their alert noise by 91%, and predict 87% of potential service degradations — before they impact customers. We will take a journey through the evolution of AIOps: from early AI analytics, to the rise of GenAI, and taking us to the latest promised savior: Agents. But how do Agents help us when incidents come to knock? Will they speed us up? Are they even able to fix incidents without having to wake us up? And even if Agents are here to change everything, where are the gaps? Can everything be "agentified"?
Daniel Afonso is a Senior Developer Advocate at PagerDuty, SolidJS DX team member, Instructor at Egghead.io, and Author of State Management with React Query. Daniel has a full-stack background, having worked with different languages and frameworks on various projects from IoT to Fraud Detection. He is passionate about learning and teaching and has spoken at multiple conferences around the world about topics he loves. In his free time, when he's not learning new technologies or writing about them, he's probably reading comics or watching superhero movies and shows.
SREs are on the frontlines of uptime, performance, cost efficiency, and incident response. So, at times, policies for security and compliance often live in stale docs enforced inconsistently, if at all, until something breaks or someone has an audit. Policy as Code (PaC) replaces that mess with real-time automation right where the work happens.
We will discuss how PaC bridges the gap between security and operations by making policies transparent, codified, versioned, and customizable so you can enforce external standards (CIS, SOC 2, HIPAA) and your own internal rules (like requiring HA in prod but not in dev). You'll see how to apply policy intent consistently from the pipeline directly to runtime, giving teams proactive control, not reactive fire drills or unnecessary "vibe killers".
We'll walk through a full-lifecycle live demo to see the possibilities of PaC to: - Prevent misconfigurations with real-time checks - Enforce policies consistently across repos, infra, and runtime - Customize and codify controls that reflect unique needs - Control cloud costs through autoscaling, right-sizing, and cleanup - Unify security, dev, and ops with shared policies
What will we accomplish? Well, no more stale docs or arguments around policies, faster deploy, tighter cost controls, and safer infrastructure without slowing anyone down.
Alayshia Knighten is a seasoned engineering leader and customer success strategist with a strong background in DevOps, cloud architecture, and technical enablement. Currently serving as the Founding Principal Customer Success Engineer at Mondoo, she brings over a decade of experience helping organizations build and scale secure, efficient, and reliable infrastructure.
Previously, Alayshia held key roles at Pulumi as a Senior Customer Architect and at Honeycomb.io, where she led implementation engineering and partnerships architecture. Her expertise spans across engineering consulting at Chef Software and hands-on DevOps engineering at Verisign. Passionate about empowering teams and driving technical excellence, Alayshia is known for bridging the gap between engineering and customer success.
Coming soon...
Matthieu Blumberg is Senior Vice President of Engineering at Criteo, where he leads Infrastructure, Security, and Internal IT initiatives to drive business transformation and empower teams with a world-class digital workplace. With over 14 years at Criteo and a strong background in engineering leadership and cybersecurity, Matthieu plays a key role in scaling technology platforms and ensuring the integrity and efficiency of global operations.
Production today is messy. There’s noise, complexity, and a constant stream of change. And while we’ve come a long way with observability, it still leans heavily on human foresight. Logs, metrics, alerts, they’re all things we had to think of ahead of time. But when we don’t? That’s where blind spots are born.
Ambient agents try to shift that model. These are always-on, proactive teammates who don’t wait for a prompt. They listen to everything happening in production. They surface things we’d likely miss.
In this talk, we’ll dive into what it takes to bring an ambient agent into your stack, how it listens, learns, and acts, and why this might just be the layer of intelligence your system’s been missing.
Pooné Mokari is the CEO and co-founder of Ewake.ai, an AI Reliability Teammate on a mission to bring real peace of mind to engineering teams. Drawing on her experience as an SRE at Criteo, she founded Ewake to offer engineers their dream teammate, which investigates issues reactively and watches production proactively. Throughout her career, she was active as a speaker in different tech conferences, such as Devoxx Belgium and Devoxx France. She’s also been engaged in mentoring women in tech.