SREDAY

Site Reliability, DevOps and Cloud

November 7, 2025 ING Cedar, Amsterdam, Netherlands

1
Days
25+
Speakers
1
Tracks
200
Attendees

Patterns and Practices for Building Resilient AWS Serverless Applications

Yan Cui
Lumigo

Lambda provides multi-AZ support out of the box, but even then, things can still go wrong in production.

Region-wide outages and performance degradations can render your applications non-responsive. And what if you're dealing with downstream systems that aren't as scalable as your system and can't handle the load you put on them?

The bottom line is that many things can go wrong, and they often do at the worst of times. The goal of building resilient systems is not to prevent failures, but to design systems that can withstand them.

In this talk, we will examine several practices and architectural patterns that can help you build more resilient serverless architectures, such as multi-region design, employing DLQs and surge queues and cell-based architectures.

We'll also explore how chaos experiments can help us identify failure modes before they happen in production.

Yan is an experienced engineer who has run production workload at scale on AWS since 2010. He has been an architect and principal engineer in a variety of industries ranging from banking, e-commerce, sports streaming to mobile gaming. He has worked extensively with AWS Lambda in production since 2015. Nowadays, he splits his time between advancing the state of serverless observability as a Developer Advocate at lumigo.io and helping companies around the world adopt serverless as an independent consultant.

Yan is also an AWS Serverless Hero and a regular speaker at user groups and conferences internationally. He is the author of Production-Ready Serverless and co-author of Serverless Architectures on AWS, 2nd Edition. And he keeps an active blog at theburningmonk.com and hosts a serverless-focused podcast at realworldserverless.com.

Sponsors & Partners

Want to become a sponsor? Get in touch!