Crossrail Place,
Canary Wharf,
E14 5AR, London, UK
Level -2
Tube access
Jubilee, Elizabeth and DLR lines: Canary Wharf station
Imagine a self-healing system that handles surprises, letting you sleep peacefully. If that sounds appealing, chaos engineering could be the answer. Trusted by Netflix, LinkedIn, Google, and Facebook, it's key for business resilience. In this session, we'll explore its history, learn how to apply its principles to stress-test applications, and review tools for fault injection in real-world scenarios.
Agnieszka is a dedicated engineer specializing in cloud solutions, with expertise in cutting-edge technologies such as Kubernetes and Terraform. She also brings valuable insights from her experience in testing activities. Agnieszka is passionate about enhancing people’s wellbeing and promoting a healthy work-life balance. Outside of her professional endeavors, she is a sports freak, board game lover and owner of two adorable cats.
Serverless breaches expose dangerous missteps in securing function chains, IAM policies, and API gateways. We unravel serverless compromises to reveal the overlooked risks lurking in your infrastructureless apps. Arm yourself with actionable lessons to lock down your functions and avoid headlines.
As organizations rapidly adopt serverless, they expose themselves to new risks not secured by traditional controls. Real-world serverless breaches have already exposed overlooked flaws and misconfigurations resulting in catastrophic data theft and service disruption.
In this talk, we conduct a deep forensic analysis of high-profile serverless compromises involving misconfigured S3 buckets, overly permissive functions, vulnerable web frameworks, and Common missteps in serverless permissions and secrets management.
Walk away with actionable lessons to avoid becoming the next serverless breach headline. We’ll provide concrete steps to reduce your attack surface, implement least privilege access, monitor anomalous activity, and instill a “secure by default” posture across your infrastructureless apps.
Babar Khan Akhunzada is a 24-year-old cyber wizard, Founder & CEO of SecurityWall - Stacked on AI & Big Data technology to help enterprises and individuals enhance security capabilities through capability building, risk management and hybrid security audit.
Babar have helped numerous enterprises to tackle financial cyber crime issues on business application and infrastructure scale. Enterprises improved 95% of their security alignments and protected against cyber criminals for zero-paid orders leading to financial crisis.
Babar is acknowledged by well-known tech companies for contributing to their products' security including Adobe, eBay, Apple, Nokia, Microsoft, Oracle, Sony, Redhat, Yahoo, DuckDuckGo, StackOverflow, NextCloud, and a 100+ more.
Babar have many years of experience of bug bounty hunting and ethical work, which make him unique in all at such young age. Babar has served as a Security Consultant for Directorate of Information Technology and was responsible for security matters of Government Data Center during his tenure.
Recently, Babar has been featured in 25-Under-25 as a young high achiever. Babar have been to many conferences and events including BlackHatMEA, High Technology Crime Investigation Association, UC EXPO, OWASP Romania, Cyber Security Indonesia, EC-Council for Annual Halted Conference, HITCON Taiwan and many more.
This session delivers critical insights into leveraging Kubernetes Operators in the data space. I'll cover the Kubernetes operator pattern, KubeBuilder, data governance, how we built our connector ecosystem, and many more.
Building a data streaming platform that can scale with your organization while maintaining developer productivity is challenging. In this talk, I'll unpack the architectural decisions and engineering challenges of building a cloud-native streaming platform from scratch using Kubernetes Operators. Through this journey - from initial design to production scale, we'll explore transforming the operator pattern into a foundational piece of our data infrastructure, powering thousands of streaming workflows daily. Beyond the technical implementation, I'll share insights about building developer experiences that scale. We will cover the architectural patterns used and the extensible connector ecosystem that enabled our platform to handle millions of daily events without changing its core design.
Elad Leev is a Staff Engineer and open-source enthusiast with over 10 years of experience managing complex production systems. He has expertise in distributed systems, databases, and data streaming technologies. Elad is using Kafka in production since 2016, and he is a strong advocate for data streaming solutions, particularly with technologies like Kafka, Flink, Debezium and Kafka Connect.
What does it take to design and launch a DevOps platform capable of running alien code—written by both users and AI—fast, securely, and at scale? This talk dives deep into the journey of creating Triform, a platform redefining DevOps for a new era of AI-driven development.
You’ll learn how we:
Through real-world insights, I’ll share how we transitioned from concept to launch—crafting a platform designed to support complex, dynamic workflows written by humans and AI alike.
Whether you're an engineer building scalable systems, a founder dreaming of your next big idea, or just someone intrigued by cutting-edge technology, this session will provide inspiration, practical takeaways, and a fresh perspective on what DevOps can be in the age of AI.
Join me for a story about innovation, challenges, and how Triform was built to empower the next generation of developers.
Iggy Gullstrand is the CEO and co-founder of Triform, a pioneering platform for building, deploying, and orchestrating large-scale AI agents tailored to meet today’s dynamic production demands. Iggy and Josef Nilsen are currently out on a road trip through Europe with a tiny campervan visiting events and building the platform while meeting up with developers from all over the world.
I'm a member of a platform team that managed to change 3 service mesh solutions during the past 7 years. We did it seamlessly for other 1,500 engineers that work at Avito; our solution manages over 3,000 microservices and > 3 mln RPS. I will share difficulties and lifehacks how we achieved that.
Over the past seven years, our team at Avito has operated two self-implemented service meshes before completing a two-year migration to Istio. Today, our service mesh supports over 3,000 services across dozens of Kubernetes clusters, processing millions of requests per second. This solution is maintained by a single team, while more than 1,500 developers seamlessly work without needing to understand the internals of the service mesh or write Kubernetes manifests. This talk will delve into: - Our two-year journey migrating from one service mesh to another. - Challenges in organizing ingress communication rules. - Accelerating the adoption of Mutual TLS (mTLS) and service authorization. - Testing changes within a service mesh. - Should developers be aware of the service mesh? How we’ve made it transparent to them. - The new possibilities and advantages unlocked by adopting a service mesh. - What can you expect from a service mesh at scale, and how can you choose the right one?
Igor works at Avito, the largest classified advertisement website (MAU over 50 million).
A Platform Engineer who is strongly interested in Kubernetes, observability tools, service meshes, and other modern cloud-native approaches.
Before working as a software engineer, Igor studied distributed systems and participated in ACM competitions.
Building an event-driven system is the easy part. You build producers that produce messages and consumers that consume messages, and you leverage managed services as the message channels between your systems. But what does this mean for your operations? The things that keep your systems online, your users delighted, and your pager quiet at 3 am.
In this session, you'll gain practical knowledge you can apply when building and operating event-driven systems. You'll learn how to test systems that prioritize asynchronous communication, evolve them over time, and, most importantly, observe them and recover from failures when things go wrong. This session will walk through the software development lifecycle for an actual application, and it includes the theory behind the different practices and practical things you can take away and implement in your architectures.
James Eastham is a Serverless Developer Advocate at Datadog. He has over 10 years experience in software, at all layers of the application stack.
He has worked in front-line support, database administration, backend development and with everything from startups to some of the biggest companies in the world architecting systems using AWS technologies.
James produces content on YouTube, focused around building applications with serverless technologies using .NET, Java & Rust.
In this talk, I present a novel, meta-operating system approach to the cloud continuum - showcasing the NebulOuS project vision and the first results that enable cloud continuum ops.
NebulOuS accomplishes substantial research contributions in the realms of cloud continuum brokerage by introducing methods and tools for enabling secure and optimal application provisioning and reconfiguration over the cloud continuum.
NebulOuS project develops a platform that seamlessly exploits edge nodes in conjunction with multi-cloud resources, to cope with the requirements posed by low latency applications. We call this kind of platform a Meta Operating System (MOS) as it works as the layer above traditional operating systems.
The NebulOuS software is published on GitHub at https://github.com/eu-nebulous under an open-source licence: Mozilla Public Licence 2.0. It extensively uses other open-source software published under permissive licences such as Apache License 2.0. Most notably, the core of deployment relies on Kubernetes, KubeVela and Knative. The solution includes the following directions of work: - Modelling methods and tools for describing the cloud continuum, application requirements, and data streams; for assuring the QoS of the provisioned services. - Efficient comparison of available offerings, using appropriate multi-criteria decision-making methods. - Addressing the security aspects emerging in the cloud continuum. - Conducting and monitoring smart contracts-based service level agreements.
Jan is a dedicated Java Developer and Cloud Engineer, currently advancing in his computer science degree. With over three years of involvement in the HORIZON 2020 programs, he has gained substantial experience in developing Java applications and managing cloud infrastructures. Janek's practical knowledge and hands-on approach to cloud technologies and software development make him a valuable asset. His work primarily focuses on implementing scalable cloud-native solutions and ensuring robust system performance. Janek is known for his ability to navigate complex technical challenges, a skill he continuously refines through his academic and project-based experiences.
From Spot Ocean to Karpenter: adjoe's zero-downtime migration story. Learn how we switched autoscalers in production, the challenges we faced along the way, and why we built a custom controller to fix broken nodes.
Is it possible to seamlessly hand over your Kubernetes cluster from one node-autoscaler to another without causing downtime — and, more importantly, should you? In this talk, we’ll dive into how adjoe accomplished this transition, tackling challenges such as scaling test environments outside working hours, navigating the release of Karpenter V1 mid-migration, and even creating a custom controller to manage broken nodes. Join to learn about the obstacles we faced and the valuable benefits we gained along the way.
Marius, a dedicated DevOps Engineer at Hamburg’s fast-growing adtech company, adjoe. Since joining over three years ago, Marius has been on a mission to enhance the reliability and scalability of our backend — an event-driven microservice architecture written in Go and powered by Kubernetes. His passion for coding and Kubernetes doesn’t stop here. Marius also actively contributes to open-source projects, including CoreDNS and Karpenter, constantly pushing his skills and knowledge forward. When he’s not scaling systems or optimizing code, you’ll find him hosting board game nights, tackling bouldering walls, or setting off on adventures across Asia.
DevOps has many benefits for software eng, but is rarely talked about outside of that context. In this talk we’ll explore why DevOps is not a purely technical endeavour, what it means to apply DevOps across the whole organisation, and how you can use these ideas to deliver change where you work.
DevOps is one of those terms whose meaning varies greatly depending on the background, job role, and lived experience of those being asked, but most people would agree that it is something that applies to the development and operation of software. But at some point, DevOps outgrew this simple coming together, and so we combined other functions to get terms like ‘DevSecOps’, ‘DataOps’, ‘MLOps’ and so on. All these terms are, however, too simplistic to capture the actual essence of DevOps and the ways of working that have developed around it. However, there is no reason that these principles should only be applied to the development and operation of digital products and services. They can actually be applied across organisations as a whole to create ‘The DevOps Organisation’. This talk explores how DevOps principles can be taken and applied outside of the context of software development to provide better outcomes, both internally, and externally, for an organisation.
Mark has over 15 years experience in software engineering, working across the SDLC starting out as a software engineer and later transitioning into DevOps and Cloud focused roles. He has been leading Communities of Practice for around 5 years and has worked with numerous technology stacks and across a wide range of industries, including Healthcare, FinTech, and Logistics. He has worked as an AWS consultant to some of the biggest Financial and Insurance firms in the U.K. and is now the DevOps Practice Lead at RiverSafe. In this role he is responsible for leading and building a highly skilled team of DevOps experts, delivering expert services and exceptional outcomes for customers. Mark is especially passionate about serverless technology and sustainability.
July 18th 2020, Max Verstappen qualifies 7th for the Hungarian Grand Prix. With Red Bull fighting Mercedes for the Constructor’s championship and Max fighting Lewis Hamilton and Vallteri Bottas for the Driver’s Championship, this wasn’t his best qualifying session.
July 19th 2020, during the formation lap, Max crashed his car, damaging his front wing and suspension. In the next 22 minutes, Red Bull mechanics performed an absolute miracle.
How did they do it? How did they go from a crashed car to giving Max a chance to fight for a podium in under 23 minutes? And what can we learn about incident management from one of the most demanding engineering disciplines in the world?
Principal Engineer, SRE at FanDuel/Blip.pt. MSc in Computer Science by the University of Porto. CK{AD, A, S} by Cloud Native Computing Foundation (CNCF) | Linux Foundation. {Terraform, Consul, Vault} Associate by HashiCorp. Working daily to build high-performance, reliable and scalable systems. DevOps Porto meetup co-organizer and DevOpsDays Portugal co-organizer. A strong believer in culture and teamwork. Open source passionate, martial arts amateur, and metal lover.
Parenthood often arrives with little time to prepare. Our idea of 'good' parenting usually involves emulating others, whilst hoping we don’t do any permanent damage.
Stepping into management is remarkably similar: we emulate others, but it never feels quite right.
Fortunately, there’s a better way
Simon Copsey is a change consultant, and is on a mission to help organisations understand and unwind complex, cross-functional obstacles that get in the way of their staff, and reduce the ability to serve their customers.
Simon’s career has taken him from being a developer in the trenches to helping various organisations take pragmatic steps from a place of chaos and paralysis, to one where it becomes a little easier to see the wood for the trees.
This session will guide you through the foundational concepts, setup, and best practices to effectively implement OpenTelemetry in your environment. We’ll explore key components like instrumentation libraries and collector configurations, showcasing practical examples and integrations.
Steve Flanders is a Senior Director of Engineering at Splunk, a Cisco company, responsible for the Observability Platform team, which includes contributions to the OpenTelemetry project. He was previously the Head of Product at Omnition (acquired by Splunk). Prior to Omnition, he led log analytics and data collection at VMware. Steve is writing a book called Mastering OpenTelemetry and Observability: Enhancing Application and Infrastructure Performance and Avoiding Outages, which will be available this fall.
When a legacy bank's fintech venture faced operational paralysis, one "simple" question triggered a revolution. Discover how GitOps transformed a tech stack and an entire organisational DNA. This isn't the story of another tool – it's a blueprint for turning chaos into your competitive advantage.
Every transformation story has a trigger point. For one of Britain's oldest banks, it wasn't a system failure or a security breach – it was a single question nobody could answer. This seemingly innocent inquiry exposed the fragility of their fintech venture's operational foundation and sparked a journey that would reshape their entire organisation. This isn't just another technical implementation story. It's a narrative about how GitOps principles catalysed a cultural renaissance within a centuries-old institution. You'll witness how a team transformed from firefighters to innovators and how version control evolved from a code repository to a single source of truth for the entire organisation. Whether you're drowning in operational complexity or simply seeking to future-proof your organisation, this talk will arm you with actionable insights to lead your own transformation. You'll leave understanding how GitOps can be more than a methodology – it can be the foundation of your competitive advantage in the digital age. Leave inspired, equipped, and ready to transform your own chaos into clarity.
Steve Wade founded The Cloud Native Club, a global community for cloud-native enthusiasts. He is also a maintainer of the Flux Terraform Provider. As an experienced conference speaker, independent cloud-native consultant, and trainer, Steve shares his expertise worldwide. He has held platform leadership roles across various industries, including real estate, gaming, fintech, and the UK Parliament. With a BSc in Computer Science, Steve is passionate about cloud-native software development and distributed computing.
Dive into the world of serverless and explore common, costly mistakes and learn actionable tips for cutting down waste and reducing your AWS bill. Whether you're looking to cut down on CloudWatch costs or improve cost-efficiency for your serverless application, we've got some helpful tips for you.
Helpful tips to cut down on waste and reduce AWS cost, including how to keep CloudWatch costs in check, how to implement caching, how to pick the right services for your workload, how to right-size Lambda functions, and so on.
Yan is an experienced engineer who has run production workload at scale in AWS for 10 years. He has been an architect and principal engineer with a variety of industries ranging from banking, e-commerce, sports streaming to mobile gaming. He has worked extensively with AWS Lambda in production, and has been helping companies around the world adopt AWS and serverless as a consultant. Yan is an AWS Serverless Hero and a regular speaker at user groups and conferences internationally, and he is also the author of Production-Ready Serverless and co-author of Serverless Architectures on AWS 2nd Edition, both by Manning. Yan also keeps an active blog at https://theburningmonk.com.