Modern software systems operate in complex, dynamic environments where failures are inevitable. Traditional monitoring and manual incident response are no longer sufficient to ensure resilience or customer satisfaction. This talk explores how to design and implement self-healing software systems by combining telemetry data with an AI-driven agentic approach. We’ll start by examining how high-quality telemetry forms the foundation for detecting anomalies and predicting failures. Next, we’ll show how modern GenAI (LLMs) can transform this telemetry into actionable insights for AI agents that interpret data, pinpoint root causes, and apply automated fixes. Through a practical, real-world example, you’ll see how telemetry and AI work together to create adaptive feedback loops that continuously improve system reliability, while freeing engineers from repetitive operational tasks.

Testing in the cloud = slow builds, fragile staging, surprise bills.Let’s talk about how developers are flipping the script and using local cloud environments to test smarter, faster, and cheaper without breaking production.Bonus: You’ll learn how LocalStack lets you simulate AWS on your machine. Game changer.

Testing in the cloud = slow builds, fragile staging, surprise bills.Let’s talk about how developers are flipping the script and using local cloud environments to test smarter, faster, and cheaper — without breaking production.Bonus: You’ll learn how LocalStack lets you simulate AWS on your machine. Game changer.

Real cloud developers aren’t pushing straight to AWS.They’re building and testing everything locally before a single deploy goes live.This episode breaks down the modern cloud dev workflow and how tools like LocalStack make it possible to move fast without burning money (or trust).Learn how local-first dev culture is changing the cloud game.