Creating self-healing software systems via effective usage of telemetry data and AI agents

Modern software systems operate in complex, dynamic environments where failures are inevitable. Traditional monitoring and manual incident response are no longer sufficient to ensure resilience or customer satisfaction. This talk explores how to design and implement self-healing software systems by combining telemetry data with an AI-driven agentic approach. We’ll start by examining how high-quality telemetry forms the foundation for detecting anomalies and predicting failures. Next, we’ll show how modern GenAI (LLMs) can transform this telemetry into actionable insights for AI agents that interpret data, pinpoint root causes, and apply automated fixes. Through a practical, real-world example, you’ll see how telemetry and AI work together to create adaptive feedback loops that continuously improve system reliability, while freeing engineers from repetitive operational tasks.

Related Talks

Testing Cloud Apps Shouldn’t Be This Hard

Testing in the cloud = slow builds, fragile staging, surprise bills.Let’s talk about how developers are flipping the script and using local cloud environments to test smarter, faster, and cheaper without breaking production.Bonus: You’ll learn how LocalStack lets you simulate AWS on your machine. Game changer.

Watch recording

Wait, Building in the Cloud Is This Complicated?

Testing in the cloud = slow builds, fragile staging, surprise bills.Let’s talk about how developers are flipping the script and using local cloud environments to test smarter, faster, and cheaper — without breaking production.Bonus: You’ll learn how LocalStack lets you simulate AWS on your machine. Game changer.

Watch recording

How Cloud Developers Actually Work Without Breaking Everything

Real cloud developers aren’t pushing straight to AWS.They’re building and testing everything locally before a single deploy goes live.This episode breaks down the modern cloud dev workflow and how tools like LocalStack make it possible to move fast without burning money (or trust).Learn how local-first dev culture is changing the cloud game.

Watch recording

Launch yourself in the world of local cloud development

Try for free

Talk to Sales