Creating self-healing software systems via effective usage of telemetry data and AI agents

Modern software systems operate in complex, dynamic environments where failures are inevitable. Traditional monitoring and manual incident response are no longer sufficient to ensure resilience or customer satisfaction. This talk explores how to design and implement self-healing software systems by combining telemetry data with an AI-driven agentic approach. We’ll start by examining how high-quality telemetry forms the foundation for detecting anomalies and predicting failures. Next, we’ll show how modern GenAI (LLMs) can transform this telemetry into actionable insights for AI agents that interpret data, pinpoint root causes, and apply automated fixes. Through a practical, real-world example, you’ll see how telemetry and AI work together to create adaptive feedback loops that continuously improve system reliability, while freeing engineers from repetitive operational tasks.

Related Talks

Develop and test your AI-powered cloud apps locally with LocalStack

Running AI/ML workloads in the cloud can be expensive, opaque, and difficult to iterate on. LocalStack changes this by enabling engineers to develop and test AI-powered cloud applications entirely locally, emulating services like SageMaker, Bedrock, Redshift, and Snowflake. In this presentation, Waldemar Hummer, CTO of LocalStack, demonstrates how to prototype and validate AI & ML data pipelines safely and cost-effectively using LocalStack’s cloud emulators. You’ll see how to emulate complex AI workflows, test integrations, and use “vibe coding” techniques confidently in a fully sandboxed local environment.

Watch recording
Run Your First AWS App Locally with LocalStack

Get started with LocalStack in minutes! In this episode, we’ll clone a ready-to-go AWS sample app, spin up LocalStack, and run everything locally without a cloud account. By the end, you’ll have a fully functional event-driven app running on your machine. 📖 Want more? Check out the companion blog post for a deeper breakdown of the architecture, project structure, and all the commands you’ll need to run it yourself. 🔗 https://blog.localstack.cloud/why-i-run-my-serverless-apps-locally/

Watch recording
From Local to Production: The CI/CD Flow That Doesn’t Suck

This is where it all comes together. CI/CD lets dev teams ship code automatically, but only if your pipeline is built to handle the cloud. In this episode, I show you how local testing + automated deployment = cloud apps that ship faster, safer, and smarter. Stick around to the end: this is the final episode of WTH is the Cloud?!

Watch recording
