Creating self-healing software systems via effective usage of telemetry data and AI agents

Modern software systems operate in complex, dynamic environments where failures are inevitable. Traditional monitoring and manual incident response are no longer sufficient to ensure resilience or customer satisfaction. This talk explores how to design and implement self-healing software systems by combining telemetry data with an AI-driven agentic approach. We’ll start by examining how high-quality telemetry forms the foundation for detecting anomalies and predicting failures. Next, we’ll show how modern GenAI (LLMs) can transform this telemetry into actionable insights for AI agents that interpret data, pinpoint root causes, and apply automated fixes. Through a practical, real-world example, you’ll see how telemetry and AI work together to create adaptive feedback loops that continuously improve system reliability, while freeing engineers from repetitive operational tasks.

Related Talks

Step-up Authentication using Amazon Cognito on LocalStack

LocalStack Applications in Developer Hub provides sample templates to help LocalStack users adopt real-world scenarios to rapidly and conveniently create, configure, and deploy applications locally. Getting started with Step-up-authentication demoIn this demo, we will setup a step-up authentication workflow for a higher level of security, deployed using Cloud Development Kit on LocalStack

Watch recording
Watch recording
Infrastructure-as-Code with LocalStack—Building & testing your configurations locally!

Infrastructure-as-Code refers to the practice of defining and provisioning cloud resources using code and automation scripts, thus eliminating the need for manual configurations. With frameworks like AWS CloudFormation, AWS CDK (Cloud Development Kit), AWS Serverless Application Model (SAM), Pulumi, and Terraform, users can specify their desired infrastructure state in code, enabling rapid and consistent deployment of resources.However, as with any code, IaC scripts require thorough testing to ensure their correctness and proper functionality. Traditional cloud environments for testing can be expensive, slow, and error-prone due to complexities in provisioning and configuration. With LocalStack, you can leverage a local emulation of various cloud services, such as S3, DynamoDB, EKS, and more!LocalStack simulates these cloud services on a developer's machine, allowing for comprehensive and efficient testing of IaC scripts before deployment to actual cloud environments. In this video, we explain how you can use LocalStack to be more efficient and cost-effective at testing these major IaC frameworks:• Terraform• Pulumi• Cloud Development Kit• CloudFormation• Serverless Application ModelAs organizations will continue to embrace IaC, cloud emulation framework like LocalStack will play an increasingly vital role in ensuring the quality and robustness of cloud infrastructure implementations.

Watch recording
Watch recording
Getting started with LocalStack's Extension for Docker Desktop

LocalStack's Docker Extension allows developers to manage and run cloud applications locally within Docker Desktop efficiently. With a fully-integrated experience with features such as configuration profiles, container logs, and more, developers can now easily manage their LocalStack instance.In this video, Harsh from LocalStack discusses the LocalStack Docker Extension and how you can capitalize on an intuitive user experience to manage your LocalStack image, configuration profiles, and container logs directly within the Docker Desktop.

Watch recording
Watch recording

Launch yourself in the world of local cloud development

Try for free
Try for free
Talk to Sales
Talk to Sales