
Datadog launched Bits AI SRE, an AI agent aware of telemetry, architecture, and organizational context that investigates alerts and surfaces actionable root cause in minutes, giving engineers the information they need to confidently resolve incidents faster, save engineering hours, and reduce end-user and business impact.
Bits AI SRE is part of Datadog’s Bits AI, a suite of AI capabilities that works autonomously across critical monitoring, development, and security workflows to help teams resolve application issues in real time.
Powered by the full breadth and depth of the Datadog platform’s data, Bits AI SRE provides an understanding of organizations’ systems to identify and resolve alerts quickly. When an alert fires, Bits AI SRE rapidly analyzes runbooks, telemetry, and more to separate signal from noise and generate hypotheses about the root cause. It validates its own findings, reaches a final conclusion, and delivers that conclusion directly to third-party collaboration tools, all before on-call responders even log in.
What used to take hours of manual troubleshooting can now be done autonomously in minutes by Bits AI SRE, a step toward a future where engineers can focus less on managing incidents and more on building resilient systems.
Designed for enterprise scale, Bits AI SRE supports HIPAA-regulated workloads, includes role-based access controls (RBAC), and features enterprise contracts with trusted AI partners—ensuring organizations adopt AI with confidence and control.
“This launch represents a pivotal expansion of Datadog’s AI strategy as our first generally available AI agent, and signals a new phase of intelligent, automated reliability,” said Yanbing Li, Chief Product Officer at Datadog. “Bits AI SRE allows companies to mitigate issues faster, reduce customer impact, and adopt AI safely. It has already been tested against more than 2,000 customer environments, including both global enterprises and fast-growing start-ups with a diverse range of production environments. Tens of thousands of investigations have run to date, from routine alerts to high-severity incidents, with organizations already reporting positive outcomes. This reflects the tangible and immediate value, tied directly to operational and business outcomes, that we are delivering.”
Bits AI SRE is the first of three AI agents to become generally available to all Datadog users.