
Datadog introduced three new AI agents that perform interactive investigations and asynchronous code fixes for development, security and operations teams.
The launch of the Bits AI SRE, Bits AI Dev Agent and Bits AI Security Analyst agents, alongside the new Proactive App Recommendations and APM Investigator capabilities, marks the continued evolution of Bits AI, Datadog’s generative AI assistant that helps engineers resolve application issues in real time.
Datadog’s new domain-specific AI agents, unveiled at DASH, are each trained to be experts in incident response, product development and security. The agents are built on a flexible system of shared tasks—core capabilities such as querying data, analyzing anomalies or scaling infrastructure that can be reused across agents. The architecture allows Datadog to build and deploy new agents quickly while maintaining a consistent and powerful user experience. This is combined with expansive, high-quality observability data, enabling Datadog’s AI capabilities to operate with context and precision, and deliver insights and actions to eliminate risk.
“Datadog is uniquely positioned to deliver value with AI as a platform that has a wealth of clean, rich data—we process trillions of data points and are embedded in our customers’ critical engineering, developer and security workflows,” said Yanbing Li, Chief Product Officer at Datadog. “With these advancements in AI reasoning and multi-modality, we’ve gone beyond helping organizations understand their availability, security, performance and reliability. We now enable human-in-the-middle workflows by guiding customers on what to look for and where to start looking, and augment their ability to take action.”
The new AI agents announced at DASH are:
- Bits AI SRE, a 24x7 on-call responder that is now in Limited Availability. For all alerts, Bits AI SRE performs early triage using telemetry and service context to surface initial investigation findings all before responders log in. It assigns appropriate owners, aligns all parties with real-time incident summaries and status updates, and proactively suggests next steps. It also generates a first draft of the incident post-mortem to save responders time.
- Bits AI Dev Agent, now in Preview, which detects issues, generates code fixes, and opens pull requests tailored to organizations’ technology stack to allow users to quickly review and merge changes directly within their SCM. As a result, engineers don’t just have software that surfaces errors—they have an AI teammate that helps fix the issue and improve productivity for human resources.
- Bits AI Security Analyst, which autonomously triages Cloud SIEM (security information and event management) signals, conducts in-depth investigations of potential threats and delivers reasoned resolution recommendations without human prompting. Bits AI Security Analyst, now in Preview, automates investigations and reduces response times, fundamentally transforming how organizations process security signals.
Additional Applied AI capabilities in preview include:
- Proactive App Recommendations, which continuously analyzes telemetry Datadog already collects to suggest high-impact fixes and the next best action. Whether it’s optimizing a slow query, addressing inefficient code paths or catching recurring exceptions, APM Recommendations shows developers exactly where to improve performance, reduce errors and cut down resource usage—all before users are affected.
- APM Investigator, to help engineers troubleshoot and resolve latency spikes faster. It automates a previously manual process by identifying bottlenecks, scoping impact, highlighting patterns across slow traces, suggesting likely causes and proposing a fix.
The Latest
Artificial intelligence (AI) is core to observability practices, with some 41% of respondents reporting AI adoption as a core driver of observability, according to the State of Observability for Financial Services and Insurance report from New Relic ...
Application performance monitoring (APM) is a game of catching up — building dashboards, setting thresholds, tuning alerts, and manually correlating metrics to root causes. In the early days, this straightforward model worked as applications were simpler, stacks more predictable, and telemetry was manageable. Today, the landscape has shifted, and more assertive tools are needed ...
Cloud adoption has accelerated, but backup strategies haven't always kept pace. Many organizations continue to rely on backup strategies that were either lifted directly from on-prem environments or use cloud-native tools in limited, DR-focused ways ... Eon uncovered a handful of critical gaps regarding how organizations approach cloud backup. To capture these prevailing winds, we gathered insights from 150+ IT and cloud leaders at the recent Google Cloud Next conference, which we've compiled into the 2025 State of Cloud Data Backup ...
Private clouds are no longer playing catch-up, and public clouds are no longer the default as organizations recalibrate their cloud strategies, according to the Private Cloud Outlook 2025 report from Broadcom. More than half (53%) of survey respondents say private cloud is their top priority for deploying new workloads over the next three years, while 69% are considering workload repatriation from public to private cloud, with one-third having already done so ...
As organizations chase productivity gains from generative AI, teams are overwhelmingly focused on improving delivery speed (45%) over enhancing software quality (13%), according to the Quality Transformation Report from Tricentis ...
Back in March of this year ... MongoDB's stock price took a serious tumble ... In my opinion, it reflects a deeper structural issue in enterprise software economics altogether — vendor lock-in ...
In MEAN TIME TO INSIGHT Episode 15, Shamus McGillicuddy, VP of Research, Network Infrastructure and Operations, at EMA discusses Do-It-Yourself Network Automation ...
Zero-day vulnerabilities — security flaws that are exploited before developers even know they exist — pose one of the greatest risks to modern organizations. Recently, such vulnerabilities have been discovered in well-known VPN systems like Ivanti and Fortinet, highlighting just how outdated these legacy technologies have become in defending against fast-evolving cyber threats ... To protect digital assets and remote workers in today's environment, companies need more than patchwork solutions. They need architecture that is secure by design ...
Traditional observability requires users to leap across different platforms or tools for metrics, logs, or traces and related issues manually, which is very time-consuming, so as to reasonably ascertain the root cause. Observability 2.0 fixes this by unifying all telemetry data, logs, metrics, and traces into a single, context-rich pipeline that flows into one smart platform. But this is far from just having a bunch of additional data; this data is actionable, predictive, and tied to revenue realization ...
64% of enterprise networking teams use internally developed software or scripts for network automation, but 61% of those teams spend six or more hours per week debugging and maintaining them, according to From Scripts to Platforms: Why Homegrown Tools Dominate Network Automation and How Vendors Can Help, my latest EMA report ...