Skip to main content

New Relic Updates SRE Agent

New Relic announced a series of innovations that operationalize AI across the enterprise, empowering teams to focus on business growth instead of firefighting. 

Bolstered by cutting-edge deterministic analytical tools and a suite of new AIOps capabilities, New Relic’s SRE Agent provides next-generation issue triage, change management, incident lifecycle management, intelligent root cause analysis and other features to help engineers cut through data noise and boost operational stability.

“AI is pushing software development beyond human scale, creating a surge of system changes and telemetry volume that IT teams can no longer manage on their own,” said New Relic Chief Product Officer Brian Emerson. “Observability must evolve from simply surfacing data to analyzing it and helping humans take action with less toil. With the new SRE Agent that draws on our powerful AI-strengthened observability capabilities, we’re providing engineers with agentic teammates grounded in live data to resolve incidents faster and with fewer mistakes. The enterprises that win in this era will be those that use AI to cut through noise and optimize business uptime.”

The New Relic SRE Agent helps customers shift operations from reactive to proactive by deploying “always on” AI teammates that diagnose incidents and recommend next steps oftentimes before an engineer acknowledges a page. The agent acts as a specialized worker that performs deep, full-stack diagnostics, combining the flexibility and reasoning capabilities of generative models with “ground truth” brought by a suite of finely-honed deterministic features, such as causal graphs, incident data, performance antipattern knowledge and customer-developed workflows.

The SRE Agent acts as an intelligent context engine for the incident lifecycle. Through Slack and Zoom integrations, responders can query New Relic directly from triage rooms while the SRE Agent captures human context to power automated fact finding, root cause analysis, impact assessments, and reporting. Users gain a unified view of the evolving timeline of events that led up to and following an incident. As a result, they can measure user impact in real-time, identify duplicate incidents, and generate and refine comprehensive post-incident reports.  

The New Relic SRE Agent draws on new Intelligent Observability Platform capabilities including:

  • Intelligent RCA (iRCA): iRCA cuts through the noise by automatically searching the entity's topology graph, scoring the graph using probabilistic causal models, and applying a path-based ranking algorithm to narrow down the problem space in seconds, not hours. By leveraging iRCA, the SRE Agent performs its most time consuming task —separating noise from signal—in high-confidence, deterministic methodologies.
  • Workflow Automation: Intelligent automation that enables teams to automate complex or repetitive operational tasks by creating workflows—structured, multi-step processes that can include conditional logic, human approvals, and integrations with external tools, without writing additional code. DevOps and SREs can improve efficiency by automating everything from notification routing and post-deployment health checks to complex processes such as EC2 instance resizing or Lambda function rollbacks. The SRE Agent will be able to invoke workflows but also be invoked as part of a workflow, which adds an almost endless potential for customization and utility to the mix.

Additional AIOps capabilities now available include:

  • Performance Risks Inbox: Goes beyond reactive application performance monitoring (APM) to show why an incident or outage is about to happen so action can quickly be taken. Proactively detects and groups critical coding anti-patterns, including slow SQL queries, N+1 queries, excessive database queries, and the like which threaten application stability and business continuity.
  • Smart Alerts: An automated alerting engine that uses AI-strengthened anomaly detection and dynamic baselines to reduce alert noise and improve signal quality across complex environments. By delivering more reliable, behavior-aware alerts, it helps teams respond faster and with greater confidence. Use of the capability also lays the groundwork for businesses to maximize agentic AI, ensuring alerts are automated for better deployment of digital workforces.

These innovations are now available in preview to customers as part of the New Relic Intelligent Observability Platform. Workflow Automation is now generally available. 

The Latest

For years, infrastructure teams have treated compute as a relatively stable input. Capacity was provisioned, costs were forecasted, and performance expectations were set based on the assumption that identical resources behaved identically. That mental model is starting to break down. AI infrastructure is no longer behaving like static cloud capacity. It is increasingly behaving like a market ...

Resilience can no longer be defined by how quickly an organization recovers from an incident or disruption. The effectiveness of any resilience strategy is dependent on its ability to anticipate change, operate under continuous stress, and adapt confidently amid uncertainty ...

Mobile users are less tolerant of app instability than ever before. According to a new report from Luciq, No Margin for Error: What Mobile Users Expect and What Mobile Leaders Must Deliver in 2026, even minor performance issues now result in immediate abandonment, lost purchases, and long-term brand impact ...

Artificial intelligence (AI) has become the dominant force shaping enterprise data strategies. Boards expect progress. Executives expect returns. And data leaders are under pressure to prove that their organizations are "AI-ready" ...

Agentic AI is a major buzzword for 2026. Many tech companies are making bold promises about this technology, but many aren't grounded in reality, at least not yet. This coming year will likely be shaped by reality checks for IT teams, and progress will only come from a focus on strong foundations and disciplined execution ...

AI systems are still prone to hallucinations and misjudgments ... To build the trust needed for adoption, AI must be paired with human-in-the-loop (HITL) oversight, or checkpoints where humans verify, guide, and decide what actions are taken. The balance between autonomy and accountability is what will allow AI to deliver on its promise without sacrificing human trust ...

More data center leaders are reducing their reliance on utility grids by investing in onsite power for rapidly scaling data centers, according to the Data Center Power Report from Bloom Energy ...

In MEAN TIME TO INSIGHT Episode 21, Shamus McGillicuddy, VP of Research, Network Infrastructure and Operations, at EMA discusses AI-driven NetOps ... 

Enterprise IT has become increasingly complex and fragmented. Organizations are juggling dozens — sometimes hundreds — of different tools for endpoint management, security, app delivery, and employee experience. Each one needs its own license, its own maintenance, and its own integration. The result is a patchwork of overlapping tools, data stuck in silos, security vulnerabilities, and IT teams are spending more time managing software than actually getting work done ...

2025 was the year everybody finally saw the cracks in the foundation. If you were running production workloads, you probably lived through at least one outage you could not explain to your executives without pulling up a diagram and a whiteboard ...

New Relic Updates SRE Agent

New Relic announced a series of innovations that operationalize AI across the enterprise, empowering teams to focus on business growth instead of firefighting. 

Bolstered by cutting-edge deterministic analytical tools and a suite of new AIOps capabilities, New Relic’s SRE Agent provides next-generation issue triage, change management, incident lifecycle management, intelligent root cause analysis and other features to help engineers cut through data noise and boost operational stability.

“AI is pushing software development beyond human scale, creating a surge of system changes and telemetry volume that IT teams can no longer manage on their own,” said New Relic Chief Product Officer Brian Emerson. “Observability must evolve from simply surfacing data to analyzing it and helping humans take action with less toil. With the new SRE Agent that draws on our powerful AI-strengthened observability capabilities, we’re providing engineers with agentic teammates grounded in live data to resolve incidents faster and with fewer mistakes. The enterprises that win in this era will be those that use AI to cut through noise and optimize business uptime.”

The New Relic SRE Agent helps customers shift operations from reactive to proactive by deploying “always on” AI teammates that diagnose incidents and recommend next steps oftentimes before an engineer acknowledges a page. The agent acts as a specialized worker that performs deep, full-stack diagnostics, combining the flexibility and reasoning capabilities of generative models with “ground truth” brought by a suite of finely-honed deterministic features, such as causal graphs, incident data, performance antipattern knowledge and customer-developed workflows.

The SRE Agent acts as an intelligent context engine for the incident lifecycle. Through Slack and Zoom integrations, responders can query New Relic directly from triage rooms while the SRE Agent captures human context to power automated fact finding, root cause analysis, impact assessments, and reporting. Users gain a unified view of the evolving timeline of events that led up to and following an incident. As a result, they can measure user impact in real-time, identify duplicate incidents, and generate and refine comprehensive post-incident reports.  

The New Relic SRE Agent draws on new Intelligent Observability Platform capabilities including:

  • Intelligent RCA (iRCA): iRCA cuts through the noise by automatically searching the entity's topology graph, scoring the graph using probabilistic causal models, and applying a path-based ranking algorithm to narrow down the problem space in seconds, not hours. By leveraging iRCA, the SRE Agent performs its most time consuming task —separating noise from signal—in high-confidence, deterministic methodologies.
  • Workflow Automation: Intelligent automation that enables teams to automate complex or repetitive operational tasks by creating workflows—structured, multi-step processes that can include conditional logic, human approvals, and integrations with external tools, without writing additional code. DevOps and SREs can improve efficiency by automating everything from notification routing and post-deployment health checks to complex processes such as EC2 instance resizing or Lambda function rollbacks. The SRE Agent will be able to invoke workflows but also be invoked as part of a workflow, which adds an almost endless potential for customization and utility to the mix.

Additional AIOps capabilities now available include:

  • Performance Risks Inbox: Goes beyond reactive application performance monitoring (APM) to show why an incident or outage is about to happen so action can quickly be taken. Proactively detects and groups critical coding anti-patterns, including slow SQL queries, N+1 queries, excessive database queries, and the like which threaten application stability and business continuity.
  • Smart Alerts: An automated alerting engine that uses AI-strengthened anomaly detection and dynamic baselines to reduce alert noise and improve signal quality across complex environments. By delivering more reliable, behavior-aware alerts, it helps teams respond faster and with greater confidence. Use of the capability also lays the groundwork for businesses to maximize agentic AI, ensuring alerts are automated for better deployment of digital workforces.

These innovations are now available in preview to customers as part of the New Relic Intelligent Observability Platform. Workflow Automation is now generally available. 

The Latest

For years, infrastructure teams have treated compute as a relatively stable input. Capacity was provisioned, costs were forecasted, and performance expectations were set based on the assumption that identical resources behaved identically. That mental model is starting to break down. AI infrastructure is no longer behaving like static cloud capacity. It is increasingly behaving like a market ...

Resilience can no longer be defined by how quickly an organization recovers from an incident or disruption. The effectiveness of any resilience strategy is dependent on its ability to anticipate change, operate under continuous stress, and adapt confidently amid uncertainty ...

Mobile users are less tolerant of app instability than ever before. According to a new report from Luciq, No Margin for Error: What Mobile Users Expect and What Mobile Leaders Must Deliver in 2026, even minor performance issues now result in immediate abandonment, lost purchases, and long-term brand impact ...

Artificial intelligence (AI) has become the dominant force shaping enterprise data strategies. Boards expect progress. Executives expect returns. And data leaders are under pressure to prove that their organizations are "AI-ready" ...

Agentic AI is a major buzzword for 2026. Many tech companies are making bold promises about this technology, but many aren't grounded in reality, at least not yet. This coming year will likely be shaped by reality checks for IT teams, and progress will only come from a focus on strong foundations and disciplined execution ...

AI systems are still prone to hallucinations and misjudgments ... To build the trust needed for adoption, AI must be paired with human-in-the-loop (HITL) oversight, or checkpoints where humans verify, guide, and decide what actions are taken. The balance between autonomy and accountability is what will allow AI to deliver on its promise without sacrificing human trust ...

More data center leaders are reducing their reliance on utility grids by investing in onsite power for rapidly scaling data centers, according to the Data Center Power Report from Bloom Energy ...

In MEAN TIME TO INSIGHT Episode 21, Shamus McGillicuddy, VP of Research, Network Infrastructure and Operations, at EMA discusses AI-driven NetOps ... 

Enterprise IT has become increasingly complex and fragmented. Organizations are juggling dozens — sometimes hundreds — of different tools for endpoint management, security, app delivery, and employee experience. Each one needs its own license, its own maintenance, and its own integration. The result is a patchwork of overlapping tools, data stuck in silos, security vulnerabilities, and IT teams are spending more time managing software than actually getting work done ...

2025 was the year everybody finally saw the cracks in the foundation. If you were running production workloads, you probably lived through at least one outage you could not explain to your executives without pulling up a diagram and a whiteboard ...