Skip to main content

New Relic Updates SRE Agent

New Relic announced a series of innovations that operationalize AI across the enterprise, empowering teams to focus on business growth instead of firefighting. 

Bolstered by cutting-edge deterministic analytical tools and a suite of new AIOps capabilities, New Relic’s SRE Agent provides next-generation issue triage, change management, incident lifecycle management, intelligent root cause analysis and other features to help engineers cut through data noise and boost operational stability.

“AI is pushing software development beyond human scale, creating a surge of system changes and telemetry volume that IT teams can no longer manage on their own,” said New Relic Chief Product Officer Brian Emerson. “Observability must evolve from simply surfacing data to analyzing it and helping humans take action with less toil. With the new SRE Agent that draws on our powerful AI-strengthened observability capabilities, we’re providing engineers with agentic teammates grounded in live data to resolve incidents faster and with fewer mistakes. The enterprises that win in this era will be those that use AI to cut through noise and optimize business uptime.”

The New Relic SRE Agent helps customers shift operations from reactive to proactive by deploying “always on” AI teammates that diagnose incidents and recommend next steps oftentimes before an engineer acknowledges a page. The agent acts as a specialized worker that performs deep, full-stack diagnostics, combining the flexibility and reasoning capabilities of generative models with “ground truth” brought by a suite of finely-honed deterministic features, such as causal graphs, incident data, performance antipattern knowledge and customer-developed workflows.

The SRE Agent acts as an intelligent context engine for the incident lifecycle. Through Slack and Zoom integrations, responders can query New Relic directly from triage rooms while the SRE Agent captures human context to power automated fact finding, root cause analysis, impact assessments, and reporting. Users gain a unified view of the evolving timeline of events that led up to and following an incident. As a result, they can measure user impact in real-time, identify duplicate incidents, and generate and refine comprehensive post-incident reports.  

The New Relic SRE Agent draws on new Intelligent Observability Platform capabilities including:

  • Intelligent RCA (iRCA): iRCA cuts through the noise by automatically searching the entity's topology graph, scoring the graph using probabilistic causal models, and applying a path-based ranking algorithm to narrow down the problem space in seconds, not hours. By leveraging iRCA, the SRE Agent performs its most time consuming task —separating noise from signal—in high-confidence, deterministic methodologies.
  • Workflow Automation: Intelligent automation that enables teams to automate complex or repetitive operational tasks by creating workflows—structured, multi-step processes that can include conditional logic, human approvals, and integrations with external tools, without writing additional code. DevOps and SREs can improve efficiency by automating everything from notification routing and post-deployment health checks to complex processes such as EC2 instance resizing or Lambda function rollbacks. The SRE Agent will be able to invoke workflows but also be invoked as part of a workflow, which adds an almost endless potential for customization and utility to the mix.

Additional AIOps capabilities now available include:

  • Performance Risks Inbox: Goes beyond reactive application performance monitoring (APM) to show why an incident or outage is about to happen so action can quickly be taken. Proactively detects and groups critical coding anti-patterns, including slow SQL queries, N+1 queries, excessive database queries, and the like which threaten application stability and business continuity.
  • Smart Alerts: An automated alerting engine that uses AI-strengthened anomaly detection and dynamic baselines to reduce alert noise and improve signal quality across complex environments. By delivering more reliable, behavior-aware alerts, it helps teams respond faster and with greater confidence. Use of the capability also lays the groundwork for businesses to maximize agentic AI, ensuring alerts are automated for better deployment of digital workforces.

These innovations are now available in preview to customers as part of the New Relic Intelligent Observability Platform. Workflow Automation is now generally available. 

The Latest

Cloud migration was supposed to be a one-way door. For most enterprises, it turns out it isn't. Cloud data repatriation is a real and growing trend. A new survey ... finds that 89% of organizations plan to expand their on-premises infrastructure footprint over the next two years — and 75% have already moved at least some workloads back from public cloud in the past 24 months. The findings point to a broad rethinking of where data belongs ...

Over the past few years, large language models (LLMs) have revolutionized the software industry. Given their ability to excel at multi-step reasoning, LLMs have helped enterprises streamline workflows and adapt to the unknown. However, employing such models comes with sky-high costs, latency issues, and limited flexibility. In the realm of IT operations, it is generally wiser to employ smaller, domain-specific models instead ...

For years, DevOps teams operated under a simple assumption: collect enough telemetry, and you can find and fix any problem. That assumption is breaking down. Modern enterprises now operate across microservices, hybrid cloud environments, APIs, Kubernetes, and highly automated delivery pipelines. Releases happen continuously, dependencies shift constantly, and failures spread faster than teams can diagnose them ...

New Relic surveyed IT and engineering leaders from the media and entertainment (M&E) sector to understand what's working — and where challenges persist with their observability practices. The findings reveal how M&E organizations are navigating rising platform complexity, audience expectations, and AI-driven change. Below are five takeaways that stand out ...

Let me start with something I've seen play out more times than I can count. A team hits a wall with the cloud. Costs creep up, then spike. Performance starts to feel inconsistent. Someone in finance asks a simple question like "why did this double?" and nobody has a clean answer ... Maybe this isn't the right place for everything. That realization feels like a breakthrough, like you've identified the problem. In reality, you've just identified the starting line ...

In MEAN TIME TO INSIGHT Episode 24, Shamus McGillicuddy, VP of Research, Network Infrastructure and Operations, at EMA discusses network observability tool sprawl ... 

In cloud-native systems, scaling is often as simple as moving a slider. For on-premise databases, the stakes are different. Over-provisioning hardware is expensive. Under-provisioning leads to performance bottlenecks that are difficult to fix once the equipment is in the rack ...

When most people think about cybersecurity, they picture firewalls, encryption, and access controls — technical tools designed to protect systems and data. But beneath the technology lies a deeper set of principles about trust, decision-making, and resilience ... The best leaders don't eliminate risk. They manage it intelligently. And in many ways, cybersecurity offers a surprisingly useful playbook for doing exactly that ...

Many organizations assumed their infrastructure strategy was settled. It had been implemented, optimized and built into long-term plans. Recent changes in technology and vendor consolidation are forcing a second look. Cloud outages and licensing changes have exposed how much dependency exists on a small number of platforms. As a result, organizations are reevaluating whether those decisions still hold up under current conditions ...

Edge AI is strategically embedded in core IT and infrastructure spending across industries, according to the 2026 Edge AI Survey from ZEDEDA. The research shows that 83% of C-suite and IT executive respondents say edge AI is important to their core business strategy ...

New Relic Updates SRE Agent

New Relic announced a series of innovations that operationalize AI across the enterprise, empowering teams to focus on business growth instead of firefighting. 

Bolstered by cutting-edge deterministic analytical tools and a suite of new AIOps capabilities, New Relic’s SRE Agent provides next-generation issue triage, change management, incident lifecycle management, intelligent root cause analysis and other features to help engineers cut through data noise and boost operational stability.

“AI is pushing software development beyond human scale, creating a surge of system changes and telemetry volume that IT teams can no longer manage on their own,” said New Relic Chief Product Officer Brian Emerson. “Observability must evolve from simply surfacing data to analyzing it and helping humans take action with less toil. With the new SRE Agent that draws on our powerful AI-strengthened observability capabilities, we’re providing engineers with agentic teammates grounded in live data to resolve incidents faster and with fewer mistakes. The enterprises that win in this era will be those that use AI to cut through noise and optimize business uptime.”

The New Relic SRE Agent helps customers shift operations from reactive to proactive by deploying “always on” AI teammates that diagnose incidents and recommend next steps oftentimes before an engineer acknowledges a page. The agent acts as a specialized worker that performs deep, full-stack diagnostics, combining the flexibility and reasoning capabilities of generative models with “ground truth” brought by a suite of finely-honed deterministic features, such as causal graphs, incident data, performance antipattern knowledge and customer-developed workflows.

The SRE Agent acts as an intelligent context engine for the incident lifecycle. Through Slack and Zoom integrations, responders can query New Relic directly from triage rooms while the SRE Agent captures human context to power automated fact finding, root cause analysis, impact assessments, and reporting. Users gain a unified view of the evolving timeline of events that led up to and following an incident. As a result, they can measure user impact in real-time, identify duplicate incidents, and generate and refine comprehensive post-incident reports.  

The New Relic SRE Agent draws on new Intelligent Observability Platform capabilities including:

  • Intelligent RCA (iRCA): iRCA cuts through the noise by automatically searching the entity's topology graph, scoring the graph using probabilistic causal models, and applying a path-based ranking algorithm to narrow down the problem space in seconds, not hours. By leveraging iRCA, the SRE Agent performs its most time consuming task —separating noise from signal—in high-confidence, deterministic methodologies.
  • Workflow Automation: Intelligent automation that enables teams to automate complex or repetitive operational tasks by creating workflows—structured, multi-step processes that can include conditional logic, human approvals, and integrations with external tools, without writing additional code. DevOps and SREs can improve efficiency by automating everything from notification routing and post-deployment health checks to complex processes such as EC2 instance resizing or Lambda function rollbacks. The SRE Agent will be able to invoke workflows but also be invoked as part of a workflow, which adds an almost endless potential for customization and utility to the mix.

Additional AIOps capabilities now available include:

  • Performance Risks Inbox: Goes beyond reactive application performance monitoring (APM) to show why an incident or outage is about to happen so action can quickly be taken. Proactively detects and groups critical coding anti-patterns, including slow SQL queries, N+1 queries, excessive database queries, and the like which threaten application stability and business continuity.
  • Smart Alerts: An automated alerting engine that uses AI-strengthened anomaly detection and dynamic baselines to reduce alert noise and improve signal quality across complex environments. By delivering more reliable, behavior-aware alerts, it helps teams respond faster and with greater confidence. Use of the capability also lays the groundwork for businesses to maximize agentic AI, ensuring alerts are automated for better deployment of digital workforces.

These innovations are now available in preview to customers as part of the New Relic Intelligent Observability Platform. Workflow Automation is now generally available. 

The Latest

Cloud migration was supposed to be a one-way door. For most enterprises, it turns out it isn't. Cloud data repatriation is a real and growing trend. A new survey ... finds that 89% of organizations plan to expand their on-premises infrastructure footprint over the next two years — and 75% have already moved at least some workloads back from public cloud in the past 24 months. The findings point to a broad rethinking of where data belongs ...

Over the past few years, large language models (LLMs) have revolutionized the software industry. Given their ability to excel at multi-step reasoning, LLMs have helped enterprises streamline workflows and adapt to the unknown. However, employing such models comes with sky-high costs, latency issues, and limited flexibility. In the realm of IT operations, it is generally wiser to employ smaller, domain-specific models instead ...

For years, DevOps teams operated under a simple assumption: collect enough telemetry, and you can find and fix any problem. That assumption is breaking down. Modern enterprises now operate across microservices, hybrid cloud environments, APIs, Kubernetes, and highly automated delivery pipelines. Releases happen continuously, dependencies shift constantly, and failures spread faster than teams can diagnose them ...

New Relic surveyed IT and engineering leaders from the media and entertainment (M&E) sector to understand what's working — and where challenges persist with their observability practices. The findings reveal how M&E organizations are navigating rising platform complexity, audience expectations, and AI-driven change. Below are five takeaways that stand out ...

Let me start with something I've seen play out more times than I can count. A team hits a wall with the cloud. Costs creep up, then spike. Performance starts to feel inconsistent. Someone in finance asks a simple question like "why did this double?" and nobody has a clean answer ... Maybe this isn't the right place for everything. That realization feels like a breakthrough, like you've identified the problem. In reality, you've just identified the starting line ...

In MEAN TIME TO INSIGHT Episode 24, Shamus McGillicuddy, VP of Research, Network Infrastructure and Operations, at EMA discusses network observability tool sprawl ... 

In cloud-native systems, scaling is often as simple as moving a slider. For on-premise databases, the stakes are different. Over-provisioning hardware is expensive. Under-provisioning leads to performance bottlenecks that are difficult to fix once the equipment is in the rack ...

When most people think about cybersecurity, they picture firewalls, encryption, and access controls — technical tools designed to protect systems and data. But beneath the technology lies a deeper set of principles about trust, decision-making, and resilience ... The best leaders don't eliminate risk. They manage it intelligently. And in many ways, cybersecurity offers a surprisingly useful playbook for doing exactly that ...

Many organizations assumed their infrastructure strategy was settled. It had been implemented, optimized and built into long-term plans. Recent changes in technology and vendor consolidation are forcing a second look. Cloud outages and licensing changes have exposed how much dependency exists on a small number of platforms. As a result, organizations are reevaluating whether those decisions still hold up under current conditions ...

Edge AI is strategically embedded in core IT and infrastructure spending across industries, according to the 2026 Edge AI Survey from ZEDEDA. The research shows that 83% of C-suite and IT executive respondents say edge AI is important to their core business strategy ...