Skip to main content

5 Takeaways from the Observability Forecast for Media and Entertainment

Nic Benders
New Relic

New research from New Relic shows how observability is being used to inform performance, reliability, and revenue across industries. In 2025, the company surveyed more than 1,700 IT and engineering leaders globally, including 120 from the media and entertainment (M&E) sector, to understand what's working — and where challenges persist with their observability practices.

The findings reveal how M&E organizations are navigating rising platform complexity, audience expectations, and AI-driven change. Below are five takeaways that stand out.

1. Outages cost M&E organizations an average of $2 million per hour

The M&E industry's average incident detection and resolution times — around 30 minutes to detect and 40 minutes to resolve — are in line with the global average, but the stakes are higher for this industry. An outage for an M&E organization can mean that millions of people miss out on a sporting event, a highly anticipated content premiere, or a livestream from their favorite gamer. It only takes minutes for downtime to significantly damage a streaming platform's brand.

Further, organizations that fail to deliver a reliable digital experience face substantial penalties in the form of lost subscriptions and ad revenue. 79% of respondents reported that outages cost one million dollars or more per hour, with the majority (33%) reporting an average cost of one to two million dollars per hour. 40% of respondents point to security failures as the primary cause of outages, followed by network failures (34%) and deployment errors (30%).

2. Observability pays off in less downtime and better customer experiences

Fortunately, M&E organizations report that their investments in observability are paying off in reduced downtime costs, improved ad performance, and subscriber retention. More than a third (39%) say that observability is increasing uptime and reliability, and a similar percentage (36%) say it's directly improving the user experience.

But the benefits of observability extend beyond technical metrics, increasingly helping organizations deliver more effectively on business objectives. Half of leadership respondents say observability helps them hit technical KPIs, and 36% say it strengthens their ability to drive tactical execution.

As a result, more than half (51%) of respondents report a 2-3x return on investment from their observability initiatives, higher than in industries such as finance and retail. And for a third of respondents, that ROI is showing up on their bottom line as business or revenue growth.

3. AI adoption is creating disruption and increasing complexity

M&E organizations have been eager to integrate AI across their businesses, from content recommendation engines and personalization algorithms to incident response, but they're cognizant of the risks it poses to stability and uptime. Nearly a third (30%) of respondents call AI adoption the primary driver of their observability strategy.

AI has complicated the observability landscape by introducing a more complex tech stack that is harder to manage than traditional software. At the same time, AI-powered observability is strengthening resilience by automating manual processes to accelerate incident detection and response. Observability platforms can detect quality drops in real time, roll back problematic deployments, and deliver ads without interruption.

4. Organizations are integrating business data and telemetry

To better quantify the business impact of downtime — and the benefits of observability — M&E organizations are increasingly integrating business data with telemetry. Nearly half (48%) of respondents say they have integrated, or plan to integrate, customer data, allowing leaders to see not just when a service goes down, but how outages immediately affect subscriber retention, ad delivery, and content engagement.

Many are extending this approach beyond customer data, integrating human resources (48%), operations (48%), and logistics data (46%) to create a more complete, real-time view of business health.

5. M&E organizations are making headway on tool sprawl

As software landscapes grow more complex, organizations use several observability tools on average that can handle a sprawling ecosystem of architectural patterns and applications. In M&E, tool sprawl is often exacerbated by the fact that content delivery, ad tech, and backend systems are monitored separately. The median number of observability tools in use by M&E organizations is four, and 18% of organizations run eight or more.

But a shift toward unified observability platforms is helping the industry chip away at tool sprawl. 40% of organizations now use three or fewer tools, up 10% from 2024. By moving away from siloed dashboards and toward unified platforms that provide full-stack visibility, organizations are strengthening their ability to deliver uptime and reliability while spending their observability budgets more efficiently.

Together, these findings show that observability is no longer a technical nice-to-have, but a business-critical capability for M&E organizations. As complexity and audience expectations continue to rise, teams that invest in unified, AI-driven observability will be best positioned to protect revenue, safeguard their brand, and deliver the seamless experiences viewers expect.

Nic Benders is Chief Technical Strategist at New Relic

Hot Topics

The Latest

New Relic surveyed IT and engineering leaders from the media and entertainment (M&E) sector to understand what's working — and where challenges persist with their observability practices. The findings reveal how M&E organizations are navigating rising platform complexity, audience expectations, and AI-driven change. Below are five takeaways that stand out ...

Let me start with something I've seen play out more times than I can count. A team hits a wall with the cloud. Costs creep up, then spike. Performance starts to feel inconsistent. Someone in finance asks a simple question like "why did this double?" and nobody has a clean answer ... Maybe this isn't the right place for everything. That realization feels like a breakthrough, like you've identified the problem. In reality, you've just identified the starting line ...

In MEAN TIME TO INSIGHT Episode 24, Shamus McGillicuddy, VP of Research, Network Infrastructure and Operations, at EMA discusses network observability tool sprawl ... 

In cloud-native systems, scaling is often as simple as moving a slider. For on-premise databases, the stakes are different. Over-provisioning hardware is expensive. Under-provisioning leads to performance bottlenecks that are difficult to fix once the equipment is in the rack ...

When most people think about cybersecurity, they picture firewalls, encryption, and access controls — technical tools designed to protect systems and data. But beneath the technology lies a deeper set of principles about trust, decision-making, and resilience ... The best leaders don't eliminate risk. They manage it intelligently. And in many ways, cybersecurity offers a surprisingly useful playbook for doing exactly that ...

Many organizations assumed their infrastructure strategy was settled. It had been implemented, optimized and built into long-term plans. Recent changes in technology and vendor consolidation are forcing a second look. Cloud outages and licensing changes have exposed how much dependency exists on a small number of platforms. As a result, organizations are reevaluating whether those decisions still hold up under current conditions ...

Edge AI is strategically embedded in core IT and infrastructure spending across industries, according to the 2026 Edge AI Survey from ZEDEDA. The research shows that 83% of C-suite and IT executive respondents say edge AI is important to their core business strategy ...

As AI adoption accelerates, operational complexity — not model intelligence — is becoming the primary barrier to reliable AI at scale, according to the State of AI Engineering 2026 from Datadog ... The report highlights a compounding complexity challenge as AI systems scale ... Around 5% of AI model requests fail in production, with nearly 60% of those failures caused by capacity limits ...

For years, production operations teams have treated alert fatigue as a quality-of-life problem: something that makes on-call rotations miserable but isn't considered a direct contributor to outages. That framing doesn't capture how these systems fail, and we now have data to show why. More importantly, it's now clear alert fatigue is a symptom of a deeper issue: production systems have outgrown the current operational approaches ...

I was on a customer call last fall when an enterprise architect said something I haven't been able to shake. Her team had just spent four months trying to swap one AI vendor for another. The original plan said three weeks. "We didn't switch vendors," she told me. "We rebuilt half our integrations and discovered what we'd actually been depending on." Most enterprise leaders don't expect that to be the experience ...

5 Takeaways from the Observability Forecast for Media and Entertainment

Nic Benders
New Relic

New research from New Relic shows how observability is being used to inform performance, reliability, and revenue across industries. In 2025, the company surveyed more than 1,700 IT and engineering leaders globally, including 120 from the media and entertainment (M&E) sector, to understand what's working — and where challenges persist with their observability practices.

The findings reveal how M&E organizations are navigating rising platform complexity, audience expectations, and AI-driven change. Below are five takeaways that stand out.

1. Outages cost M&E organizations an average of $2 million per hour

The M&E industry's average incident detection and resolution times — around 30 minutes to detect and 40 minutes to resolve — are in line with the global average, but the stakes are higher for this industry. An outage for an M&E organization can mean that millions of people miss out on a sporting event, a highly anticipated content premiere, or a livestream from their favorite gamer. It only takes minutes for downtime to significantly damage a streaming platform's brand.

Further, organizations that fail to deliver a reliable digital experience face substantial penalties in the form of lost subscriptions and ad revenue. 79% of respondents reported that outages cost one million dollars or more per hour, with the majority (33%) reporting an average cost of one to two million dollars per hour. 40% of respondents point to security failures as the primary cause of outages, followed by network failures (34%) and deployment errors (30%).

2. Observability pays off in less downtime and better customer experiences

Fortunately, M&E organizations report that their investments in observability are paying off in reduced downtime costs, improved ad performance, and subscriber retention. More than a third (39%) say that observability is increasing uptime and reliability, and a similar percentage (36%) say it's directly improving the user experience.

But the benefits of observability extend beyond technical metrics, increasingly helping organizations deliver more effectively on business objectives. Half of leadership respondents say observability helps them hit technical KPIs, and 36% say it strengthens their ability to drive tactical execution.

As a result, more than half (51%) of respondents report a 2-3x return on investment from their observability initiatives, higher than in industries such as finance and retail. And for a third of respondents, that ROI is showing up on their bottom line as business or revenue growth.

3. AI adoption is creating disruption and increasing complexity

M&E organizations have been eager to integrate AI across their businesses, from content recommendation engines and personalization algorithms to incident response, but they're cognizant of the risks it poses to stability and uptime. Nearly a third (30%) of respondents call AI adoption the primary driver of their observability strategy.

AI has complicated the observability landscape by introducing a more complex tech stack that is harder to manage than traditional software. At the same time, AI-powered observability is strengthening resilience by automating manual processes to accelerate incident detection and response. Observability platforms can detect quality drops in real time, roll back problematic deployments, and deliver ads without interruption.

4. Organizations are integrating business data and telemetry

To better quantify the business impact of downtime — and the benefits of observability — M&E organizations are increasingly integrating business data with telemetry. Nearly half (48%) of respondents say they have integrated, or plan to integrate, customer data, allowing leaders to see not just when a service goes down, but how outages immediately affect subscriber retention, ad delivery, and content engagement.

Many are extending this approach beyond customer data, integrating human resources (48%), operations (48%), and logistics data (46%) to create a more complete, real-time view of business health.

5. M&E organizations are making headway on tool sprawl

As software landscapes grow more complex, organizations use several observability tools on average that can handle a sprawling ecosystem of architectural patterns and applications. In M&E, tool sprawl is often exacerbated by the fact that content delivery, ad tech, and backend systems are monitored separately. The median number of observability tools in use by M&E organizations is four, and 18% of organizations run eight or more.

But a shift toward unified observability platforms is helping the industry chip away at tool sprawl. 40% of organizations now use three or fewer tools, up 10% from 2024. By moving away from siloed dashboards and toward unified platforms that provide full-stack visibility, organizations are strengthening their ability to deliver uptime and reliability while spending their observability budgets more efficiently.

Together, these findings show that observability is no longer a technical nice-to-have, but a business-critical capability for M&E organizations. As complexity and audience expectations continue to rise, teams that invest in unified, AI-driven observability will be best positioned to protect revenue, safeguard their brand, and deliver the seamless experiences viewers expect.

Nic Benders is Chief Technical Strategist at New Relic

Hot Topics

The Latest

New Relic surveyed IT and engineering leaders from the media and entertainment (M&E) sector to understand what's working — and where challenges persist with their observability practices. The findings reveal how M&E organizations are navigating rising platform complexity, audience expectations, and AI-driven change. Below are five takeaways that stand out ...

Let me start with something I've seen play out more times than I can count. A team hits a wall with the cloud. Costs creep up, then spike. Performance starts to feel inconsistent. Someone in finance asks a simple question like "why did this double?" and nobody has a clean answer ... Maybe this isn't the right place for everything. That realization feels like a breakthrough, like you've identified the problem. In reality, you've just identified the starting line ...

In MEAN TIME TO INSIGHT Episode 24, Shamus McGillicuddy, VP of Research, Network Infrastructure and Operations, at EMA discusses network observability tool sprawl ... 

In cloud-native systems, scaling is often as simple as moving a slider. For on-premise databases, the stakes are different. Over-provisioning hardware is expensive. Under-provisioning leads to performance bottlenecks that are difficult to fix once the equipment is in the rack ...

When most people think about cybersecurity, they picture firewalls, encryption, and access controls — technical tools designed to protect systems and data. But beneath the technology lies a deeper set of principles about trust, decision-making, and resilience ... The best leaders don't eliminate risk. They manage it intelligently. And in many ways, cybersecurity offers a surprisingly useful playbook for doing exactly that ...

Many organizations assumed their infrastructure strategy was settled. It had been implemented, optimized and built into long-term plans. Recent changes in technology and vendor consolidation are forcing a second look. Cloud outages and licensing changes have exposed how much dependency exists on a small number of platforms. As a result, organizations are reevaluating whether those decisions still hold up under current conditions ...

Edge AI is strategically embedded in core IT and infrastructure spending across industries, according to the 2026 Edge AI Survey from ZEDEDA. The research shows that 83% of C-suite and IT executive respondents say edge AI is important to their core business strategy ...

As AI adoption accelerates, operational complexity — not model intelligence — is becoming the primary barrier to reliable AI at scale, according to the State of AI Engineering 2026 from Datadog ... The report highlights a compounding complexity challenge as AI systems scale ... Around 5% of AI model requests fail in production, with nearly 60% of those failures caused by capacity limits ...

For years, production operations teams have treated alert fatigue as a quality-of-life problem: something that makes on-call rotations miserable but isn't considered a direct contributor to outages. That framing doesn't capture how these systems fail, and we now have data to show why. More importantly, it's now clear alert fatigue is a symptom of a deeper issue: production systems have outgrown the current operational approaches ...

I was on a customer call last fall when an enterprise architect said something I haven't been able to shake. Her team had just spent four months trying to swap one AI vendor for another. The original plan said three weeks. "We didn't switch vendors," she told me. "We rebuilt half our integrations and discovered what we'd actually been depending on." Most enterprise leaders don't expect that to be the experience ...