Skip to main content

Why Observability Is the Missing Piece in Your Business Growth

Mimi Shalash
Splunk

For years, "observability" has been a backstage function. A quiet force that keeps digital systems running. What once lived deep in the data center is now at the center of every digital strategy. Even back in 2020, Gartner foreshadowed this shift, defining observability as "the evolution of monitoring into a process that offers insight into digital business applications, innovation, and customer experience."

That prediction has become even more relevant in an AI-driven world.

Every digital customer interaction, every cloud deployment, and every AI model depends on the same foundation: the ability to see, understand, and act on data in real time.

Digital moments like a mobile purchase, a supply chain handoff, or an AI inference run through complex layers of infrastructure. When that infrastructure falters, so does the business. Recent data from Splunk confirms that 74% of the business leaders believe observability is essential to monitoring critical business processes, and 66% feel it's key to understanding user journeys.

Because while the unknown is inevitable, observability makes it manageable. Let's explore why.

The Symbiotic Relationship Between AI and Observability

Image
Splunk

AI and observability are now inseparable. Data is the common language between them, and when used together, they amplify each other's strengths. AI helps observability teams detect patterns faster and gives engineers time back to focus on what truly moves the business forward: building better products and improving customer experiences.

Still, most teams aren't there yet. Many ITOps and engineering groups struggle with too many disconnected tools and an overload of false alerts, keeping them in a constant state of reactivity. This is the structural challenge that has persisted across enterprises: fragmented telemetry, inconsistent context, and decentralized standards.

It's no surprise, then, that organizations are turning to AI to correlate signals, reduce noise, and surface what matters most. Because prediction without observability is just speculation, and no business can afford to guess.

Splunk's research shows that 76% of practitioners now use AI regularly in daily operations, and 78% say it gives them more time to focus on innovation instead of maintenance. Yet every advancement brings new complexity. Humans in the loop are now responsible for ensuring model performance, and 47% of observability professionals say monitoring AI workloads has made their jobs more challenging, with 40% citing a lack of expertise as a barrier to AI readiness.

This gap represents a strategic opportunity. Organizations that upskill observability teams to measure AI performance and manage data quality will build a foundation of clean, governed, and trusted data that spans the entire enterprise. That means going beyond traditional IT telemetry to include operational technology (OT), IoT, sensor, and other machine data that power critical business systems.

The convergence of these once disparate data domains represents one of the most transformative opportunities in modern observability. Whether it's connecting insights from the factory floor, ERP systems, or even turbine sensors, organizations can finally uncover cross-functional intelligence that drives predictive action and measurable business outcomes.

Unlocking the Business Catalyst in Your Observability Practice

Realizing that vision requires strengthening the foundations of observability. Let's discuss four:

Minimize War Rooms and Reactivity: Many organizations still default to large, cross-functional escalations that duplicate effort and prolong mean time to resolution (MTTR). In fact, 1 in 5 respondents said they "often" or "always" start a war room that includes various departments. A more effective model emphasizes coordinated isolation and parallel response. When ITOps, engineering, and security teams visualize data through a common lens, they can trace the source of an issue faster and determine ownership. For example, when performance degradation in a key app is detected, shared telemetry allows engineering to see that the latency originates in an overloaded API gateway, not the database or underlying infrastructure. It's a simple example, but one that highlights how service mapping can triage, so people can focus on resolution, not reaction.

Get Alerting Under Control: False alerts drain engineering focus and erode trust in monitoring systems. Mature organizations address this by implementing adaptive thresholding, which dynamically adjusts alert parameters based on historical trends, system baselines, and seasonality. For example, instead of triggering dozens of CPU utilization alerts during routine batch processing every night, adaptive thresholding automatically adjusts expectations based on historical behavior. Managing alert suppression without removing early indicators of degradation is as much about data discipline as it is about process. When thresholds, alerts, and suppression logic are governed transparently and evolve with the environment, organizations build the foundation of data needed for higher levels of maturity and ultimately, AI readiness.

Lay the Foundation for Good Data That Reaps AI Benefits: Nearly half of respondents (48%) cite poor data quality as a barrier to achieving AI readiness. When engineering teams align on common data models, standardized collection practices, and comprehensive data coverage that reflects the full complexity of their environments, they establish consistent, reliable inputs for AI systems. The future belongs to organizations that can aggregate and contextualize all machine data from traditional homegrown applications, commercial off-the-shelf applications to environmental signals like temperature, vibration, and motion. When every data source speaks a common language, AI systems will be the catalyst for a new era of operational intelligence grounded in the full reality of the enterprise.

Embrace Forward Looking Architectures: The next evolution of observability is about building architectures that can adapt as fast as the systems they monitor. Organizations are investing in open and extensible technologies such as OpenTelemetry, code profiling, and observability as code to future-proof their data strategy. These approaches establish portability across environments, reduce vendor dependency, and embed observability into the software delivery lifecycle itself. OpenTelemetry, for example, is quickly becoming the industry standard for collecting, normalizing, and enriching telemetry data across hybrid and multicloud ecosystems. By adopting it early, teams can ensure consistency in how data is defined and exchanged, which sets the stage for complementary frameworks like Machine Communication Protocol (MCP). Together, these standards will be the future of advanced analytics, AI workflows, and autonomous operational systems.

Image
Splunk

Realizing Tangible Business Growth

When organizations take an innovative and responsible approach to observability, they create a foundation of agility and resilience that enables them to thrive through disruption and change. While the pace of innovation accelerates, the anchors of business success remain constant: building exceptional products, elevating customer experiences, and delivering measurable ROI that strengthens the bottom line.

In a world defined by data and driven by AI, observability is no longer just about visibility. It's now about vision.

Mimi Shalash is Observability Advisor at Splunk, a Cisco company

The Latest

Enterprises today operate in a real-time environment where uninterrupted access to trusted data has become a baseline expectation for users, applications and automated systems. Traditional DataOps models, built on manual effort and human triage, cannot keep pace with this always active demand. AI agents are emerging as the operational backbone, ensuring consistent data availability, reinforcing trustworthiness and enabling a level of scale that manual processes cannot achieve ...

For decades, trust in the digital workplace rested on familiar signals. We trusted faces on video calls, voices on the phone, and emails that appeared to come from people we knew. These cues felt human and intuitive. They anchored how decisions were made, approvals were granted, and access was authorized. AI-powered deepfakes have quietly broken that model ...

Cloud migration was supposed to be a one-way door. For most enterprises, it turns out it isn't. Cloud data repatriation is a real and growing trend. A new survey ... finds that 89% of organizations plan to expand their on-premises infrastructure footprint over the next two years — and 75% have already moved at least some workloads back from public cloud in the past 24 months. The findings point to a broad rethinking of where data belongs ...

Over the past few years, large language models (LLMs) have revolutionized the software industry. Given their ability to excel at multi-step reasoning, LLMs have helped enterprises streamline workflows and adapt to the unknown. However, employing such models comes with sky-high costs, latency issues, and limited flexibility. In the realm of IT operations, it is generally wiser to employ smaller, domain-specific models instead ...

For years, DevOps teams operated under a simple assumption: collect enough telemetry, and you can find and fix any problem. That assumption is breaking down. Modern enterprises now operate across microservices, hybrid cloud environments, APIs, Kubernetes, and highly automated delivery pipelines. Releases happen continuously, dependencies shift constantly, and failures spread faster than teams can diagnose them ...

New Relic surveyed IT and engineering leaders from the media and entertainment (M&E) sector to understand what's working — and where challenges persist with their observability practices. The findings reveal how M&E organizations are navigating rising platform complexity, audience expectations, and AI-driven change. Below are five takeaways that stand out ...

Let me start with something I've seen play out more times than I can count. A team hits a wall with the cloud. Costs creep up, then spike. Performance starts to feel inconsistent. Someone in finance asks a simple question like "why did this double?" and nobody has a clean answer ... Maybe this isn't the right place for everything. That realization feels like a breakthrough, like you've identified the problem. In reality, you've just identified the starting line ...

In MEAN TIME TO INSIGHT Episode 24, Shamus McGillicuddy, VP of Research, Network Infrastructure and Operations, at EMA discusses network observability tool sprawl ... 

In cloud-native systems, scaling is often as simple as moving a slider. For on-premise databases, the stakes are different. Over-provisioning hardware is expensive. Under-provisioning leads to performance bottlenecks that are difficult to fix once the equipment is in the rack ...

When most people think about cybersecurity, they picture firewalls, encryption, and access controls — technical tools designed to protect systems and data. But beneath the technology lies a deeper set of principles about trust, decision-making, and resilience ... The best leaders don't eliminate risk. They manage it intelligently. And in many ways, cybersecurity offers a surprisingly useful playbook for doing exactly that ...

Why Observability Is the Missing Piece in Your Business Growth

Mimi Shalash
Splunk

For years, "observability" has been a backstage function. A quiet force that keeps digital systems running. What once lived deep in the data center is now at the center of every digital strategy. Even back in 2020, Gartner foreshadowed this shift, defining observability as "the evolution of monitoring into a process that offers insight into digital business applications, innovation, and customer experience."

That prediction has become even more relevant in an AI-driven world.

Every digital customer interaction, every cloud deployment, and every AI model depends on the same foundation: the ability to see, understand, and act on data in real time.

Digital moments like a mobile purchase, a supply chain handoff, or an AI inference run through complex layers of infrastructure. When that infrastructure falters, so does the business. Recent data from Splunk confirms that 74% of the business leaders believe observability is essential to monitoring critical business processes, and 66% feel it's key to understanding user journeys.

Because while the unknown is inevitable, observability makes it manageable. Let's explore why.

The Symbiotic Relationship Between AI and Observability

Image
Splunk

AI and observability are now inseparable. Data is the common language between them, and when used together, they amplify each other's strengths. AI helps observability teams detect patterns faster and gives engineers time back to focus on what truly moves the business forward: building better products and improving customer experiences.

Still, most teams aren't there yet. Many ITOps and engineering groups struggle with too many disconnected tools and an overload of false alerts, keeping them in a constant state of reactivity. This is the structural challenge that has persisted across enterprises: fragmented telemetry, inconsistent context, and decentralized standards.

It's no surprise, then, that organizations are turning to AI to correlate signals, reduce noise, and surface what matters most. Because prediction without observability is just speculation, and no business can afford to guess.

Splunk's research shows that 76% of practitioners now use AI regularly in daily operations, and 78% say it gives them more time to focus on innovation instead of maintenance. Yet every advancement brings new complexity. Humans in the loop are now responsible for ensuring model performance, and 47% of observability professionals say monitoring AI workloads has made their jobs more challenging, with 40% citing a lack of expertise as a barrier to AI readiness.

This gap represents a strategic opportunity. Organizations that upskill observability teams to measure AI performance and manage data quality will build a foundation of clean, governed, and trusted data that spans the entire enterprise. That means going beyond traditional IT telemetry to include operational technology (OT), IoT, sensor, and other machine data that power critical business systems.

The convergence of these once disparate data domains represents one of the most transformative opportunities in modern observability. Whether it's connecting insights from the factory floor, ERP systems, or even turbine sensors, organizations can finally uncover cross-functional intelligence that drives predictive action and measurable business outcomes.

Unlocking the Business Catalyst in Your Observability Practice

Realizing that vision requires strengthening the foundations of observability. Let's discuss four:

Minimize War Rooms and Reactivity: Many organizations still default to large, cross-functional escalations that duplicate effort and prolong mean time to resolution (MTTR). In fact, 1 in 5 respondents said they "often" or "always" start a war room that includes various departments. A more effective model emphasizes coordinated isolation and parallel response. When ITOps, engineering, and security teams visualize data through a common lens, they can trace the source of an issue faster and determine ownership. For example, when performance degradation in a key app is detected, shared telemetry allows engineering to see that the latency originates in an overloaded API gateway, not the database or underlying infrastructure. It's a simple example, but one that highlights how service mapping can triage, so people can focus on resolution, not reaction.

Get Alerting Under Control: False alerts drain engineering focus and erode trust in monitoring systems. Mature organizations address this by implementing adaptive thresholding, which dynamically adjusts alert parameters based on historical trends, system baselines, and seasonality. For example, instead of triggering dozens of CPU utilization alerts during routine batch processing every night, adaptive thresholding automatically adjusts expectations based on historical behavior. Managing alert suppression without removing early indicators of degradation is as much about data discipline as it is about process. When thresholds, alerts, and suppression logic are governed transparently and evolve with the environment, organizations build the foundation of data needed for higher levels of maturity and ultimately, AI readiness.

Lay the Foundation for Good Data That Reaps AI Benefits: Nearly half of respondents (48%) cite poor data quality as a barrier to achieving AI readiness. When engineering teams align on common data models, standardized collection practices, and comprehensive data coverage that reflects the full complexity of their environments, they establish consistent, reliable inputs for AI systems. The future belongs to organizations that can aggregate and contextualize all machine data from traditional homegrown applications, commercial off-the-shelf applications to environmental signals like temperature, vibration, and motion. When every data source speaks a common language, AI systems will be the catalyst for a new era of operational intelligence grounded in the full reality of the enterprise.

Embrace Forward Looking Architectures: The next evolution of observability is about building architectures that can adapt as fast as the systems they monitor. Organizations are investing in open and extensible technologies such as OpenTelemetry, code profiling, and observability as code to future-proof their data strategy. These approaches establish portability across environments, reduce vendor dependency, and embed observability into the software delivery lifecycle itself. OpenTelemetry, for example, is quickly becoming the industry standard for collecting, normalizing, and enriching telemetry data across hybrid and multicloud ecosystems. By adopting it early, teams can ensure consistency in how data is defined and exchanged, which sets the stage for complementary frameworks like Machine Communication Protocol (MCP). Together, these standards will be the future of advanced analytics, AI workflows, and autonomous operational systems.

Image
Splunk

Realizing Tangible Business Growth

When organizations take an innovative and responsible approach to observability, they create a foundation of agility and resilience that enables them to thrive through disruption and change. While the pace of innovation accelerates, the anchors of business success remain constant: building exceptional products, elevating customer experiences, and delivering measurable ROI that strengthens the bottom line.

In a world defined by data and driven by AI, observability is no longer just about visibility. It's now about vision.

Mimi Shalash is Observability Advisor at Splunk, a Cisco company

The Latest

Enterprises today operate in a real-time environment where uninterrupted access to trusted data has become a baseline expectation for users, applications and automated systems. Traditional DataOps models, built on manual effort and human triage, cannot keep pace with this always active demand. AI agents are emerging as the operational backbone, ensuring consistent data availability, reinforcing trustworthiness and enabling a level of scale that manual processes cannot achieve ...

For decades, trust in the digital workplace rested on familiar signals. We trusted faces on video calls, voices on the phone, and emails that appeared to come from people we knew. These cues felt human and intuitive. They anchored how decisions were made, approvals were granted, and access was authorized. AI-powered deepfakes have quietly broken that model ...

Cloud migration was supposed to be a one-way door. For most enterprises, it turns out it isn't. Cloud data repatriation is a real and growing trend. A new survey ... finds that 89% of organizations plan to expand their on-premises infrastructure footprint over the next two years — and 75% have already moved at least some workloads back from public cloud in the past 24 months. The findings point to a broad rethinking of where data belongs ...

Over the past few years, large language models (LLMs) have revolutionized the software industry. Given their ability to excel at multi-step reasoning, LLMs have helped enterprises streamline workflows and adapt to the unknown. However, employing such models comes with sky-high costs, latency issues, and limited flexibility. In the realm of IT operations, it is generally wiser to employ smaller, domain-specific models instead ...

For years, DevOps teams operated under a simple assumption: collect enough telemetry, and you can find and fix any problem. That assumption is breaking down. Modern enterprises now operate across microservices, hybrid cloud environments, APIs, Kubernetes, and highly automated delivery pipelines. Releases happen continuously, dependencies shift constantly, and failures spread faster than teams can diagnose them ...

New Relic surveyed IT and engineering leaders from the media and entertainment (M&E) sector to understand what's working — and where challenges persist with their observability practices. The findings reveal how M&E organizations are navigating rising platform complexity, audience expectations, and AI-driven change. Below are five takeaways that stand out ...

Let me start with something I've seen play out more times than I can count. A team hits a wall with the cloud. Costs creep up, then spike. Performance starts to feel inconsistent. Someone in finance asks a simple question like "why did this double?" and nobody has a clean answer ... Maybe this isn't the right place for everything. That realization feels like a breakthrough, like you've identified the problem. In reality, you've just identified the starting line ...

In MEAN TIME TO INSIGHT Episode 24, Shamus McGillicuddy, VP of Research, Network Infrastructure and Operations, at EMA discusses network observability tool sprawl ... 

In cloud-native systems, scaling is often as simple as moving a slider. For on-premise databases, the stakes are different. Over-provisioning hardware is expensive. Under-provisioning leads to performance bottlenecks that are difficult to fix once the equipment is in the rack ...

When most people think about cybersecurity, they picture firewalls, encryption, and access controls — technical tools designed to protect systems and data. But beneath the technology lies a deeper set of principles about trust, decision-making, and resilience ... The best leaders don't eliminate risk. They manage it intelligently. And in many ways, cybersecurity offers a surprisingly useful playbook for doing exactly that ...