Skip to main content

Downtime in a Downturn Could Mean Customer Churn

Phil Tee

The last year has been challenging for Tech. Everyone in the industry, from IT and DevOps leaders to field technicians, grapples with recessionary pressures like inflation and rising interest rates in their personal life. And thanks to a never-ending barrage of stories about high-profile layoffs, they are also keenly aware that Tech is experiencing an aggravated downturn.

For many IT leaders, the well-reasoned response to these stories is to locate cost-cutting opportunities in their organization. Ultimately, an economic softening will encourage managers to audit their ITOps tech stack. This is a reasonable first step since the average engineering team manages more than 16 monitoring tools alone.

However, IT leaders must ensure their tool consolidation process is strategic. After all, many solutions are mission-critical — especially during an economic downturn, when hitting key metrics like revenue and availability becomes necessary for business continuity. The best rule of thumb is to consider which tools provide actionable insights and ROI without wasting technicians' time. This benchmark for success allows leaders to cut ties with superfluous solutions and double down on those that map back to critical KPIs like system performance and operational efficiency.

An array of tools purport to maintain availability — the trick is sorting through the noise to find the right one. Let us discuss why availability is so important and then unpack the ROI of deploying Artificial Intelligence for IT Operations (AIOps) during an economic downturn.

Maintaining Availability Has Become More Important Than Ever

Over half the world's GDP (60%) is digitized as of 2019. That means organizations with improper digital infrastructure will repeatedly lose out on revenue opportunities. And in a downturn, revenue-generating opportunities are not simply competitive differentiators — they are the difference between sinking and swimming.

True, revenue is a guiding KPI regardless of macroeconomic conditions. But the recent economic softening has refocused efforts from a "growth at all costs" mindset to a "generate revenue efficiently" perspective. Now, organizations are buckling down to the basics — and providing consumers with a reliable online destination to interact with a brand and its products is downright critical.

That is where availability comes in. Availability is the glue that binds all digital interfaces together. Defined by maximum system performance and uptime, availability is achieved through rigorous behind-the-scenes engineering work. AIOps are an essential part of this equation because these tools reduce an organization's mean time to detect (MTTD) and mean time to recover (MTTR) by simplifying, collating and escalating data errors before they create downtime.

Let us use an example to illustrate the importance of reduced MTTX. If a top broadcast network experiences an outage during a major sporting event, they stand to lose millions of viewers — and, as a result, millions of dollars in ad revenue. But if that broadcast network has deployed AIOps, they can expediently identify the nature of the error (low MTTD) and resolve it within 30 seconds (low MTTR). Compare that resolution to a network without AIOps, which may experience an outage measured in minutes not seconds. This extended outage could immediately cost the network millions of dollars, not to mention millions more in lost customer loyalty and damaged brand reputation.

In an economically fraught environment, the losses associated with such an outage are more likely to become exacerbated. Hence, maintaining availability is not a luxury but a necessity.

AIOps Goes Beyond Simple Event Management

Availability, uptime and system performance are leading DevOps concerns. Consequently, many vendors advertise that their monitoring tool can improve these vectors in isolation, but this is not so. Monitoring tools are foundational for a tech stack, but they are fundamentally incapable of identifying and escalating data errors across all telemetry points. Only AIOps solutions that ingest disparate data from all devices, networks and tools will provide a complete overhead of the incident lifecycle. Furthermore, top AIOps solutions rely on machine learning (ML) to grow with their system and fill contextual gaps.

AIOps tools are superior to point solutions because their AI-based algorithms can parse thousands of incidents to determine which are relevant. Consider that any data state change creates an incident, yet data is inherently ephemeral, and only a select few changes indicate an actual system error. AIOps reduce the time technicians spend combing over data by eradicating non-harmful events and escalating the rest to the appropriate party — all with minimal supervision.

And when technicians need to step in, AIOps-based systems provide them with context-rich event tickets that explain the data issue in detail. This provides ample time for technicians to address the problem and return to revenue-generating responsibilities like improving the user experience (UX) and driving down technical debt. During an economic softening, the ROI here is even more apparent, especially given the extended tech talent crunch that continues to leave IT and DevOps teams struggling to fill labor-related gaps.

Of course, budget cuts and hiring freezes are only natural responses to concerns about fluctuations in economic stability. But IT and DevOps leaders should carefully consider the ROI behind each solution they cut — and adopt — during an economic softening.

For example, does a solution of interest provide excess data to interpret, or does it also understand and act on that data?

Does a solution reduce monotonous labor needs?

And, most importantly, does it provide revenue-generating opportunities like increased uptime and availability?

This line of questioning will ultimately demonstrate that certain tools are unnecessary during an economic downturn while others are more critical than ever. But, in general, leaders should treat availability as their guiding light when auditing their tech stack. Doing so will leave their organization better positioned to excel in the months ahead.

The Latest

In live financial environments, capital markets software cannot pause for rebuilds. New capabilities are introduced as stacked technology layers to meet evolving demands while systems remain active, data keeps moving, and controls stay intact. AI is no exception, and its opportunities are significant: accelerated decision cycles, compressed manual workflows, and more effective operations across complex environments. The constraint isn't the models themselves, but the architectural environments they enter ...

Like most digital transformation shifts, organizations often prioritize productivity and leave security and observability to keep pace. This usually translates to both the mass implementation of new technology and fragmented monitoring and observability (M&O) tooling. In the era of AI and varied cloud architecture, a disparate observability function can be dangerous. IT teams will lack a complete picture of their IT environment, making it harder to diagnose issues while slowing down mean time to resolve (MTTR). In fact, according to recent data from the SolarWinds State of Monitoring & Observability Report, 77% of IT personnel said the lack of visibility across their on-prem and cloud architecture was an issue ...

In MEAN TIME TO INSIGHT Episode 23, Shamus McGillicuddy, VP of Research, Network Infrastructure and Operations, at EMA discusses the NetOps labor shortage ... 

Technology management is evolving, and in turn, so is the scope of FinOps. The FinOps Foundation recently updated their mission statement from "advancing the people who manage the value of cloud" to "advancing the people who manage the value of technology." This seemingly small change solidifies a larger evolution: FinOps practitioners have organically expanded to be focused on more than just cloud cost optimization. Today, FinOps teams are largely — and quickly — expanding their job descriptions, evolving into a critical function for managing the full value of technology ...

Enterprises are under pressure to scale AI quickly. Yet despite considerable investment, adoption continues to stall. One of the most overlooked reasons is vendor sprawl ... In reality, no organization deliberately sets out to create sprawling vendor ecosystems. More often, complexity accumulates over time through well-intentioned initiatives, such as enterprise-wide digital transformation efforts, point solutions, or decentralized sourcing strategies ...

Nearly every conversation about AI eventually circles back to compute. GPUs dominate the headlines while cloud platforms compete for workloads and model benchmarks drive investment decisions. But underneath that noise, a quieter infrastructure challenge is taking shape. The real bottleneck in enterprise AI is not processing power, it is the ability to store, manage and retrieve the relentless volumes of data that AI systems generate, consume and multiply ...

The 2026 Observability Survey from Grafana Labs paints a vivid picture of an industry maturing fast, where AI is welcomed with careful conditions, SaaS economics are reshaping spending decisions, complexity remains a defining challenge, and open standards continue to underpin it all ...

The observability industry has an evolving relationship with AI. We're not skeptics, but it's clear that trust in AI must be earned ... In Grafana Labs' annual Observability Survey, 92% said they see real value in AI surfacing anomalies before they cause downtime. Another 91% endorsed AI for forecasting and root cause analysis. So while the demand is there, customers need it to be trustworthy, as the survey also found that the practitioners most enthusiastic about AI are also the most insistent on explainability ...

In the modern enterprise, the conversation around AI has moved past skepticism toward a stage of active adoption. According to our 2026 State of IT Trends Report: The Human Side of Autonomous AI, nearly 90% of IT professionals view AI as a net positive, and this optimism is well-founded. We are seeing agentic AI move beyond simple automation to actively streamlining complex data insights and eliminating the manual toil that has long hindered innovation. However, as we integrate these autonomous agents into our ecosystems, the fundamental DNA of the IT role is evolving ...

AI workloads require an enormous amount of computing power ... What's also becoming abundantly clear is just how quickly AI's computing needs are leading to enterprise systems failure. According to Cockroach Labs' State of AI Infrastructure 2026 report, enterprise systems are much closer to failure than their organizations realize. The report ... suggests AI scale could cause widespread failures in as little as one year — making it a clear risk for business performance and reliability.

Downtime in a Downturn Could Mean Customer Churn

Phil Tee

The last year has been challenging for Tech. Everyone in the industry, from IT and DevOps leaders to field technicians, grapples with recessionary pressures like inflation and rising interest rates in their personal life. And thanks to a never-ending barrage of stories about high-profile layoffs, they are also keenly aware that Tech is experiencing an aggravated downturn.

For many IT leaders, the well-reasoned response to these stories is to locate cost-cutting opportunities in their organization. Ultimately, an economic softening will encourage managers to audit their ITOps tech stack. This is a reasonable first step since the average engineering team manages more than 16 monitoring tools alone.

However, IT leaders must ensure their tool consolidation process is strategic. After all, many solutions are mission-critical — especially during an economic downturn, when hitting key metrics like revenue and availability becomes necessary for business continuity. The best rule of thumb is to consider which tools provide actionable insights and ROI without wasting technicians' time. This benchmark for success allows leaders to cut ties with superfluous solutions and double down on those that map back to critical KPIs like system performance and operational efficiency.

An array of tools purport to maintain availability — the trick is sorting through the noise to find the right one. Let us discuss why availability is so important and then unpack the ROI of deploying Artificial Intelligence for IT Operations (AIOps) during an economic downturn.

Maintaining Availability Has Become More Important Than Ever

Over half the world's GDP (60%) is digitized as of 2019. That means organizations with improper digital infrastructure will repeatedly lose out on revenue opportunities. And in a downturn, revenue-generating opportunities are not simply competitive differentiators — they are the difference between sinking and swimming.

True, revenue is a guiding KPI regardless of macroeconomic conditions. But the recent economic softening has refocused efforts from a "growth at all costs" mindset to a "generate revenue efficiently" perspective. Now, organizations are buckling down to the basics — and providing consumers with a reliable online destination to interact with a brand and its products is downright critical.

That is where availability comes in. Availability is the glue that binds all digital interfaces together. Defined by maximum system performance and uptime, availability is achieved through rigorous behind-the-scenes engineering work. AIOps are an essential part of this equation because these tools reduce an organization's mean time to detect (MTTD) and mean time to recover (MTTR) by simplifying, collating and escalating data errors before they create downtime.

Let us use an example to illustrate the importance of reduced MTTX. If a top broadcast network experiences an outage during a major sporting event, they stand to lose millions of viewers — and, as a result, millions of dollars in ad revenue. But if that broadcast network has deployed AIOps, they can expediently identify the nature of the error (low MTTD) and resolve it within 30 seconds (low MTTR). Compare that resolution to a network without AIOps, which may experience an outage measured in minutes not seconds. This extended outage could immediately cost the network millions of dollars, not to mention millions more in lost customer loyalty and damaged brand reputation.

In an economically fraught environment, the losses associated with such an outage are more likely to become exacerbated. Hence, maintaining availability is not a luxury but a necessity.

AIOps Goes Beyond Simple Event Management

Availability, uptime and system performance are leading DevOps concerns. Consequently, many vendors advertise that their monitoring tool can improve these vectors in isolation, but this is not so. Monitoring tools are foundational for a tech stack, but they are fundamentally incapable of identifying and escalating data errors across all telemetry points. Only AIOps solutions that ingest disparate data from all devices, networks and tools will provide a complete overhead of the incident lifecycle. Furthermore, top AIOps solutions rely on machine learning (ML) to grow with their system and fill contextual gaps.

AIOps tools are superior to point solutions because their AI-based algorithms can parse thousands of incidents to determine which are relevant. Consider that any data state change creates an incident, yet data is inherently ephemeral, and only a select few changes indicate an actual system error. AIOps reduce the time technicians spend combing over data by eradicating non-harmful events and escalating the rest to the appropriate party — all with minimal supervision.

And when technicians need to step in, AIOps-based systems provide them with context-rich event tickets that explain the data issue in detail. This provides ample time for technicians to address the problem and return to revenue-generating responsibilities like improving the user experience (UX) and driving down technical debt. During an economic softening, the ROI here is even more apparent, especially given the extended tech talent crunch that continues to leave IT and DevOps teams struggling to fill labor-related gaps.

Of course, budget cuts and hiring freezes are only natural responses to concerns about fluctuations in economic stability. But IT and DevOps leaders should carefully consider the ROI behind each solution they cut — and adopt — during an economic softening.

For example, does a solution of interest provide excess data to interpret, or does it also understand and act on that data?

Does a solution reduce monotonous labor needs?

And, most importantly, does it provide revenue-generating opportunities like increased uptime and availability?

This line of questioning will ultimately demonstrate that certain tools are unnecessary during an economic downturn while others are more critical than ever. But, in general, leaders should treat availability as their guiding light when auditing their tech stack. Doing so will leave their organization better positioned to excel in the months ahead.

The Latest

In live financial environments, capital markets software cannot pause for rebuilds. New capabilities are introduced as stacked technology layers to meet evolving demands while systems remain active, data keeps moving, and controls stay intact. AI is no exception, and its opportunities are significant: accelerated decision cycles, compressed manual workflows, and more effective operations across complex environments. The constraint isn't the models themselves, but the architectural environments they enter ...

Like most digital transformation shifts, organizations often prioritize productivity and leave security and observability to keep pace. This usually translates to both the mass implementation of new technology and fragmented monitoring and observability (M&O) tooling. In the era of AI and varied cloud architecture, a disparate observability function can be dangerous. IT teams will lack a complete picture of their IT environment, making it harder to diagnose issues while slowing down mean time to resolve (MTTR). In fact, according to recent data from the SolarWinds State of Monitoring & Observability Report, 77% of IT personnel said the lack of visibility across their on-prem and cloud architecture was an issue ...

In MEAN TIME TO INSIGHT Episode 23, Shamus McGillicuddy, VP of Research, Network Infrastructure and Operations, at EMA discusses the NetOps labor shortage ... 

Technology management is evolving, and in turn, so is the scope of FinOps. The FinOps Foundation recently updated their mission statement from "advancing the people who manage the value of cloud" to "advancing the people who manage the value of technology." This seemingly small change solidifies a larger evolution: FinOps practitioners have organically expanded to be focused on more than just cloud cost optimization. Today, FinOps teams are largely — and quickly — expanding their job descriptions, evolving into a critical function for managing the full value of technology ...

Enterprises are under pressure to scale AI quickly. Yet despite considerable investment, adoption continues to stall. One of the most overlooked reasons is vendor sprawl ... In reality, no organization deliberately sets out to create sprawling vendor ecosystems. More often, complexity accumulates over time through well-intentioned initiatives, such as enterprise-wide digital transformation efforts, point solutions, or decentralized sourcing strategies ...

Nearly every conversation about AI eventually circles back to compute. GPUs dominate the headlines while cloud platforms compete for workloads and model benchmarks drive investment decisions. But underneath that noise, a quieter infrastructure challenge is taking shape. The real bottleneck in enterprise AI is not processing power, it is the ability to store, manage and retrieve the relentless volumes of data that AI systems generate, consume and multiply ...

The 2026 Observability Survey from Grafana Labs paints a vivid picture of an industry maturing fast, where AI is welcomed with careful conditions, SaaS economics are reshaping spending decisions, complexity remains a defining challenge, and open standards continue to underpin it all ...

The observability industry has an evolving relationship with AI. We're not skeptics, but it's clear that trust in AI must be earned ... In Grafana Labs' annual Observability Survey, 92% said they see real value in AI surfacing anomalies before they cause downtime. Another 91% endorsed AI for forecasting and root cause analysis. So while the demand is there, customers need it to be trustworthy, as the survey also found that the practitioners most enthusiastic about AI are also the most insistent on explainability ...

In the modern enterprise, the conversation around AI has moved past skepticism toward a stage of active adoption. According to our 2026 State of IT Trends Report: The Human Side of Autonomous AI, nearly 90% of IT professionals view AI as a net positive, and this optimism is well-founded. We are seeing agentic AI move beyond simple automation to actively streamlining complex data insights and eliminating the manual toil that has long hindered innovation. However, as we integrate these autonomous agents into our ecosystems, the fundamental DNA of the IT role is evolving ...

AI workloads require an enormous amount of computing power ... What's also becoming abundantly clear is just how quickly AI's computing needs are leading to enterprise systems failure. According to Cockroach Labs' State of AI Infrastructure 2026 report, enterprise systems are much closer to failure than their organizations realize. The report ... suggests AI scale could cause widespread failures in as little as one year — making it a clear risk for business performance and reliability.