The last year has been challenging for Tech. Everyone in the industry, from IT and DevOps leaders to field technicians, grapples with recessionary pressures like inflation and rising interest rates in their personal life. And thanks to a never-ending barrage of stories about high-profile layoffs, they are also keenly aware that Tech is experiencing an aggravated downturn.
For many IT leaders, the well-reasoned response to these stories is to locate cost-cutting opportunities in their organization. Ultimately, an economic softening will encourage managers to audit their ITOps tech stack. This is a reasonable first step since the average engineering team manages more than 16 monitoring tools alone.
However, IT leaders must ensure their tool consolidation process is strategic. After all, many solutions are mission-critical — especially during an economic downturn, when hitting key metrics like revenue and availability becomes necessary for business continuity. The best rule of thumb is to consider which tools provide actionable insights and ROI without wasting technicians' time. This benchmark for success allows leaders to cut ties with superfluous solutions and double down on those that map back to critical KPIs like system performance and operational efficiency.
An array of tools purport to maintain availability — the trick is sorting through the noise to find the right one. Let us discuss why availability is so important and then unpack the ROI of deploying Artificial Intelligence for IT Operations (AIOps) during an economic downturn.
Maintaining Availability Has Become More Important Than Ever
Over half the world's GDP (60%) is digitized as of 2019. That means organizations with improper digital infrastructure will repeatedly lose out on revenue opportunities. And in a downturn, revenue-generating opportunities are not simply competitive differentiators — they are the difference between sinking and swimming.
True, revenue is a guiding KPI regardless of macroeconomic conditions. But the recent economic softening has refocused efforts from a "growth at all costs" mindset to a "generate revenue efficiently" perspective. Now, organizations are buckling down to the basics — and providing consumers with a reliable online destination to interact with a brand and its products is downright critical.
That is where availability comes in. Availability is the glue that binds all digital interfaces together. Defined by maximum system performance and uptime, availability is achieved through rigorous behind-the-scenes engineering work. AIOps are an essential part of this equation because these tools reduce an organization's mean time to detect (MTTD) and mean time to recover (MTTR) by simplifying, collating and escalating data errors before they create downtime.
Let us use an example to illustrate the importance of reduced MTTX. If a top broadcast network experiences an outage during a major sporting event, they stand to lose millions of viewers — and, as a result, millions of dollars in ad revenue. But if that broadcast network has deployed AIOps, they can expediently identify the nature of the error (low MTTD) and resolve it within 30 seconds (low MTTR). Compare that resolution to a network without AIOps, which may experience an outage measured in minutes not seconds. This extended outage could immediately cost the network millions of dollars, not to mention millions more in lost customer loyalty and damaged brand reputation.
In an economically fraught environment, the losses associated with such an outage are more likely to become exacerbated. Hence, maintaining availability is not a luxury but a necessity.
AIOps Goes Beyond Simple Event Management
Availability, uptime and system performance are leading DevOps concerns. Consequently, many vendors advertise that their monitoring tool can improve these vectors in isolation, but this is not so. Monitoring tools are foundational for a tech stack, but they are fundamentally incapable of identifying and escalating data errors across all telemetry points. Only AIOps solutions that ingest disparate data from all devices, networks and tools will provide a complete overhead of the incident lifecycle. Furthermore, top AIOps solutions rely on machine learning (ML) to grow with their system and fill contextual gaps.
AIOps tools are superior to point solutions because their AI-based algorithms can parse thousands of incidents to determine which are relevant. Consider that any data state change creates an incident, yet data is inherently ephemeral, and only a select few changes indicate an actual system error. AIOps reduce the time technicians spend combing over data by eradicating non-harmful events and escalating the rest to the appropriate party — all with minimal supervision.
And when technicians need to step in, AIOps-based systems provide them with context-rich event tickets that explain the data issue in detail. This provides ample time for technicians to address the problem and return to revenue-generating responsibilities like improving the user experience (UX) and driving down technical debt. During an economic softening, the ROI here is even more apparent, especially given the extended tech talent crunch that continues to leave IT and DevOps teams struggling to fill labor-related gaps.
Of course, budget cuts and hiring freezes are only natural responses to concerns about fluctuations in economic stability. But IT and DevOps leaders should carefully consider the ROI behind each solution they cut — and adopt — during an economic softening.
For example, does a solution of interest provide excess data to interpret, or does it also understand and act on that data?
Does a solution reduce monotonous labor needs?
And, most importantly, does it provide revenue-generating opportunities like increased uptime and availability?
This line of questioning will ultimately demonstrate that certain tools are unnecessary during an economic downturn while others are more critical than ever. But, in general, leaders should treat availability as their guiding light when auditing their tech stack. Doing so will leave their organization better positioned to excel in the months ahead.
The Latest
APMdigest and leading IT research firm Enterprise Management Associates (EMA) are partnering to bring you the EMA-APMdigest Podcast, a new podcast focused on the latest technologies impacting IT Operations. In Episode 2 - Part 1 Pete Goldin, Editor and Publisher of APMdigest, discusses Network Observability with Shamus McGillicuddy, Vice President of Research, Network Infrastructure and Operations, at EMA ...
CIOs have stepped into the role of digital leader and strategic advisor, according to the 2023 Global CIO Survey from Logicalis ...
Synthetic monitoring is crucial to deploy code with confidence as catching bugs with E2E tests on staging is becoming increasingly difficult. It isn't trivial to provide realistic staging systems, especially because today's apps are intertwined with many third-party APIs ...
Recent EMA field research found that ServiceOps is either an active effort or a formal initiative in 78% of the organizations represented by a global panel of 400+ IT leaders. It is relatively early but gaining momentum across industries and organizations of all sizes globally ...
Managing availability and performance within SAP environments has long been a challenge for IT teams. But as IT environments grow more complex and dynamic, and the speed of innovation in almost every industry continues to accelerate, this situation is becoming a whole lot worse ...
Harnessing the power of network-derived intelligence and insights is critical in detecting today's increasingly sophisticated security threats across hybrid and multi-cloud infrastructure, according to a new research study from IDC ...
Recent research suggests that many organizations are paying for more software than they need. If organizations are looking to reduce IT spend, leaders should take a closer look at the tools being offered to employees, as not all software is essential ...
Organizations are challenged by tool sprawl and data source overload, according to the Grafana Labs Observability Survey 2023, with 52% of respondents reporting that their companies use 6 or more observability tools, including 11% that use 16 or more.
An array of tools purport to maintain availability — the trick is sorting through the noise to find the right one. Let us discuss why availability is so important and then unpack the ROI of deploying Artificial Intelligence for IT Operations (AIOps) during an economic downturn ...
Development teams so often find themselves rushing to get a release out on time. When it comes time for testing, the software works fine in the lab. But, when it's released, customers report a bunch of bugs. How does this happen? Why weren't the flaws found in QA? ...