IT Monitoring Paradox: Let's Step Outside the Bubble!
March 26, 2014

David Hayward
CA Technologies

The world is full of paradoxes. To solve them, you have to look at the facts in a different, even nonconventional way. You have to step outside your bubble.

One of the earliest paradoxes is from the ancient Greek thinker Heraclitus. It goes like this: "You cannot step into the same river twice." As Roy Sorensen says in A Brief History of the Paradox, "[Heraclitus] means that you cannot step twice into the same water of a river. There is one river, but many distinct bodies of water flow through it. Heraclitus urges a balance between experience and reason."

Paradoxes are fun to solve, but in real life they can be serious. IT Operations faces paradoxes too, and one in particular, day in and day out. Recently, an IT manager at a FORTUNE 1000 company (let's call him "Joe") told me that he was called into the distribution center VP's office. The center was at a standstill. Joe showed him that the IT systems supporting the center were meeting all their Service Level Agreements: servers, applications, databases, storage, routers, switches, the whole lot. And the VP's response? "So what. I can't ship anything."

That's when the light bulb went on in Joe's head. All the IT technologies that underlie the distribution center were running fine, but the center itself was not. IT Operations needed another way to look at things, so it could understand the IT environment's status in terms of its impact on the business, not just in terms of how this or that technology silo was behaving.

Like Heraclitus and the river, Joe needed to strike a balance between experience and reason. Joe had plenty of experience (reams of performance monitoring data and proof of SLA compliance for each technology domain) but no way to reason about, or monitor, the distribution center business process itself.

Joe started thinking of ITIL, the framework for orienting IT with services, not technologies, in mind. The trouble is, IT operates in a bubble. In fact, lots of bubbles: siloed teams and siloed tools, each separately monitoring servers, applications, storage, databases, routers, switches, etc. No one was monitoring the big picture outside the bubbles. IT Operations Level 1 (the "first line of defense") was looking at a sea of monitoring screens, events and alerts about technology devices and circuits, and had little or no understanding of how those events and alerts impacted specific business processes.

So even when IT was meeting SLA objectives in each silo, little degradations (i.e., incidents) across silos were adding up and impacting different services (i.e., processes and user experience) in different ways. This was undetectable because there wasn't any way to associate all those incidents with specific business services: there was no operational view, and no real-time IT operational analytics, of business processes across silos.
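To make the point concrete, here is a minimal sketch of how per-silo incidents could be rolled up by business service. Everything in it is a hypothetical illustration, not something from Joe's environment: the component names, the dependency map, the 10% per-silo limit, the 80% service floor, and the choice to multiply degradations together are all assumptions.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Incident:
    component: str      # hypothetical component name, e.g. "db-01"
    silo: str           # technology silo that owns the component
    degradation: float  # fraction of capacity lost, 0.0 to 1.0


# Assumed mapping of business services to the components they depend on.
SERVICE_DEPENDENCIES = {
    "order-shipping": ["app-01", "db-01", "switch-07"],
    "payroll": ["app-02", "db-01"],
}

SILO_SLA_LIMIT = 0.10        # assumed: each silo tolerates up to 10% degradation
SERVICE_HEALTH_FLOOR = 0.80  # assumed: the business needs at least 80% health


def service_health(service: str, incidents: list[Incident]) -> float:
    """Combine the degradations of every component a service depends on.

    Assumes degradations compound multiplicatively along the dependency
    chain, which is a simplification chosen for illustration.
    """
    dependencies = set(SERVICE_DEPENDENCIES[service])
    health = 1.0
    for incident in incidents:
        if incident.component in dependencies:
            health *= 1.0 - incident.degradation
    return health


incidents = [
    Incident("app-01", "applications", 0.08),
    Incident("db-01", "databases", 0.09),
    Incident("switch-07", "network", 0.09),
]

# Every silo, viewed on its own, is within its SLA limit.
silos_ok = all(i.degradation <= SILO_SLA_LIMIT for i in incidents)

# Viewed per business service, the picture is different.
report = {
    name: service_health(name, incidents) >= SERVICE_HEALTH_FLOOR
    for name in SERVICE_DEPENDENCIES
}

if __name__ == "__main__":
    print("All silos within SLA:", silos_ok)
    for name, healthy in report.items():
        status = "OK" if healthy else "DEGRADED"
        print(f"{name}: {status} ({service_health(name, incidents):.2f})")
```

In this sketch each silo reports compliance, yet the hypothetical "order-shipping" service falls below its floor because three small degradations land on the same dependency chain, while "payroll" is touched by only one of them and stays healthy.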

This is typical. As an analyst from a leading IT research firm recently told me: "The industry has been trying to solve this problem for decades. It sounds old, but we keep coming back to the same paradox over and over again."

Joe and others like him have embarked on a mission to transform IT Operations from a team that monitors technology alone into a team that monitors the reliability of business services. They are transforming operations because either they'll crack the paradox of managing the services they deliver to their business, or the business will outsource operations to someone who can.

Transformation doesn't happen overnight. As the ITIL mantra teaches us, it takes "people, processes and technology" to get IT properly focused on the business and its services. To start, you need to step outside your bubble.

David Hayward is Senior Principal Manager, Solutions Marketing at CA Technologies.
