IT Monitoring Paradox: Let's Step Outside the Bubble!
March 26, 2014

David Hayward
CA Technologies

Share this

The world is full of paradoxes. To solve them, you have to look at the facts in a different, even nonconventional way. You have to step outside your bubble.

One of the earliest paradoxes is from the ancient Greek thinker Heraclitus. It goes like this: "You cannot step into the same river twice." As Roy Sorenson says in A Brief History of the Paradox, "[Heraclitus] means that you cannot step twice into the same water of a river. There is one river, but many distinct bodies of water flow through it. Heraclitus urges a balance between experience and reason."

Paradoxes are fun to solve, but real-life they can be serious. IT Operations faces paradoxes too, and one in particular day in and day out. Recently, an IT Manager in a FORTUNE 1000 company — let's call him "Joe" — told me that he was called into the distribution center VP's office. The center was at a standstill. Joe showed him that the IT systems supporting the center were meeting all their Service Level Agreements: servers, applications, databases, storage, routers, switches — the whole lot. And the VP's response? "So what. I can’t ship anything."

That's when the light bulb went off in Joe's head. All the IT technologies that underlie the distribution center were running fine, but not the center itself. IT Operations needed another way to look at things, so he could understand the IT environment's status in terms of its impact on the business, not just in terms of how this or that technology silo was behaving.

Like Heraclitus and the river, Joe needed to strike a balance between experience and reason. Joe had plenty of experience — reams of performance monitoring data and proof of SLA compliance for each technology domain — but no way to reason, or monitor, the distribution center business process itself.

Joe started thinking of ITIL — the framework for orienting IT with services, not technologies, in mind. The trouble is, IT operates in a bubble. In fact, lots of bubbles: silo’d teams, silo’d tools, each separately monitoring servers, applications, storage, databases, routers, switches, etc.. No one was monitoring the big picture outside the bubbles. IT Operations Level 1 (the “first line of defense”) was looking at a sea of monitoring screens, events and alerts about technology devices and circuits, and had little or no understanding about how those events and alerts impacted specific business processes.

So even when IT was meeting SLA objectives in each silo, little degradations (i.e., incidents) across silos were adding up and impacting different services (i.e., processes and user experience) in different ways. This was undetectable because there wasn't any way to way to associate all those incidents with specific business services: no operational view and real-time IT operational analytics of business processes across silos.

This is typical. As an analyst from a leading IT research firm recently told me: "The Industry has been trying to solve this problem for decades. It sounds old, but we keep coming back to the same paradox over and over again."

Joe and others like him have embarked on a mission to transform IT Operations from a purely technology monitoring team to a business service reliability monitoring team. They are transforming operations because either they’ll crack the paradox of managing services that they deliver to their business, or the business will outsource operations to someone who can.

Transformation doesn't happen overnight. As the ITIL mantra teaches us, it takes "people, processes and technology" to get IT properly focused on the business and its services. To start, you need to step outside your bubble.

David Hayward is Senior Principal Manager, Solutions Marketing at CA Technologies.

Share this

The Latest

March 16, 2018

The State of the Mainframe report from Syncsort revealed an increased focus on traditional data infrastructure optimization to control costs and help fund strategic organizational projects like AI, machine learning and predictive analytics in addition to widespread concern about meeting security and compliance requirements ...

March 15, 2018

The 2018 Software Fail Watch report from Tricentis investigated 606 failures that affected over 3.6 billion people and caused $1.7 trillion in lost revenue ...

March 14, 2018

Gartner predicts there will be nearly 21 billion connected “things” in use worldwide by 2020 – impressive numbers that should catch the attention of every CIO. IT leaders in nearly every vertical market will soon be inundated with the management of both the data from these devices as well as the management of the devices themselves, each of which require the same lifecycle management as any other IT equipment. This can be an overwhelming realization for CIOs who don’t have an adequate configuration management strategy for their current IT environments, the foundation upon which all future digital strategies – Internet-connected or otherwise – will be built ...

March 13, 2018

Many network operations teams question if they need to TAP their networks; perhaps they aren't familiar with test access points (TAPs), or they think there isn't an application that makes sense for them. Over the past decade, industry best-practice revealed that all network infrastructure should utilize a network TAP as the foundation for complete visibility. The following are the seven most popular applications for TAPs ...

March 12, 2018

Organizations are eager to adopt cloud based architectures in an effort to support their digital transformation efforts, drive efficiencies and strengthen customer satisfaction, according to a new online cloud usage survey conducted by Denodo ...

March 09, 2018

Globally, cloud data center traffic will represent 95 percent of total data center traffic by 2021, compared to 88 percent in 2016, according to the Cisco Global Cloud Index (2016-2021) ...

March 08, 2018

Enterprise cloud spending will grow rapidly over the next year, and yet 35 percent of cloud spend is wasted, according to The RightScale 2018 State of the Cloud Survey ...

March 07, 2018

What often goes overlooked in our always-on digital culture are the people at the other end of each of these services tasked with their 24/7 management. If something goes wrong, users are quick to complain or switch to a competitor as IT practitioners on the backend race to rectify the situation. A recent PagerDuty State of IT Work-Life Balance Report revealed that IT professionals are struggling with the pressures associated with the management of these digital offerings ...

March 06, 2018

Businesses everywhere continually strive for greater efficiency. By way of illustration, more than a third of IT professionals cite "moving faster" as their top goal for 2018, and improving the efficiency of operations was one of the top three stated business objectives for organizations considering digital transformation initiatives ...

March 05, 2018

One of the current challenges for IT teams is the movement of the network to the cloud, and the lack of visibility that comes with that shift. While there has been a lot of hype around the benefits of cloud computing, very little is being said about the inherent drawbacks ...