Root Cause Analysis: Causal Versus Derived Events
April 15, 2014

Tom Molfetto
SericeNow

Share this

Today’s business landscape is saturated with data. Big Data has become one of the most hyped trends in the tech space, and all indicators point to the reality that this volume of data is only going to grow. IDC estimates that we’ll see a 60% growth in structured and unstructured data annually. Global 2000 organizations are investing billions of dollars into harnessing the power of Big Data to help make it meaningful and actionable. In other words, organizations are spending a ton of money in an effort to translate data into information.

Data – in and of itself – is fairly useless. When data is interpreted, processed and analyzed – when its true meaning is unearthed – it becomes useful and is called information. Thus the race between players like Splunk, QlikView and others to be the first or the best to harness the power of Big Data by translating it into actionable information.

Helping data center personnel and enterprise IT professionals translate their data into information by isolating causal versus derived events is really relevant to businesses these days. In most of my explorations, I have discovered that organizations are using a best-of-breed approach to monitoring, in what has resulted in a sort of Balkanization of the data center. In a common use case: network teams may be using Cisco for monitoring, the database teams use Oracle and web server teams uses Nagios. But nothing ties all of that information together in a unified view. There is no monitor of monitors, or manager of managers, so to speak. Let alone a unified view that goes beyond the IT components and maps them to their associated business services.

So what happens when a LAN port fails, and the app server and database server that both communicate through that LAN port also fail as a result? In that scenario, the LAN port failure is the causal event and the app/database server failures are derived events. By being able to quickly distinguish between the two types of events, and isolate the root cause of the failure, the dependent business services can be restored while minimizing negative impact on overall operations.

Standard monitoring solutions will trigger a bunch of red flags showing failures, but in order to make the map “come alive” it needs to be architected and displayed in a topological format. This is what allows easier assessment of root cause versus derived events, and what contributed to a dramatically reduced Meant-Time-To-Know (MTTK) with regard to diagnosing the underlying issues impacting business services.

Best-of-breed monitoring tools should continue to be leveraged in their respective domains, but the most forward-thinking organizations are unifying these tools from a service-centric perspective to create a monitor of monitors that maps IT components to associated business services, and connects with the best-of-breed solutions to create a complete and up-to-date topology that empowers IT to do their jobs more effectively.

Providing IT with the tools required to interpret data meaningfully and isolate the root cause of problems helps to create an informed perspective from which decisions can be made and responses taken.

Tom Molfetto is Marketing Director for Neebula.

Share this

The Latest

October 20, 2017

You've heard of DevOps and SecOps, but NetOps? NetOps is a natural progression of legacy Network Operations to foster more efficient and resilient infrastructures through automation and intelligence. The efficacy of NetOps personnel is reliant upon understanding five key elements of a NetOps Platform and how to best utilize and implement each ...

October 19, 2017

It's also important to keep the diversity of the Advanced IT Analytics (AIA) landscape in mind as you plan for your investments. AIA is still not a market in the traditional sense. My vision of AIA is rather an arena of fast-growing exploration and invention, in which in-house development is beginning to cede to third-party solutions that can accelerate time to value ...

October 18, 2017

Most application performance monitoring (APM) tools offer user experience monitoring and transaction tracing capabilities. But, when there is infrastructure slowness affecting the application, these APM tools cannot always pinpoint the root cause of problems. This is where unified infrastructure monitoring comes in ...

October 17, 2017

Business transaction monitoring is the approach commonly used to identify and diagnose server-side processing slowness for web applications. While it is an important component of an application performance monitoring strategy, a key question is whether business transaction tracing is sufficient for ensuring peak application performance ...

October 16, 2017
Hurricane season is in full swing. With the latest incoming cases of mega-storms devastating the Southeastern shoreline, communities are struggling to restore daily normalcy. People have been stepping up and showing remarkable strength and leadership in helping those affected. However, there is another area that we need to remember in these trying times – and that is businesses continuity ...
October 12, 2017

Gartner highlighted the top strategic technology trends that will impact most organizations in 2018. The next trends focus on blending the digital and physical worlds to create an immersive, digitally enhanced environment. The last three refer to exploiting connections between an expanding set of people and businesses, as well as devices, content and services to deliver digital business outcomes ...

October 11, 2017

Gartner highlighted the top strategic technology trends that will impact most organizations in 2018. The first three strategic technology trends explore how artificial intelligence (AI) and machine learning are seeping into virtually everything and represent a major battleground for technology providers over the next five years ...

October 10, 2017
This is the sixth in my series of blogs inspired by EMA's AIA buyer's guide — directed at helping IT invest in Advanced IT Analytics (AIA), what the industry more commonly calls "Operational Analytics." In this blog, I examine scenario-related shopping cart objectives for AIA. At EMA, we evaluated seven unique scenarios relevant to AIA adoptions. Our scenarios included agile/DevOps, Integrated security, change impact awareness, capacity optimization, business impact, business alignment and unifying IT ...
October 06, 2017

In the Riverbed Future of Networking Global Survey, more than half of the respondents acknowledged that achieving operational agility is critical to the success of a modern enterprise, and next-generation networks as well as the technology to support them are key to reaching this goal ...

October 05, 2017

Legacy infrastructures are holding back their cloud and digital strategies, according to the Riverbed Future of Networking Global Survey 2017. Nearly all survey respondents agree that legacy network infrastructure will have difficulty keeping pace with the changing demands of the cloud and hybrid networks ...