Event Management: Reactive, Proactive or Predictive?
August 01, 2012
Larry Dragich

Can event management foster curiosity about innovative ways to improve application performance? Blue-sky thinkers may not want to deal with the myriad details of managing operationally generated events, but they could learn something from the exercise.

Consider the major system failures in your organization over the last 12 to 18 months. What if you had a system or process in place to capture those failures and proactively mitigate them, preventing them from recurring? How much better off would you be if you could avoid the proverbial “Groundhog Day” of system outages? The argument that system monitoring is just a nice-to-have, and not really a core requirement for operational readiness, dissipates quickly when a critical application goes down with no warning.

Starting with the Event Management and Incident Management processes may seem like a reactive approach when implementing an Application Performance Management (APM) solution, but is it really? If “Rome is burning,” wouldn’t the most prudent action be to extinguish the fire first and then come up with a proactive approach for prevention? Managing the operational noise calms the environment, allowing you to focus on APM strategy more effectively.

Asking the right questions during a post-mortem review will help generate dialog and outline options for alerting and prevention. This directs your thinking toward continual improvement and helps galvanize proactive monitoring as an operational requirement.

Here are three questions that build on each other as you work to mature your solution:

1. Did we alert on it when it went down, or did the user community call us?

2. Can we get a proactive alert on it before it goes down (e.g., a power supply failure in a server with dual power supplies)?

3. Can we trend on the event to create a predictive alert before it escalates (e.g., disk space utilization triggering a minor alert at 90%, a major alert at 95%, and a critical alert at 98%)?

The preceding questions correspond, respectively, to the following categories: Reactive, Proactive, and Predictive.
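The disk space example in question 3 can be sketched as a simple threshold lookup. This is a minimal illustration, not a feature of any particular monitoring product: the function name `classify_disk_usage` and the `THRESHOLDS` table are hypothetical, and only the 90/95/98 percent levels come from the example above.

```python
from typing import Optional

# Severity thresholds taken from the example above (percent of disk used).
# Ordered from most to least severe so that the first match wins.
# The names here are hypothetical, chosen for illustration only.
THRESHOLDS = [
    (98.0, "critical"),
    (95.0, "major"),
    (90.0, "minor"),
]


def classify_disk_usage(percent_used: float) -> Optional[str]:
    """Return the alert severity for a reading, or None if it is below every threshold."""
    for limit, severity in THRESHOLDS:
        if percent_used >= limit:
            return severity
    return None


if __name__ == "__main__":
    for reading in (72.0, 91.5, 96.0, 99.2):
        print(reading, classify_disk_usage(reading))
```

Keeping the levels in one table means the escalation points can be tuned per system without touching the classification logic.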

Reactive – Alerts that Occur at Failure

Multiple events can occur before a system failure; eventually an alert will come in notifying you that an application is down. The alert will come either from users calling the Service Desk to report an issue or from the system itself in response to the application failure.

Proactive – Alerts that Occur Before Failure

These alerts will most likely come from proactive monitoring and tell you that component failures need attention but have not yet affected overall application availability (e.g., a power supply failure in a server with dual power supplies).
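The distinction between a proactive and a reactive alert can be illustrated with the power supply example. The sketch below assumes a hypothetical `PowerSupply` record and an `evaluate_power` function; neither belongs to a real monitoring API. The point is only that losing redundancy is reported while the server is still running.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class PowerSupply:
    """Hypothetical status record for one power supply in a server."""
    name: str
    healthy: bool


def evaluate_power(supplies: List[PowerSupply]) -> str:
    """Classify the power state of a server with redundant supplies."""
    healthy = sum(1 for s in supplies if s.healthy)
    if healthy == 0:
        # Nothing is powering the server: the alert arrives at failure.
        return "reactive: server is down"
    if healthy < len(supplies):
        # A component has failed, but availability is not yet affected.
        return "proactive: redundancy lost, server still up"
    return "ok"


if __name__ == "__main__":
    status = evaluate_power([PowerSupply("PSU1", True), PowerSupply("PSU2", False)])
    print(status)
```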

Predictive – Alerts that Trend on a Possible Failure

These alerts are usually set up in parallel with trending reports that help reveal subtle changes in the environment (e.g., trending on memory usage or disk utilization before resources run out).
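One simple way to trend on a metric is to fit a straight line to recent readings and project when it will reach a threshold. The sketch below assumes one reading per day and a roughly linear growth pattern, which real workloads may not follow; the function name `days_until_threshold` is hypothetical, and production tools typically use more robust forecasting.

```python
from typing import Optional, Sequence


def days_until_threshold(daily_usage: Sequence[float], threshold: float) -> Optional[float]:
    """Project how many days after the last reading usage will reach the threshold.

    Fits an ordinary least-squares line to the readings (one per day).
    Returns 0.0 if the last reading is already at or above the threshold,
    and None if there are fewer than two readings or no upward trend.
    """
    n = len(daily_usage)
    if n < 2:
        return None
    if daily_usage[-1] >= threshold:
        return 0.0

    # Least-squares slope and intercept, using day indices 0..n-1 as x values.
    mean_x = (n - 1) / 2
    mean_y = sum(daily_usage) / n
    sxx = sum((x - mean_x) ** 2 for x in range(n))
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in enumerate(daily_usage))
    slope = sxy / sxx
    if slope <= 0:
        return None  # flat or shrinking usage: nothing to predict

    intercept = mean_y - slope * mean_x
    crossing_day = (threshold - intercept) / slope
    return max(0.0, crossing_day - (n - 1))


if __name__ == "__main__":
    readings = [70.0, 72.0, 74.0, 76.0, 78.0]  # percent of disk used, one per day
    print(days_until_threshold(readings, 90.0))
```

A projection like this could feed a predictive alert, for example raising a warning when the estimated time to reach the minor threshold drops below a chosen number of days.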


Conclusion

Once you build awareness in the organization that you have a bird’s-eye view of the technical landscape and can monitor the ecosystem of each application (as an ecologist would), people become more meticulous when introducing new elements into the environment. They know that you are watching, taking samples, and trending on overall health and stability, which leaves you free to focus on the strategic side of APM without distraction.

ABOUT Larry Dragich

Larry Dragich, a regular blogger and contributor on APMdigest, has 23 years of IT experience and has been in an IT leadership role at the Auto Club Group (ACG) for the past ten years. He serves as Director of Enterprise Application Services (EAS) at the Auto Club Group, with overall accountability for optimizing the capability of the IT infrastructure to deliver high availability and optimal performance. Dragich is actively involved with industry leaders, sharing knowledge of APM technologies ranging from best practices and technical workflows to resource allocation and approaches for implementing APM strategies.

You can contact Larry on LinkedIn

Related Links:

For a high-level view of a much broader technology space, refer to the slide show on BrightTALK.com, which presents “The Anatomy of APM” webcast in more context.

For more information on the critical success factors in APM adoption and how this centers around the End-User-Experience (EUE), read The Anatomy of APM and the corresponding blog APM’s DNA – Event to Incident Flow.

Prioritizing Gartner's APM Model

APM and MoM – Symbiotic Solution Sets
