4 Tips for Dealing with All Those Event Alerts
July 10, 2013

Ariel Gordon

Share this

IT operations handles hundreds, or even thousands, of console messages day in and day out – including weekends. It’s an ongoing 24x7 battle. Data centers keep expanding and increasing in complexity, yet operations is still expected to manage the flood of event alerts pouring in.

Compounding the problem of the sheer volume of events, these alert notifications typically uses technical language that can only be understood by domain experts and come entirely without context.

So, let’s have a look at some tips that will help IT operations personnel deal with all of this by focusing on important events, while understanding their impact on delivery of business services.

1. Add meaning with enrichment rules

Turn cryptic technical messages into meaningful information with text to describe the event including severity prioritization, owner, and if known the service(s) impacted. The illustration below provides an example. This helps to clarify impact of the event alert and provides guidance about the next steps to be taken.

2. Apply correlation rules

Apply correlation rules to help reduce redundant events displayed on the console. Use filtering rules to remove events below a specific impact level – or events that impact less important components such as test servers. It’s also possible to use de-duplication rules to reduce noise related to the same event.

3. Apply tools that define all business service infrastructure components and their interrelationships

Then, you’ll be able to understand the links between IT events and their associated context and impact on business services.

4. Be proactive to understand the impact of changes in the IT infrastructure

It’s a truism in IT that 80 percent of problems originate from changes. Get in front of those event alerts caused by change so you understand “will an upgrade to that problematic switch port take down the customer portal, or does it only affect ordering supplies?” Ensuring safer changes can eliminate many event alerts.

Ariel Gordon is Chief Technology Officer and Co-Founder of Neebula.

Share this

The Latest

October 21, 2021

Scaling DevOps and SRE practices is critical to accelerating the release of high-quality digital services. However, siloed teams, manual approaches, and increasingly complex tooling slow innovation and make teams more reactive than proactive, impeding their ability to drive value for the business, according to a new report from Dynatrace, Deep Cloud Observability and Advanced AIOps are Key to Scaling DevOps Practices ...

October 20, 2021

Over three quarters (79%) of database professionals are now using either a paid-for or in-house monitoring tool, according to a new survey from Redgate Software ...

October 19, 2021

Gartner announced the top strategic technology trends that organizations need to explore in 2022. With CEOs and Boards striving to find growth through direct digital connections with customers, CIOs' priorities must reflect the same business imperatives, which run through each of Gartner's top strategic tech trends for 2022 ...

October 18, 2021

Distributed tracing has been growing in popularity as a primary tool for investigating performance issues in microservices systems. Our recent DevOps Pulse survey shows a 38% increase year-over-year in organizations' tracing use. Furthermore, 64% of those respondents who are not yet using tracing indicated plans to adopt it in the next two years ...

October 14, 2021

Businesses are embracing artificial intelligence (AI) technologies to improve network performance and security, according to a new State of AIOps Study, conducted by ZK Research and Masergy ...

October 13, 2021

What may have appeared to be a stopgap solution in the spring of 2020 is now clearly our new workplace reality: It's impossible to walk back so many of the developments in workflow we've seen since then. The question is no longer when we'll all get back to the office, but how the companies that are lagging in their technological ability to facilitate remote work can catch up ...

October 12, 2021

The pandemic accelerated organizations' journey to the cloud to enable agile, on-demand, flexible access to resources, helping them align with a digital business's dynamic needs. We heard from many of our customers at the start of lockdown last year, saying they had to shift to a remote work environment, seemingly overnight, and this effort was heavily cloud-reliant. However, blindly forging ahead can backfire ...

October 07, 2021

SmartBear recently released the results of its 2021 State of Software Quality | Testing survey. I doubt you'll be surprised to hear that a "lack of time" was reported as the number one challenge to doing more testing, especially as release frequencies continue to increase. However, it was disheartening to see that a lack of time was also the number one response when we asked people to identify the biggest blocker to professional development ...

October 06, 2021

The role of the CIO is evolving with an increased focus on unlocking customer connections through service innovation, according to the 2021 Global CIO Survey. The study reveals the shift in the role of the CIO with the majority of CIO respondents stating innovation, operational efficiency, and customer experience as their top priorities ...

October 05, 2021

The perception of IT support has dramatically improved thanks to the successful response of service desks to the pandemic, lockdowns and working from home, according to new research from the Service Desk Institute (SDI), sponsored by Sunrise Software ...