4 Tips for Dealing with All Those Event Alerts
July 10, 2013

Ariel Gordon

Share this

IT operations handles hundreds, or even thousands, of console messages day in and day out – including weekends. It’s an ongoing 24x7 battle. Data centers keep expanding and increasing in complexity, yet operations is still expected to manage the flood of event alerts pouring in.

Compounding the problem of the sheer volume of events, these alert notifications typically uses technical language that can only be understood by domain experts and come entirely without context.

So, let’s have a look at some tips that will help IT operations personnel deal with all of this by focusing on important events, while understanding their impact on delivery of business services.

1. Add meaning with enrichment rules

Turn cryptic technical messages into meaningful information with text to describe the event including severity prioritization, owner, and if known the service(s) impacted. The illustration below provides an example. This helps to clarify impact of the event alert and provides guidance about the next steps to be taken.

2. Apply correlation rules

Apply correlation rules to help reduce redundant events displayed on the console. Use filtering rules to remove events below a specific impact level – or events that impact less important components such as test servers. It’s also possible to use de-duplication rules to reduce noise related to the same event.

3. Apply tools that define all business service infrastructure components and their interrelationships

Then, you’ll be able to understand the links between IT events and their associated context and impact on business services.

4. Be proactive to understand the impact of changes in the IT infrastructure

It’s a truism in IT that 80 percent of problems originate from changes. Get in front of those event alerts caused by change so you understand “will an upgrade to that problematic switch port take down the customer portal, or does it only affect ordering supplies?” Ensuring safer changes can eliminate many event alerts.

Ariel Gordon is Chief Technology Officer and Co-Founder of Neebula.

Share this

The Latest

September 27, 2023

Navigating observability pricing models can be compared to solving a perplexing puzzle which includes financial variables and contractual intricacies. Predicting all potential costs in advance becomes an elusive endeavor, exemplified by a recent eye-popping $65 million observability bill ...

September 26, 2023

Generative AI may be a great tool for the enterprise to help drive further innovation and meaningful work, but it also runs the risk of generating massive amounts of spam that will counteract its intended benefits. From increased AI spam bots to data maintenance due to large volumes of outputs, enterprise AI applications can create a cascade of issues that end up detracting from productivity gains ...

September 25, 2023

A long-running study of DevOps practices ... suggests that any historical gains in MTTR reduction have now plateaued. For years now, the time it takes to restore services has stayed about the same: less than a day for high performers but up to a week for middle-tier teams and up to a month for laggards. The fact that progress is flat despite big investments in people, tools and automation is a cause for concern ...

September 21, 2023

Companies implementing observability benefit from increased operational efficiency, faster innovation, and better business outcomes overall, according to 2023 IT Trends Report: Lessons From Observability Leaders, a report from SolarWinds ...

September 20, 2023

IT leaders are driving an increasing number of automation initiatives as a way to stay competitive, reduce costs and scale as they navigate an unpredictable social and economic environment, according to the 2023 State of Automation in IT survey conducted by Jitterbit ...

September 19, 2023

Customer loyalty is changing as retailers get increasingly competitive. More than 75% of consumers say they would end business with a company after a single bad customer experience. This means that just one price discrepancy, inventory mishap or checkout issue in a physical or digital store, could have customers running out to the next store that can provide them with better service. Retailers must be able to predict business outages in advance, and act proactively before an incident occurs, impacting customer experience ...

September 18, 2023
Digital transformation is key to ensuring companies keep up with the competitive market landscape. Putting digital at the core of a business can significantly reduce operating expenses and inefficiencies. However, this process often means changing the way internal teams work with one another. To help with the transition, this blog offers chief experience officers (CXOs) advice on how to lead a successful digital transformation project ...
September 14, 2023

Earlier this year, New Relic conducted a study on observability ... The 2023 Observability Forecast reveals observability's impact on the lives of technical professionals and businesses' bottom lines. Here are 10 key takeaways from the forecast ...

September 13, 2023
On September 10, MGM Resorts experienced what it called a "cybersecurity issue" that had a major impact on the company's systems, showing how cyberattacks can bring down applications, ultimately causing problems for a company in many ways ...
September 12, 2023

Only 33% of executives are "very confident" in their ability to operate in a public cloud environment, according to the 2023 State of CloudOps report from NetApp. This represents an increase from 2022 when only 21% reported feeling very confident ...