What Can AIOps Do For IT Ops? - Part 4
October 28, 2021
Share this

APMdigest asked the top minds in the industry what they think AIOps can do for IT Operations. Part 4 covers root cause analysis and automation.

Start with What Can AIOps Do For IT Ops? - Part 1

Start with What Can AIOps Do For IT Ops? - Part 2

Start with What Can AIOps Do For IT Ops? - Part 3

SINGLE PANE OF GLASS

AIOps provides a much needed real-time "single-pane-of-glass" view into complex IT infrastructures that encompass fragmented and distributed multi-vendor, multi-domain technologies including legacy, virtualization, hybrid cloud, containers, microservices, and others. Although AIOps is a seismic change for IT operations, it's not a radical application of analytics and machine learning. The potential of AIOps is enormous. Enterprises that have deployed AIOps solutions are experiencing transformational benefits in revenue growth, better customer retention, improved customer experience, lower costs, and enhanced performance. The time to move is now.
Maruti Sivakumar V
SVP, Head of Digital & Practices, Blue.cloud

ISOLATING THE ROOT CAUSE

AIOps helps build high-quality incidents that include all the necessary technical and business context, alongside AI/ML-identified probable root cause and root cause changes — and present it all within a single pane of glass.
Mohan Kompella, VP Product Marketing,
Adam Blau, Director of Product Marketing,
Anirban Chatterjee, Director of Product Marketing, BigPanda

AIOps is a buzzword 6 different types of products designed to create value for IT Operations professionals. Always pick specific use cases you wish to solve and then understand how machine learning and AI can apply to solve that issue or set of issues. Good examples of this are to help the user isolate the root cause down to a specific component, highlight outliers in graphs and other views, correlate likely related data types together. Generally, these technologies help augment the operator of the software versus being automation magic. Most often these are features in other Observability tools versus AIOps platforms. AIOps platforms are fantasy because the semantic meaning of data is not clear. The result is vendors write rules to analyze the data, making the resulted outcomes only work in specific situations which makes them useless when a major problem happens across a set of complex systems.
Jonah Kowall
CTO, Logz.io

AUTOMATED ROOT CAUSE ANALYSIS

Response automation is one of the most value-driving features of AIOps software tools. IT operators are able to conduct performance tests to establish a baseline for each metric or KPI and define acceptable thresholds for the ones they want to prioritize. When a KPI breach is detected, AIOps software can perform an automated root cause analysis to automatically determine why a problem occurred and implement a solution if one is available.
Abel Gonzalez
Director of Product Marketing, Sumo Logic

Machine learning and AI are not just critical — but foundational — components of a dynamic monitoring platform. Modern applications are constantly in flux, and microservices scale through ephemeral cloud and container infrastructure in response to demand. As these systems become more complex and dynamic, operational tasks consume an increasing share of engineering time. AIOps optimizes and automates IT operations so that engineers can get proactively alerted no matter the size of the workloads, and benefit from an augmented troubleshooting experience by cutting through noise to glean key insights. In some cases, AI can auto-discover the root cause of an issue, saving minutes or hours of stressful investigations. This is the core advantage of effective AIOps — less engineering time wasted on managing complex operations, and more time building new products for customers.
Renaud Boutet
VP of Product, Datadog

BETTER DECISION-MAKING

From a monitoring and observability perspective, a key benefit of AIOps has been the ability to use historical data to increase confidence in decisions that we previously thought were black-and-white. It's relatively simple to have a machine check if a service is up or down, but how do we find the trends that show that whilst the website is up, it's gradually been getting slower over the past few months? Modern tooling allows us to collect enough data and process it fast enough — often in real-time — for the machines to be able to make better-informed decisions, faster. Such decisions could only be made by lengthy human inspection previously. It's a great example of modern tooling working in the background to make sure everything is okay, so we don't have to.
Matt Saunders
Head of DevOps, Adaptavist

AIOps observability can play a critical role in terms of expected trends using the data from users, systems and processes and provide the data back to the decision-makers to make the investment call based on the pattern, trends, etc. With growing Cloud demand, it is imperative the enterprises start investing in AIOps before it is too late.
Vishnu Vasudevan
Head of Product Engineering and Management, Opsera

SYNCING WITH ITSM

Create automated, bi-directional syncing with your ITSM platform, on-call or other collaboration tools and reduce ticket/notification volumes by up to 95%
Mohan Kompella, VP Product Marketing,
Adam Blau, Director of Product Marketing,
Anirban Chatterjee, Director of Product Marketing, BigPanda

First generation AIOps solutions are a step in the right direction, to address the unending IT complexity, but needed more care and feed and only solved limited set of problems for ITOps teams. Looking ahead, new age AIOps platforms are poised to make AIOps faster, better and cheaper — by automating data preparations and integrations, by having native asset/topology intelligence and by using expanded AI/ML frameworks like neural networks, NLP, transformer models and graph databases to address a lot more use cases. This paves a path where everybody in the IT benefits — ITSM, Service Desk, IT Asset/Planning and more.
Tejo Prayaga
Product Management, CloudFabrix

UNDERSTANDING ALGORITHMS

The last several years have seen a dramatic increase in the use of AI across all types of companies and platforms. These complex solutions require more parts of an organization to be knowledgeable of AI, from data pipelines to the workflows that build, qualify and optimize the models. Having a specialized Ops function that understands this end-to-end is going to be critical for maximizing AI's effectiveness in a production environment. Over time, AIOps can build a deeper understanding of the algorithms, then use that knowledge to enhance the infrastructure with automated services around data cleaning, model tuning and scaling that will continue delivering key results for the business. This kind of specialty is beyond what a traditional IT Operations team can do with the breadth that they are normally expected to maintain.
David Luks
VP of Engineering, Smart Applications, Lucidworks

AUTOMATION

AIOps delivers significant value to businesses by automating many of the manual, tedious tasks that distract IT from working on higher level projects, especially when it comes to data prep.
David P. Mariani
CTO and Founder, AtScale

As the cadence of business continues to gain momentum and competition builds, organizations must not only innovate but also identify business problems and inefficiencies and utilize technology to overcome them. AIOps acts as the salve for many enterprise challenges by anchoring a triangulation of machine learning, decision automation and advanced analytics to automate repetitive tasks, freeing IT teams to work on new mission critical and challenging problems — resulting in faster completion of projects and improved business outcomes.
Alan Young
CPO, InRule

REMEDIAL OPTIMIZATION

IT Operations cannot keep up with the requirements of keeping cloud applications functional and running their best. IT Ops needs to utilize the power of AI to keep the many combinations of app parameters and metrics in an optimal state. Moreso, for AIOps to keep operational apps optimized it needs to be continuous (always on) and autonomous (no human intervention). This way AIOps can perform the remedial optimization work the IT Ops SREs would do, but much faster and with more accuracy.
Peter Nickolov
Co-Founder and VP of Engineering, Opsani

Go to What Can AIOps Do For IT Ops? - Part 5

Share this

The Latest

September 26, 2023

Generative AI may be a great tool for the enterprise to help drive further innovation and meaningful work, but it also runs the risk of generating massive amounts of spam that will counteract its intended benefits. From increased AI spam bots to data maintenance due to large volumes of outputs, enterprise AI applications can create a cascade of issues that end up detracting from productivity gains ...

September 25, 2023

A long-running study of DevOps practices ... suggests that any historical gains in MTTR reduction have now plateaued. For years now, the time it takes to restore services has stayed about the same: less than a day for high performers but up to a week for middle-tier teams and up to a month for laggards. The fact that progress is flat despite big investments in people, tools and automation is a cause for concern ...

September 21, 2023

Companies implementing observability benefit from increased operational efficiency, faster innovation, and better business outcomes overall, according to 2023 IT Trends Report: Lessons From Observability Leaders, a report from SolarWinds ...

September 20, 2023

IT leaders are driving an increasing number of automation initiatives as a way to stay competitive, reduce costs and scale as they navigate an unpredictable social and economic environment, according to the 2023 State of Automation in IT survey conducted by Jitterbit ...

September 19, 2023

Customer loyalty is changing as retailers get increasingly competitive. More than 75% of consumers say they would end business with a company after a single bad customer experience. This means that just one price discrepancy, inventory mishap or checkout issue in a physical or digital store, could have customers running out to the next store that can provide them with better service. Retailers must be able to predict business outages in advance, and act proactively before an incident occurs, impacting customer experience ...

September 18, 2023
Digital transformation is key to ensuring companies keep up with the competitive market landscape. Putting digital at the core of a business can significantly reduce operating expenses and inefficiencies. However, this process often means changing the way internal teams work with one another. To help with the transition, this blog offers chief experience officers (CXOs) advice on how to lead a successful digital transformation project ...
September 14, 2023

Earlier this year, New Relic conducted a study on observability ... The 2023 Observability Forecast reveals observability's impact on the lives of technical professionals and businesses' bottom lines. Here are 10 key takeaways from the forecast ...

September 13, 2023
On September 10, MGM Resorts experienced what it called a "cybersecurity issue" that had a major impact on the company's systems, showing how cyberattacks can bring down applications, ultimately causing problems for a company in many ways ...
September 12, 2023

Only 33% of executives are "very confident" in their ability to operate in a public cloud environment, according to the 2023 State of CloudOps report from NetApp. This represents an increase from 2022 when only 21% reported feeling very confident ...

September 11, 2023

The majority of organizations across Australia and New Zealand (A/NZ) breached over the last year had personally identifiable information (PII) compromised, but most have not yet modified their data management policies, according to the Cybersecurity and PII Report from ManageEngine ...