Skip to main content

What Can AIOps Do For IT Ops? - Part 6

APMdigest asked the top minds in the industry what they think AIOps can do for IT Operations. Part 6 is the final installment in the series.

Start with What Can AIOps Do For IT Ops? - Part 1

Start with What Can AIOps Do For IT Ops? - Part 2

Start with What Can AIOps Do For IT Ops? - Part 3

Start with What Can AIOps Do For IT Ops? - Part 4

Start with What Can AIOps Do For IT Ops? - Part 5

SCALABILITY

AIOps advantages can be summed up in one word — scalability. A main advantage of AIOps within DevOps teams is the ability to scale a business with new technology, without having to scale the operations of new services in kind. AIOps allows DevOps teams to focus on innovating and improving the customer experience — the driving force of profitability — not on the constant pressure of monitoring and operating these services. Forward thinking DevOps teams need to be looking at AIOps and machine learning as mission critical to deliver higher availability of services."
Sean McDermott
CEO, Windward Consulting Group

Over the years, there's been a change in the ratio of people managing computers to the number of computers. In the 60s and 70s, there were many operators per machine. With the cloud, one admin manages thousands, possibly hundreds of thousands, of computers. The only way that's been managed has been through improvements in tooling. AIOps is the latest improvement in tooling and enables IT staff to work effectively with huge clusters that dynamically change. No human could possibly watch all the log files looking for anomalies and no simple set of Perl or Python scripts could automate that process. The only way to do this is to use AI to analyze the data being thrown off by huge clusters of computing resources, look for anomalies, and if possible, correct problems without requiring human involvement. For example, AI could detect signatures of failing devices, like disk drives, then move the data from the failing drive to a spare and notify a human to swap in a replacement. An AI system coupled with load balancing hardware could also make predictions about what your traffic will be and allocate resources accordingly. This is especially valuable in the cloud, where admins can allocate and release computing power as needed.
Mike Loukides
VP of Emerging Tech Content, O'Reilly Media

OPTIMIZING VALUE STREAMS

AIOps allows IT Operations to focus more on creating value stream optimization
Muraleedharan Vijayakumar
Senior Technical Manager, GAVS Technologies

The conversation on domain-agnostic versus domain-specific does not really matter. In the past, the domain-agnostic AIOps tools heavily rely on integrations with many different sources to collect data. Domain-centric AIOps tools typically collect most of the required data themselves and sometimes can be more specific to special domains, such as log management or specific application topics such as ERP. What this means: I believe Artificial Intelligence will and should be used across many domains and the current task for IT enterprises is to determine where they want to leverage AI capabilities to gain insights and reduce waste and toil. When analyzing the vendors in this space I found that some vendors tout their AI capabilities specifically for IT operations, others have and are adding additional data analytics and intelligent integrations to support evolving operating models. I think the next normal will require the leverage of AI across the value streams to successfully execute and delivery quality digital services and applications to customers.
Eveline Oehrlich
Chief Research Officer, DevOps Institute

ENABLING SMALLER TEAMS TO BE MORE EFFECTIVE

AIOps enables a small traditional IT Ops team to be much more effective and expand its reach. It can cover a much wider remit, including more systems to deploy, more geographies, and more variants (support AB testing).
Gareth Smith
GM of Eggplant, part of Keysight Technologies

DRIFT TRACKING

Drift tracking from inception to current production state has been a desired state in Enterprise for decades. AIOps can provide operations with a view into the Drift of changes from what was initially deployed to how the environment has changed over time. Understanding Drift is critical to reduce tech debt, incidents and problems across clients to cloud.
Jeanne Morain
Author, Strategist and Transformation Pioneer, iSpeak Cloud

DE-RISK ROLLOUT OF NEW INITIATIVES

AIOps can be the "extra pair of hands" to help identify problems and issues before they happen and from complex and varied data sets that would be difficult for a human to comprehend. This helps de-risk the rollout of new initiatives as issues are quickly identified and, if necessary, remediated or rolled back all quicker than a human can react.
Gareth Smith
GM of Eggplant, part of Keysight Technologies

FOCUS ON MORE STRATEGIC INITIATIVES

AIOps can also classify common issues allowing the Ops team to focus its time and effort on more strategic initiatives for greater efficiencies and benefits.
Gareth Smith
GM of Eggplant, part of Keysight Technologies

LABOR SAVINGS

With AIOps, when you multiply that reduction in fruitless labor cost by the number of applications and infrastructure assets that could generate alerts, multiplied by the amount of times groups in DevOps and IT were handing off issues to each other, a significant labor savings is at stake, as well as a higher rate of employee retention.
Jason English
Principal Analyst, Intellyx

COST EFFICIENCY

AIOps can allow IT organizations to operate efficiently and provide more reliable, scalable infrastructure for their users. With the vast amount of data available today, AIOps allows IT organizations to easily understand things like resource constraints, traffic patterns and automate / scale infrastructure more efficiently. Things that would take a human a lot of time to automate.
Saro Subbiah
VP of Engineering and Technology for Monitor & Platform, Sysdig

Hot Topics

The Latest

2020 was the equivalent of a wedding with a top-shelf open bar. As businesses scrambled to adjust to remote work, digital transformation accelerated at breakneck speed. New software categories emerged overnight. Tech stacks ballooned with all sorts of SaaS apps solving ALL the problems — often with little oversight or long-term integration planning, and yes frequently a lot of duplicated functionality ... But now the music's faded. The lights are on. Everyone from the CIO to the CFO is checking the bill. Welcome to the Great SaaS Hangover ...

Regardless of OpenShift being a scalable and flexible software, it can be a pain to monitor since complete visibility into the underlying operations is not guaranteed ... To effectively monitor an OpenShift environment, IT administrators should focus on these five key elements and their associated metrics ...

An overwhelming majority of IT leaders (95%) believe the upcoming wave of AI-powered digital transformation is set to be the most impactful and intensive seen thus far, according to The Science of Productivity: AI, Adoption, And Employee Experience, a new report from Nexthink ...

Overall outage frequency and the general level of reported severity continue to decline, according to the Outage Analysis 2025 from Uptime Institute. However, cyber security incidents are on the rise and often have severe, lasting impacts ...

In March, New Relic published the State of Observability for Media and Entertainment Report to share insights, data, and analysis into the adoption and business value of observability across the media and entertainment industry. Here are six key takeaways from the report ...

Regardless of their scale, business decisions often take time, effort, and a lot of back-and-forth discussion to reach any sort of actionable conclusion ... Any means of streamlining this process and getting from complex problems to optimal solutions more efficiently and reliably is key. How can organizations optimize their decision-making to save time and reduce excess effort from those involved? ...

As enterprises accelerate their cloud adoption strategies, CIOs are routinely exceeding their cloud budgets — a concern that's about to face additional pressure from an unexpected direction: uncertainty over semiconductor tariffs. The CIO Cloud Trends Survey & Report from Azul reveals the extent continued cloud investment despite cost overruns, and how organizations are attempting to bring spending under control ...

Image
Azul

According to Auvik's 2025 IT Trends Report, 60% of IT professionals feel at least moderately burned out on the job, with 43% stating that their workload is contributing to work stress. At the same time, many IT professionals are naming AI and machine learning as key areas they'd most like to upskill ...

Businesses that face downtime or outages risk financial and reputational damage, as well as reducing partner, shareholder, and customer trust. One of the major challenges that enterprises face is implementing a robust business continuity plan. What's the solution? The answer may lie in disaster recovery tactics such as truly immutable storage and regular disaster recovery testing ...

IT spending is expected to jump nearly 10% in 2025, and organizations are now facing pressure to manage costs without slowing down critical functions like observability. To meet the challenge, leaders are turning to smarter, more cost effective business strategies. Enter stage right: OpenTelemetry, the missing piece of the puzzle that is no longer just an option but rather a strategic advantage ...

What Can AIOps Do For IT Ops? - Part 6

APMdigest asked the top minds in the industry what they think AIOps can do for IT Operations. Part 6 is the final installment in the series.

Start with What Can AIOps Do For IT Ops? - Part 1

Start with What Can AIOps Do For IT Ops? - Part 2

Start with What Can AIOps Do For IT Ops? - Part 3

Start with What Can AIOps Do For IT Ops? - Part 4

Start with What Can AIOps Do For IT Ops? - Part 5

SCALABILITY

AIOps advantages can be summed up in one word — scalability. A main advantage of AIOps within DevOps teams is the ability to scale a business with new technology, without having to scale the operations of new services in kind. AIOps allows DevOps teams to focus on innovating and improving the customer experience — the driving force of profitability — not on the constant pressure of monitoring and operating these services. Forward thinking DevOps teams need to be looking at AIOps and machine learning as mission critical to deliver higher availability of services."
Sean McDermott
CEO, Windward Consulting Group

Over the years, there's been a change in the ratio of people managing computers to the number of computers. In the 60s and 70s, there were many operators per machine. With the cloud, one admin manages thousands, possibly hundreds of thousands, of computers. The only way that's been managed has been through improvements in tooling. AIOps is the latest improvement in tooling and enables IT staff to work effectively with huge clusters that dynamically change. No human could possibly watch all the log files looking for anomalies and no simple set of Perl or Python scripts could automate that process. The only way to do this is to use AI to analyze the data being thrown off by huge clusters of computing resources, look for anomalies, and if possible, correct problems without requiring human involvement. For example, AI could detect signatures of failing devices, like disk drives, then move the data from the failing drive to a spare and notify a human to swap in a replacement. An AI system coupled with load balancing hardware could also make predictions about what your traffic will be and allocate resources accordingly. This is especially valuable in the cloud, where admins can allocate and release computing power as needed.
Mike Loukides
VP of Emerging Tech Content, O'Reilly Media

OPTIMIZING VALUE STREAMS

AIOps allows IT Operations to focus more on creating value stream optimization
Muraleedharan Vijayakumar
Senior Technical Manager, GAVS Technologies

The conversation on domain-agnostic versus domain-specific does not really matter. In the past, the domain-agnostic AIOps tools heavily rely on integrations with many different sources to collect data. Domain-centric AIOps tools typically collect most of the required data themselves and sometimes can be more specific to special domains, such as log management or specific application topics such as ERP. What this means: I believe Artificial Intelligence will and should be used across many domains and the current task for IT enterprises is to determine where they want to leverage AI capabilities to gain insights and reduce waste and toil. When analyzing the vendors in this space I found that some vendors tout their AI capabilities specifically for IT operations, others have and are adding additional data analytics and intelligent integrations to support evolving operating models. I think the next normal will require the leverage of AI across the value streams to successfully execute and delivery quality digital services and applications to customers.
Eveline Oehrlich
Chief Research Officer, DevOps Institute

ENABLING SMALLER TEAMS TO BE MORE EFFECTIVE

AIOps enables a small traditional IT Ops team to be much more effective and expand its reach. It can cover a much wider remit, including more systems to deploy, more geographies, and more variants (support AB testing).
Gareth Smith
GM of Eggplant, part of Keysight Technologies

DRIFT TRACKING

Drift tracking from inception to current production state has been a desired state in Enterprise for decades. AIOps can provide operations with a view into the Drift of changes from what was initially deployed to how the environment has changed over time. Understanding Drift is critical to reduce tech debt, incidents and problems across clients to cloud.
Jeanne Morain
Author, Strategist and Transformation Pioneer, iSpeak Cloud

DE-RISK ROLLOUT OF NEW INITIATIVES

AIOps can be the "extra pair of hands" to help identify problems and issues before they happen and from complex and varied data sets that would be difficult for a human to comprehend. This helps de-risk the rollout of new initiatives as issues are quickly identified and, if necessary, remediated or rolled back all quicker than a human can react.
Gareth Smith
GM of Eggplant, part of Keysight Technologies

FOCUS ON MORE STRATEGIC INITIATIVES

AIOps can also classify common issues allowing the Ops team to focus its time and effort on more strategic initiatives for greater efficiencies and benefits.
Gareth Smith
GM of Eggplant, part of Keysight Technologies

LABOR SAVINGS

With AIOps, when you multiply that reduction in fruitless labor cost by the number of applications and infrastructure assets that could generate alerts, multiplied by the amount of times groups in DevOps and IT were handing off issues to each other, a significant labor savings is at stake, as well as a higher rate of employee retention.
Jason English
Principal Analyst, Intellyx

COST EFFICIENCY

AIOps can allow IT organizations to operate efficiently and provide more reliable, scalable infrastructure for their users. With the vast amount of data available today, AIOps allows IT organizations to easily understand things like resource constraints, traffic patterns and automate / scale infrastructure more efficiently. Things that would take a human a lot of time to automate.
Saro Subbiah
VP of Engineering and Technology for Monitor & Platform, Sysdig

Hot Topics

The Latest

2020 was the equivalent of a wedding with a top-shelf open bar. As businesses scrambled to adjust to remote work, digital transformation accelerated at breakneck speed. New software categories emerged overnight. Tech stacks ballooned with all sorts of SaaS apps solving ALL the problems — often with little oversight or long-term integration planning, and yes frequently a lot of duplicated functionality ... But now the music's faded. The lights are on. Everyone from the CIO to the CFO is checking the bill. Welcome to the Great SaaS Hangover ...

Regardless of OpenShift being a scalable and flexible software, it can be a pain to monitor since complete visibility into the underlying operations is not guaranteed ... To effectively monitor an OpenShift environment, IT administrators should focus on these five key elements and their associated metrics ...

An overwhelming majority of IT leaders (95%) believe the upcoming wave of AI-powered digital transformation is set to be the most impactful and intensive seen thus far, according to The Science of Productivity: AI, Adoption, And Employee Experience, a new report from Nexthink ...

Overall outage frequency and the general level of reported severity continue to decline, according to the Outage Analysis 2025 from Uptime Institute. However, cyber security incidents are on the rise and often have severe, lasting impacts ...

In March, New Relic published the State of Observability for Media and Entertainment Report to share insights, data, and analysis into the adoption and business value of observability across the media and entertainment industry. Here are six key takeaways from the report ...

Regardless of their scale, business decisions often take time, effort, and a lot of back-and-forth discussion to reach any sort of actionable conclusion ... Any means of streamlining this process and getting from complex problems to optimal solutions more efficiently and reliably is key. How can organizations optimize their decision-making to save time and reduce excess effort from those involved? ...

As enterprises accelerate their cloud adoption strategies, CIOs are routinely exceeding their cloud budgets — a concern that's about to face additional pressure from an unexpected direction: uncertainty over semiconductor tariffs. The CIO Cloud Trends Survey & Report from Azul reveals the extent continued cloud investment despite cost overruns, and how organizations are attempting to bring spending under control ...

Image
Azul

According to Auvik's 2025 IT Trends Report, 60% of IT professionals feel at least moderately burned out on the job, with 43% stating that their workload is contributing to work stress. At the same time, many IT professionals are naming AI and machine learning as key areas they'd most like to upskill ...

Businesses that face downtime or outages risk financial and reputational damage, as well as reducing partner, shareholder, and customer trust. One of the major challenges that enterprises face is implementing a robust business continuity plan. What's the solution? The answer may lie in disaster recovery tactics such as truly immutable storage and regular disaster recovery testing ...

IT spending is expected to jump nearly 10% in 2025, and organizations are now facing pressure to manage costs without slowing down critical functions like observability. To meet the challenge, leaders are turning to smarter, more cost effective business strategies. Enter stage right: OpenTelemetry, the missing piece of the puzzle that is no longer just an option but rather a strategic advantage ...