Skip to main content

What Can AIOps Do For IT Ops? - Part 2

APMdigest asked the top minds in the industry what they think AIOps can do for IT Operations. Part 2 covers capabilities supported by AIOps, such as visibility and alerting.

Start with What Can AIOps Do For IT Ops? - Part 1

END-TO-END VISIBILITY

Rapid digitization has made maintaining the performance of key business services across a hybrid cloud environment more imperative than ever. AIOps has quickly become the key technology to provide end-to-end visibility across the technology domains that support business and technology services. AIOps can be used in domain-agnostic settings, applying AI/ML across existing data from monitoring tools, or in a domain-centric way within a specific technology for existing multi-cloud, serverless and container environments. In both instances, the goal is to minimize downtime, speed up code releases and allow developers to get back to writing great software and features rather than spending cycles troubleshooting, and creates opportunities for teams to leverage data-driven decisions to quickly pin-point problems. While many organizations feel that they improved incident response and problem resolution within an area, leveraging proactive AIOps to provide end-to-end visibility across and deep within technology services is a competitive advantage.
Kia Behnia
VP of IT Operations, Splunk

ANOMALY DETECTION

AI (machine learning in particular) is particularly good at recognizing patterns in data — and thus highlights exceptions to those patterns. As a result, AIOps excels at anomaly detection. However, AI drops the ball when the data don't follow clear patterns. The more chaotic the IT environment, the poorer AIOps works.
Jason Bloomberg
President, Intellyx

The anomaly detection from AIOps gives users powerful visualizations comparing the statistics from one period of time compared to another.
Russell Rothstein
Founder and CEO, IT Central Station

PREDICTING INCIDENTS BEFORE THEY BECOME PROBLEMS

Use AI/ML to easily detect evolving incidents across your IT Ops environment as they happen, through the use of cross-domain enrichment
Mohan Kompella, VP Product Marketing,
Adam Blau, Director of Product Marketing,
Anirban Chatterjee, Director of Product Marketing, BigPanda

AIOps is the quintessential analysis of application data coming from the hosting platform, systems logs, Continuous Monitoring tools, CI/CD events, as well as many other sources which can be gradually added as the practice matures (e.g. QA results/metrics output, incidents tracking, etc.) With the help of AI and ML and minimal human intervention, AIOps solutions are able to comb through aggregated data, identify the most relevant pieces of information and correlate them using deterministic analysis. This is the most exciting part of AIOps as it gives way to the true cognitive operations support capable of predicting (and possibly mitigating) incidents before they fully manifested themself in the application's ecosystem.
Oleg Boyko
CTO, Exadel

INTELLIGENT ALERTING

An AIOps tool can predict outcomes and smooth the path for IT operations using intelligent event notification indicating the likely problem within the infrastructure and propose probable solutions. AIOps can reach out to the specific owner or support team providing them with specific information about the issue, suggest remediation, or remediate the problem directly. AIOps tools are transformative for IT operations because they allow operations to concentrate on the business, not on watching screens.
Ron Williams
Analyst, Gigaom

REDUCED ALERT NOISE

AIOps platforms reduce alert noise by grouping interrelated alerts into high-quality incidents that describe to IT Ops teams what the problem is, what's impacted, what the root cause is, what action to take and then kick off automated remediation and resolution workflows.
Mohan Kompella, VP Product Marketing,
Adam Blau, Director of Product Marketing,
Anirban Chatterjee, Director of Product Marketing, BigPanda

AIOps extends value for distributed DevOps teams because of the drastic reduction in alert noise it creates. A majority of alerts currently presented to NOC, IT ops, and SRE teams are deemed irrelevant in mid-to-large sized enterprises. A great deal of toil can be eliminated, freeing up valuable employees to work on the hard, important problems.
Jason English
Principal Analyst, Intellyx

We have seen IT Central Station reviewers using AIOps capabilities for anomaly detection and for troubleshooting, which they say reduces noise when alerts come in.
Russell Rothstein
Founder and CEO, IT Central Station

Now, more than ever, IT teams are overwhelmed by the growing amount of noise from complex infrastructure and applications. Signal noise can impede finding incident root causes in critical moments, lead to longer resolution times, and bring about tedious, manual remediation practices. Teams end up with very little insight into what they need to improve to prevent similar problems from happening in the future. Investing in AIOps helps teams reduce noise from various monitoring tools, find the probable source of significant incidents, and automate as much of the incident response process as possible.
Andrew Marshall
Sr. Director of Product Marketing and Advocacy, PagerDuty

IT Ops teams are drowning in data. The advantage AIOps provides to IT Ops teams is the ability to find a signal through the noise.
Thomas LaRock
Head Geek, SolarWinds

With complicated and multilayered security stacks in play today, companies are finally having to deal with the issue of too many alerts being generated too often, commonly known as "alert fatigue." AIOps technology can easily and effectively reduce alert fatigue by applying a correlation of alerts from several sources and producing actionable alerts that point towards a common root cause. Ultimately, this reduces the noisy background, low priority, or unturned alerts that commonly distracts or hides credible alerts or patterns from security and IT professionals. As a result, there is less chance that they miss a credible alert or threat if AIOps is filtering, correlating, and pattern matching raw alerts into actionable events and alerts.
Chuck Everette
Director of Cybersecurity Advocacy, Deep Instinct

Go to What Can AIOps Do For IT Ops? - Part 3

Hot Topics

The Latest

Technology leaders across the federal landscape are facing, and will continue to face, an uphill battle when it comes to fortifying their digital environments against hostile and persistent threat actors. On one hand, they are being asked to push digital transformation ... On the other hand, they are facing the fiscal uncertainty of continuing resolutions (CR) and government shutdowns looming near and far. In the face of these challenges, CIOs, CTOs, and CISOs must figure out how to modernize legacy systems and infrastructure while doing more with less and still defending against external and internal threats ...

Reliability is no longer proven by uptime alone, according to the The SRE Report 2026 from LogicMonitor. In the AI era, it is experienced through speed, consistency, and user trust, and increasingly judged by business impact. As digital services grow more complex and AI systems move into production, traditional monitoring approaches are struggling to keep pace, increasing the need for AI-first observability that spans applications, infrastructure, and the Internet ...

If AI is the engine of a modern organization, then data engineering is the road system beneath it. You can build the most powerful engine in the world, but without paved roads, traffic signals, and bridges that can support its weight, it will stall. In many enterprises, the engine is ready. The roads are not ...

In the world of digital-first business, there is no tolerance for service outages. Businesses know that outages are the quickest way to lose money and customers. For smaller organizations, unplanned downtime could even force the business to close ... A new study from PagerDuty, The State of AI-First Operations, reveals that companies actively incorporating AI into operations now view operational resilience as a growth driver rather than a cost center. But how are they achieving it? ...

In live financial environments, capital markets software cannot pause for rebuilds. New capabilities are introduced as stacked technology layers to meet evolving demands while systems remain active, data keeps moving, and controls stay intact. AI is no exception, and its opportunities are significant: accelerated decision cycles, compressed manual workflows, and more effective operations across complex environments. The constraint isn't the models themselves, but the architectural environments they enter ...

Like most digital transformation shifts, organizations often prioritize productivity and leave security and observability to keep pace. This usually translates to both the mass implementation of new technology and fragmented monitoring and observability (M&O) tooling. In the era of AI and varied cloud architecture, a disparate observability function can be dangerous. IT teams will lack a complete picture of their IT environment, making it harder to diagnose issues while slowing down mean time to resolve (MTTR). In fact, according to recent data from the SolarWinds State of Monitoring & Observability Report, 77% of IT personnel said the lack of visibility across their on-prem and cloud architecture was an issue ...

In MEAN TIME TO INSIGHT Episode 23, Shamus McGillicuddy, VP of Research, Network Infrastructure and Operations, at EMA discusses the NetOps labor shortage ... 

Technology management is evolving, and in turn, so is the scope of FinOps. The FinOps Foundation recently updated their mission statement from "advancing the people who manage the value of cloud" to "advancing the people who manage the value of technology." This seemingly small change solidifies a larger evolution: FinOps practitioners have organically expanded to be focused on more than just cloud cost optimization. Today, FinOps teams are largely — and quickly — expanding their job descriptions, evolving into a critical function for managing the full value of technology ...

Enterprises are under pressure to scale AI quickly. Yet despite considerable investment, adoption continues to stall. One of the most overlooked reasons is vendor sprawl ... In reality, no organization deliberately sets out to create sprawling vendor ecosystems. More often, complexity accumulates over time through well-intentioned initiatives, such as enterprise-wide digital transformation efforts, point solutions, or decentralized sourcing strategies ...

Nearly every conversation about AI eventually circles back to compute. GPUs dominate the headlines while cloud platforms compete for workloads and model benchmarks drive investment decisions. But underneath that noise, a quieter infrastructure challenge is taking shape. The real bottleneck in enterprise AI is not processing power, it is the ability to store, manage and retrieve the relentless volumes of data that AI systems generate, consume and multiply ...

What Can AIOps Do For IT Ops? - Part 2

APMdigest asked the top minds in the industry what they think AIOps can do for IT Operations. Part 2 covers capabilities supported by AIOps, such as visibility and alerting.

Start with What Can AIOps Do For IT Ops? - Part 1

END-TO-END VISIBILITY

Rapid digitization has made maintaining the performance of key business services across a hybrid cloud environment more imperative than ever. AIOps has quickly become the key technology to provide end-to-end visibility across the technology domains that support business and technology services. AIOps can be used in domain-agnostic settings, applying AI/ML across existing data from monitoring tools, or in a domain-centric way within a specific technology for existing multi-cloud, serverless and container environments. In both instances, the goal is to minimize downtime, speed up code releases and allow developers to get back to writing great software and features rather than spending cycles troubleshooting, and creates opportunities for teams to leverage data-driven decisions to quickly pin-point problems. While many organizations feel that they improved incident response and problem resolution within an area, leveraging proactive AIOps to provide end-to-end visibility across and deep within technology services is a competitive advantage.
Kia Behnia
VP of IT Operations, Splunk

ANOMALY DETECTION

AI (machine learning in particular) is particularly good at recognizing patterns in data — and thus highlights exceptions to those patterns. As a result, AIOps excels at anomaly detection. However, AI drops the ball when the data don't follow clear patterns. The more chaotic the IT environment, the poorer AIOps works.
Jason Bloomberg
President, Intellyx

The anomaly detection from AIOps gives users powerful visualizations comparing the statistics from one period of time compared to another.
Russell Rothstein
Founder and CEO, IT Central Station

PREDICTING INCIDENTS BEFORE THEY BECOME PROBLEMS

Use AI/ML to easily detect evolving incidents across your IT Ops environment as they happen, through the use of cross-domain enrichment
Mohan Kompella, VP Product Marketing,
Adam Blau, Director of Product Marketing,
Anirban Chatterjee, Director of Product Marketing, BigPanda

AIOps is the quintessential analysis of application data coming from the hosting platform, systems logs, Continuous Monitoring tools, CI/CD events, as well as many other sources which can be gradually added as the practice matures (e.g. QA results/metrics output, incidents tracking, etc.) With the help of AI and ML and minimal human intervention, AIOps solutions are able to comb through aggregated data, identify the most relevant pieces of information and correlate them using deterministic analysis. This is the most exciting part of AIOps as it gives way to the true cognitive operations support capable of predicting (and possibly mitigating) incidents before they fully manifested themself in the application's ecosystem.
Oleg Boyko
CTO, Exadel

INTELLIGENT ALERTING

An AIOps tool can predict outcomes and smooth the path for IT operations using intelligent event notification indicating the likely problem within the infrastructure and propose probable solutions. AIOps can reach out to the specific owner or support team providing them with specific information about the issue, suggest remediation, or remediate the problem directly. AIOps tools are transformative for IT operations because they allow operations to concentrate on the business, not on watching screens.
Ron Williams
Analyst, Gigaom

REDUCED ALERT NOISE

AIOps platforms reduce alert noise by grouping interrelated alerts into high-quality incidents that describe to IT Ops teams what the problem is, what's impacted, what the root cause is, what action to take and then kick off automated remediation and resolution workflows.
Mohan Kompella, VP Product Marketing,
Adam Blau, Director of Product Marketing,
Anirban Chatterjee, Director of Product Marketing, BigPanda

AIOps extends value for distributed DevOps teams because of the drastic reduction in alert noise it creates. A majority of alerts currently presented to NOC, IT ops, and SRE teams are deemed irrelevant in mid-to-large sized enterprises. A great deal of toil can be eliminated, freeing up valuable employees to work on the hard, important problems.
Jason English
Principal Analyst, Intellyx

We have seen IT Central Station reviewers using AIOps capabilities for anomaly detection and for troubleshooting, which they say reduces noise when alerts come in.
Russell Rothstein
Founder and CEO, IT Central Station

Now, more than ever, IT teams are overwhelmed by the growing amount of noise from complex infrastructure and applications. Signal noise can impede finding incident root causes in critical moments, lead to longer resolution times, and bring about tedious, manual remediation practices. Teams end up with very little insight into what they need to improve to prevent similar problems from happening in the future. Investing in AIOps helps teams reduce noise from various monitoring tools, find the probable source of significant incidents, and automate as much of the incident response process as possible.
Andrew Marshall
Sr. Director of Product Marketing and Advocacy, PagerDuty

IT Ops teams are drowning in data. The advantage AIOps provides to IT Ops teams is the ability to find a signal through the noise.
Thomas LaRock
Head Geek, SolarWinds

With complicated and multilayered security stacks in play today, companies are finally having to deal with the issue of too many alerts being generated too often, commonly known as "alert fatigue." AIOps technology can easily and effectively reduce alert fatigue by applying a correlation of alerts from several sources and producing actionable alerts that point towards a common root cause. Ultimately, this reduces the noisy background, low priority, or unturned alerts that commonly distracts or hides credible alerts or patterns from security and IT professionals. As a result, there is less chance that they miss a credible alert or threat if AIOps is filtering, correlating, and pattern matching raw alerts into actionable events and alerts.
Chuck Everette
Director of Cybersecurity Advocacy, Deep Instinct

Go to What Can AIOps Do For IT Ops? - Part 3

Hot Topics

The Latest

Technology leaders across the federal landscape are facing, and will continue to face, an uphill battle when it comes to fortifying their digital environments against hostile and persistent threat actors. On one hand, they are being asked to push digital transformation ... On the other hand, they are facing the fiscal uncertainty of continuing resolutions (CR) and government shutdowns looming near and far. In the face of these challenges, CIOs, CTOs, and CISOs must figure out how to modernize legacy systems and infrastructure while doing more with less and still defending against external and internal threats ...

Reliability is no longer proven by uptime alone, according to the The SRE Report 2026 from LogicMonitor. In the AI era, it is experienced through speed, consistency, and user trust, and increasingly judged by business impact. As digital services grow more complex and AI systems move into production, traditional monitoring approaches are struggling to keep pace, increasing the need for AI-first observability that spans applications, infrastructure, and the Internet ...

If AI is the engine of a modern organization, then data engineering is the road system beneath it. You can build the most powerful engine in the world, but without paved roads, traffic signals, and bridges that can support its weight, it will stall. In many enterprises, the engine is ready. The roads are not ...

In the world of digital-first business, there is no tolerance for service outages. Businesses know that outages are the quickest way to lose money and customers. For smaller organizations, unplanned downtime could even force the business to close ... A new study from PagerDuty, The State of AI-First Operations, reveals that companies actively incorporating AI into operations now view operational resilience as a growth driver rather than a cost center. But how are they achieving it? ...

In live financial environments, capital markets software cannot pause for rebuilds. New capabilities are introduced as stacked technology layers to meet evolving demands while systems remain active, data keeps moving, and controls stay intact. AI is no exception, and its opportunities are significant: accelerated decision cycles, compressed manual workflows, and more effective operations across complex environments. The constraint isn't the models themselves, but the architectural environments they enter ...

Like most digital transformation shifts, organizations often prioritize productivity and leave security and observability to keep pace. This usually translates to both the mass implementation of new technology and fragmented monitoring and observability (M&O) tooling. In the era of AI and varied cloud architecture, a disparate observability function can be dangerous. IT teams will lack a complete picture of their IT environment, making it harder to diagnose issues while slowing down mean time to resolve (MTTR). In fact, according to recent data from the SolarWinds State of Monitoring & Observability Report, 77% of IT personnel said the lack of visibility across their on-prem and cloud architecture was an issue ...

In MEAN TIME TO INSIGHT Episode 23, Shamus McGillicuddy, VP of Research, Network Infrastructure and Operations, at EMA discusses the NetOps labor shortage ... 

Technology management is evolving, and in turn, so is the scope of FinOps. The FinOps Foundation recently updated their mission statement from "advancing the people who manage the value of cloud" to "advancing the people who manage the value of technology." This seemingly small change solidifies a larger evolution: FinOps practitioners have organically expanded to be focused on more than just cloud cost optimization. Today, FinOps teams are largely — and quickly — expanding their job descriptions, evolving into a critical function for managing the full value of technology ...

Enterprises are under pressure to scale AI quickly. Yet despite considerable investment, adoption continues to stall. One of the most overlooked reasons is vendor sprawl ... In reality, no organization deliberately sets out to create sprawling vendor ecosystems. More often, complexity accumulates over time through well-intentioned initiatives, such as enterprise-wide digital transformation efforts, point solutions, or decentralized sourcing strategies ...

Nearly every conversation about AI eventually circles back to compute. GPUs dominate the headlines while cloud platforms compete for workloads and model benchmarks drive investment decisions. But underneath that noise, a quieter infrastructure challenge is taking shape. The real bottleneck in enterprise AI is not processing power, it is the ability to store, manage and retrieve the relentless volumes of data that AI systems generate, consume and multiply ...