Skip to main content

Next Steps for ITOA - Part 4

APMdigest asked experts across the industry — including analysts, consultants and vendors — for their opinions on the next steps for ITOA. These next steps include where the experts believe ITOA is headed, as well as where they think it should be headed. Part 4 covers automation and dynamic IT environment.

Start with Next Steps for ITOA - Part 1

Start with Next Steps for ITOA - Part 2

Start with Next Steps for ITOA - Part 3

AUTOMATED PROBLEM DETECTION AND RESOLUTION

ITOA is following a general progression: from providing better visibility into the operational environment, to identifying root causes of issues, to predicting issues before they occur, to automatically preventing issues. The current state of the art today centers on predictive analytics, as machine learning and other AI approaches are particularly useful for this task. Automatic prevention of issues, especially in complex enterprise environments, is still largely in the future – but given the pace of innovation today, it's right around the corner.
Jason Bloomberg
President, Intellyx

Automating many of the back end processes developed around IT Infrastructure Library (ITIL) using IT Operations Analytics to speed up time to value will be the next step for ITOA. It is no longer about analyzing the data – it is how you automate out obstacles for the Software Defined Datacenter of tomorrow.
Jeanne Morain
Author and Strategist, iSpeak Cloud

Fueled by the learning and interpretation of operational data, intelligent automation and self-healing systems will become predominant questions for companies faced to an explosion of the number of physical and virtual devices devices (ITaaS, IoT). Those topics will prevail, not only because of major improvements in the accuracy of algorithms, but simply because of the impossibility for humans to manage booming volume of IT Ops information.
Yann Guernion
Product Marketing Director, Workload Automation, Automic Software

With the development of big data and AI techniques specific to IT operations, we can automate detection and repair of IT problems *before* they occur – a huge step towards the "holy grail" of 100% uptime.
Kimberley Parsons Trommler
Product Evangelist, Paessler AG

Performance management is important, but can be a struggle when keeping up with new technologies and constantly growing system complexities. The only viable solution over time is to extend the use of automation. Using operation analytics to create baselines and leveraging big data analytics to help detect anomalies and prevent incidents ensures that when incidents occur you have automatically gathered all relevant information to determine the root cause.
Sven Hammar
Founder and CEO, Apica

DIGITAL PROCESS AUTOMATION

Digital process automation (DPA) is fast emerging as the next step in the evolution of IT Operation Analytics. DPA allows for far closer collaboration between business and IT to map core operations and business transactions, providing greater data analytics and insight. As a result, DPA empowers the enterprise to be more responsive to customers, deliver products to market faster and provide an enhanced customer experience. Arming workers with the right data at the right time, and in the specific context of that business moment, helps them do their jobs more effectively and to respond to the changing needs of digitally-savvy consumers in near to real time.
Rich Fitchen
GM of North America, Bizagi

DEEP LEARNING

The volume, variety, and velocity of changes in telemetry data, technology, and processes will drive ITOA evolution towards deep learning. For context, artificial intelligence is exemplified by knowledge bases, while machine learning focuses the knowledge via logical regression, and deep learning moves into the realm of artificial neural networks, also known as multilayer perceptrons (MLP). The next two years will be a race to discover deep learning models that will enable ITOA automation and orchestration that optimize any organization's application stack to meet their customers' on-demand needs regardless of the consumption platform, delivery model, or rate and size of consumption. The end goal remains the same: delivering frictionless consumption for an organization's revenue-generating and revenue-supporting application services. In this journey, "monitoring with discipline" will play a key role in determining the efficiency and the effectiveness of the deep learning ITOA models towards that goal.
Kong Yang
Head Geek, SolarWinds

FLEXIBILITY AND RAPID CONFIGURATION

Real time data from advances like IoT and machine learning to artificial intelligence offer a lot of potential to make ITOA a force for improving the customer experience. However, collecting, organizing and making sense of the deluge of data from such disparate sources and executing against the insights they provide, is going to require a level of flexibility that will tax traditional IT systems. Platforms that can be rapidly configured to changing business needs will emerge.
Colin Earl
CEO, Agiloft

CONTAINERS AND MICROSERVICES

With more and more applications running in containers or microservices, IT Operations Analytics (ITOA) becomes more important to be able to process the constantly changing nature of these services. ITOA also enables us to truly understand the service dependencies, detect anomalies, and generate appropriate alerts or even self-healing fixes that can maximize uptime for the services and minimize the time requirements for IT staff.
Steve Lack
VP of Cloud Solutions, Astadia

Read Next Steps for ITOA - Part 5, offering some interesting final thoughts.

Hot Topics

The Latest

Like most digital transformation shifts, organizations often prioritize productivity and leave security and observability to keep pace. This usually translates to both the mass implementation of new technology and fragmented monitoring and observability (M&O) tooling. In the era of AI and varied cloud architecture, a disparate observability function can be dangerous. IT teams will lack a complete picture of their IT environment, making it harder to diagnose issues while slowing down mean time to resolve (MTTR). In fact, according to recent data from the SolarWinds State of Monitoring & Observability Report, 77% of IT personnel said the lack of visibility across their on-prem and cloud architecture was an issue ...

In MEAN TIME TO INSIGHT Episode 23, Shamus McGillicuddy, VP of Research, Network Infrastructure and Operations, at EMA discusses the NetOps labor shortage ... 

Technology management is evolving, and in turn, so is the scope of FinOps. The FinOps Foundation recently updated their mission statement from "advancing the people who manage the value of cloud" to "advancing the people who manage the value of technology." This seemingly small change solidifies a larger evolution: FinOps practitioners have organically expanded to be focused on more than just cloud cost optimization. Today, FinOps teams are largely — and quickly — expanding their job descriptions, evolving into a critical function for managing the full value of technology ...

Enterprises are under pressure to scale AI quickly. Yet despite considerable investment, adoption continues to stall. One of the most overlooked reasons is vendor sprawl ... In reality, no organization deliberately sets out to create sprawling vendor ecosystems. More often, complexity accumulates over time through well-intentioned initiatives, such as enterprise-wide digital transformation efforts, point solutions, or decentralized sourcing strategies ...

Nearly every conversation about AI eventually circles back to compute. GPUs dominate the headlines while cloud platforms compete for workloads and model benchmarks drive investment decisions. But underneath that noise, a quieter infrastructure challenge is taking shape. The real bottleneck in enterprise AI is not processing power, it is the ability to store, manage and retrieve the relentless volumes of data that AI systems generate, consume and multiply ...

The 2026 Observability Survey from Grafana Labs paints a vivid picture of an industry maturing fast, where AI is welcomed with careful conditions, SaaS economics are reshaping spending decisions, complexity remains a defining challenge, and open standards continue to underpin it all ...

The observability industry has an evolving relationship with AI. We're not skeptics, but it's clear that trust in AI must be earned ... In Grafana Labs' annual Observability Survey, 92% said they see real value in AI surfacing anomalies before they cause downtime. Another 91% endorsed AI for forecasting and root cause analysis. So while the demand is there, customers need it to be trustworthy, as the survey also found that the practitioners most enthusiastic about AI are also the most insistent on explainability ...

In the modern enterprise, the conversation around AI has moved past skepticism toward a stage of active adoption. According to our 2026 State of IT Trends Report: The Human Side of Autonomous AI, nearly 90% of IT professionals view AI as a net positive, and this optimism is well-founded. We are seeing agentic AI move beyond simple automation to actively streamlining complex data insights and eliminating the manual toil that has long hindered innovation. However, as we integrate these autonomous agents into our ecosystems, the fundamental DNA of the IT role is evolving ...

AI workloads require an enormous amount of computing power ... What's also becoming abundantly clear is just how quickly AI's computing needs are leading to enterprise systems failure. According to Cockroach Labs' State of AI Infrastructure 2026 report, enterprise systems are much closer to failure than their organizations realize. The report ... suggests AI scale could cause widespread failures in as little as one year — making it a clear risk for business performance and reliability.

The quietest week your engineering team has ever had might also be its best. No alarms going off. No escalations. No frantic Teams or Slack threads at 2 a.m. Everything humming along exactly as it should. And somewhere in a leadership meeting, someone looks at the metrics dashboard, sees a flat line of incidents and says: "Seems like things are pretty calm over there. Do we really need all those people?" ... I've spent many years in engineering, and this pattern keeps repeating ...

Next Steps for ITOA - Part 4

APMdigest asked experts across the industry — including analysts, consultants and vendors — for their opinions on the next steps for ITOA. These next steps include where the experts believe ITOA is headed, as well as where they think it should be headed. Part 4 covers automation and dynamic IT environment.

Start with Next Steps for ITOA - Part 1

Start with Next Steps for ITOA - Part 2

Start with Next Steps for ITOA - Part 3

AUTOMATED PROBLEM DETECTION AND RESOLUTION

ITOA is following a general progression: from providing better visibility into the operational environment, to identifying root causes of issues, to predicting issues before they occur, to automatically preventing issues. The current state of the art today centers on predictive analytics, as machine learning and other AI approaches are particularly useful for this task. Automatic prevention of issues, especially in complex enterprise environments, is still largely in the future – but given the pace of innovation today, it's right around the corner.
Jason Bloomberg
President, Intellyx

Automating many of the back end processes developed around IT Infrastructure Library (ITIL) using IT Operations Analytics to speed up time to value will be the next step for ITOA. It is no longer about analyzing the data – it is how you automate out obstacles for the Software Defined Datacenter of tomorrow.
Jeanne Morain
Author and Strategist, iSpeak Cloud

Fueled by the learning and interpretation of operational data, intelligent automation and self-healing systems will become predominant questions for companies faced to an explosion of the number of physical and virtual devices devices (ITaaS, IoT). Those topics will prevail, not only because of major improvements in the accuracy of algorithms, but simply because of the impossibility for humans to manage booming volume of IT Ops information.
Yann Guernion
Product Marketing Director, Workload Automation, Automic Software

With the development of big data and AI techniques specific to IT operations, we can automate detection and repair of IT problems *before* they occur – a huge step towards the "holy grail" of 100% uptime.
Kimberley Parsons Trommler
Product Evangelist, Paessler AG

Performance management is important, but can be a struggle when keeping up with new technologies and constantly growing system complexities. The only viable solution over time is to extend the use of automation. Using operation analytics to create baselines and leveraging big data analytics to help detect anomalies and prevent incidents ensures that when incidents occur you have automatically gathered all relevant information to determine the root cause.
Sven Hammar
Founder and CEO, Apica

DIGITAL PROCESS AUTOMATION

Digital process automation (DPA) is fast emerging as the next step in the evolution of IT Operation Analytics. DPA allows for far closer collaboration between business and IT to map core operations and business transactions, providing greater data analytics and insight. As a result, DPA empowers the enterprise to be more responsive to customers, deliver products to market faster and provide an enhanced customer experience. Arming workers with the right data at the right time, and in the specific context of that business moment, helps them do their jobs more effectively and to respond to the changing needs of digitally-savvy consumers in near to real time.
Rich Fitchen
GM of North America, Bizagi

DEEP LEARNING

The volume, variety, and velocity of changes in telemetry data, technology, and processes will drive ITOA evolution towards deep learning. For context, artificial intelligence is exemplified by knowledge bases, while machine learning focuses the knowledge via logical regression, and deep learning moves into the realm of artificial neural networks, also known as multilayer perceptrons (MLP). The next two years will be a race to discover deep learning models that will enable ITOA automation and orchestration that optimize any organization's application stack to meet their customers' on-demand needs regardless of the consumption platform, delivery model, or rate and size of consumption. The end goal remains the same: delivering frictionless consumption for an organization's revenue-generating and revenue-supporting application services. In this journey, "monitoring with discipline" will play a key role in determining the efficiency and the effectiveness of the deep learning ITOA models towards that goal.
Kong Yang
Head Geek, SolarWinds

FLEXIBILITY AND RAPID CONFIGURATION

Real time data from advances like IoT and machine learning to artificial intelligence offer a lot of potential to make ITOA a force for improving the customer experience. However, collecting, organizing and making sense of the deluge of data from such disparate sources and executing against the insights they provide, is going to require a level of flexibility that will tax traditional IT systems. Platforms that can be rapidly configured to changing business needs will emerge.
Colin Earl
CEO, Agiloft

CONTAINERS AND MICROSERVICES

With more and more applications running in containers or microservices, IT Operations Analytics (ITOA) becomes more important to be able to process the constantly changing nature of these services. ITOA also enables us to truly understand the service dependencies, detect anomalies, and generate appropriate alerts or even self-healing fixes that can maximize uptime for the services and minimize the time requirements for IT staff.
Steve Lack
VP of Cloud Solutions, Astadia

Read Next Steps for ITOA - Part 5, offering some interesting final thoughts.

Hot Topics

The Latest

Like most digital transformation shifts, organizations often prioritize productivity and leave security and observability to keep pace. This usually translates to both the mass implementation of new technology and fragmented monitoring and observability (M&O) tooling. In the era of AI and varied cloud architecture, a disparate observability function can be dangerous. IT teams will lack a complete picture of their IT environment, making it harder to diagnose issues while slowing down mean time to resolve (MTTR). In fact, according to recent data from the SolarWinds State of Monitoring & Observability Report, 77% of IT personnel said the lack of visibility across their on-prem and cloud architecture was an issue ...

In MEAN TIME TO INSIGHT Episode 23, Shamus McGillicuddy, VP of Research, Network Infrastructure and Operations, at EMA discusses the NetOps labor shortage ... 

Technology management is evolving, and in turn, so is the scope of FinOps. The FinOps Foundation recently updated their mission statement from "advancing the people who manage the value of cloud" to "advancing the people who manage the value of technology." This seemingly small change solidifies a larger evolution: FinOps practitioners have organically expanded to be focused on more than just cloud cost optimization. Today, FinOps teams are largely — and quickly — expanding their job descriptions, evolving into a critical function for managing the full value of technology ...

Enterprises are under pressure to scale AI quickly. Yet despite considerable investment, adoption continues to stall. One of the most overlooked reasons is vendor sprawl ... In reality, no organization deliberately sets out to create sprawling vendor ecosystems. More often, complexity accumulates over time through well-intentioned initiatives, such as enterprise-wide digital transformation efforts, point solutions, or decentralized sourcing strategies ...

Nearly every conversation about AI eventually circles back to compute. GPUs dominate the headlines while cloud platforms compete for workloads and model benchmarks drive investment decisions. But underneath that noise, a quieter infrastructure challenge is taking shape. The real bottleneck in enterprise AI is not processing power, it is the ability to store, manage and retrieve the relentless volumes of data that AI systems generate, consume and multiply ...

The 2026 Observability Survey from Grafana Labs paints a vivid picture of an industry maturing fast, where AI is welcomed with careful conditions, SaaS economics are reshaping spending decisions, complexity remains a defining challenge, and open standards continue to underpin it all ...

The observability industry has an evolving relationship with AI. We're not skeptics, but it's clear that trust in AI must be earned ... In Grafana Labs' annual Observability Survey, 92% said they see real value in AI surfacing anomalies before they cause downtime. Another 91% endorsed AI for forecasting and root cause analysis. So while the demand is there, customers need it to be trustworthy, as the survey also found that the practitioners most enthusiastic about AI are also the most insistent on explainability ...

In the modern enterprise, the conversation around AI has moved past skepticism toward a stage of active adoption. According to our 2026 State of IT Trends Report: The Human Side of Autonomous AI, nearly 90% of IT professionals view AI as a net positive, and this optimism is well-founded. We are seeing agentic AI move beyond simple automation to actively streamlining complex data insights and eliminating the manual toil that has long hindered innovation. However, as we integrate these autonomous agents into our ecosystems, the fundamental DNA of the IT role is evolving ...

AI workloads require an enormous amount of computing power ... What's also becoming abundantly clear is just how quickly AI's computing needs are leading to enterprise systems failure. According to Cockroach Labs' State of AI Infrastructure 2026 report, enterprise systems are much closer to failure than their organizations realize. The report ... suggests AI scale could cause widespread failures in as little as one year — making it a clear risk for business performance and reliability.

The quietest week your engineering team has ever had might also be its best. No alarms going off. No escalations. No frantic Teams or Slack threads at 2 a.m. Everything humming along exactly as it should. And somewhere in a leadership meeting, someone looks at the metrics dashboard, sees a flat line of incidents and says: "Seems like things are pretty calm over there. Do we really need all those people?" ... I've spent many years in engineering, and this pattern keeps repeating ...