Skip to main content

Q&A Part Two: IBM Talks About Predictive Analytics

Pete Goldin
APMdigest

In Part Two of APMdigest's exclusive interview, Matthew Ellis, IBM Vice President of Service Availability and Performance, discusses predictive analytics.

Click here to start with Part One of the Q&A with IBM VP Matthew Ellis.

APM: Why is predictive analytics gaining so much momentum recently, especially with respect to APM?

ME: Analytics is important to all phases of operations. In all areas of business it is axiomatic that more data enables better decisions, and operations and application management are no exceptions.

Just as important, however, is sorting that data to identify the critical context for decision makers to act on, and this is where analytics come in.

IBM is investing in analytics very seriously, and from an operations management perspective, we apply analytics in three categories: Simplify Operations Management, Avoid Business Disruption, and Enable Optimization.

Simplify Operations Management is a class of analytics technology that enables our customers to do the work that they do today more easily. This includes historical analysis of data to recommend and establish dynamic thresholds, and trending of performance and capacity data to identify areas that may become bottlenecks based on historical behavior.

Avoid Business Disruption is the key driver for the predictive analytics component. The goal is early identification of environmental changes that indicate a significant change in the behavior of an application or service, and to bring this information to the attention of the operations management team so that problems can be identified and addressed before they ever impact a customer. We have identified emerging problems days before traditional management tools saw signs of trouble and in some situations, discovered problems in unmonitored resources that were affecting the behavior of critical applications.

Enable Optimization is the ability to mine collected data across multiple dimensions enabling insight and optimization of services and applications by enabling rich insight. It is also known as business analytics.

APM: What specific functionality should an organization look for in predictive analytics technology?

ME: At IBM, we believe there are three key capabilities that any analytics solution must have to provide maximum predictive capability:

1. Algorithms: Multivariate Analytic techniques are critical to identifying emerging problems early, while all metric data is still well within their normal range.

The key to this statistical approach is to monitor the relationships of important related data metrics and raise an exception when the relationships of data change in significant ways. Any single metric displays a wide range of variability during a normal day, increasing and decreasing with changing workloads, and daily, weekly and seasonal behavior.

In general, however, related metrics will follow the same pattern all the time in a healthy system. Successfully identifying these relationships, and accurately determining when these relationships diverge in an important way is key to accurate early identification of problems.

Our algorithms are developed and refined by one of the largest private math departments in the world; the same organization that developed Watson to win at Jeopardy.

2. Scalability: Analytics solutions work better when they have more data upon which to base their conclusions. The IBM analytics solutions directly leverage proven data collection technologies that have been in use for most of a decade and have seen continual refinement. This capability is proven to be able to collect millions of data points per second, and deliver that data to the analytics engine with very low latency offering real-time evaluation of very large data streams. We believe that the data collection technology we are using is the most scalable and high performance in the industry.

3. Breadth of Monitored Resources: One of our design requirements was to deliver an easily extensible mediation capability allowing customers (or our services teams) to connect any data source to our data collection solution in a matter of hours or days.

During our pilot, we have worked with many products from non-IBM vendors and our team has found that almost all data integration work can be done in a very short time without ever requiring a visit to the customer site, saving time and money while maximizing data availability for analysis.

APM: How do you see Predictive Analytics evolving over the next few years?

ME: IBM expects that analytics tools, and the organizations that use them, will evolve rapidly over the next few years. IBM is investing heavily in providing highly scalable, flexible, and robust systems for identifying emerging problems as early as possible.

We expect analytics to evolve along multiple dimensions:

1. Improvements in analytics learning and data exchange with existing application and service discovery, topology, and CMDB data to combine the strengths of traditional IT tools with analytics learning solutions. This will accelerate the statistical learning process and allow the learned relationships to be built back into the visible topology of the environment.

2. Apply analytics solutions to additional IT management domains to include Smarter Infrastructures, improved detection of security problems, asset management and maintenance scheduling and additional problems

3. Further improve feedback and integration of learning technologies, process optimization, and analytics in general with operations processes.

About Matthew Ellis

Matthew Ellis is the Vice President of Development for Tivoli's Service Availability & Performance Management product portfolio with IBM. This product suite enables monitoring and modeling the utilization, performance, capacity and energy-use of distributed, mainframe and virtualized platforms and associated application software. Ellis joined IBM in 2006 through the Micromuse acquisition, where he was the Vice President of Software Development.

Click here to read Part One of the Q&A with IBM VP Matthew Ellis.

Hot Topic
The Latest
The Latest 10

The Latest

AI is becoming the operating system of the enterprise. It acts as an invisible coordination layer that understands intent, connects systems, and executes work across complex SaaS environments. Previously, employees had to click through multiple systems — CRM, ERP, support tools, collaboration platforms — to complete a single task. Now, instead of navigating each application manually, they can simply state what they need to accomplish ...

In 2026, the cost of downtime or an outage is no longer just a technical inconvenience; it's a $600 billion wake up call for global businesses. As our digital ecosystems become  more interconnected, each touchpoint introduces new risks and multiplies the consequences when things go wrong. And the data is clear: aggregate downtime costs  for Global 2,000 companies have surged 50% since 2024, reaching a staggering $600 billion ...

Deloitte found that 74% of enterprises expect to deploy agentic AI solutions in the next 24 months. However, the rush to deployment is outpacing foundational work, though. Only 21% of enterprises have fully formed agent governance models in place. The result? AI agents deployed without guidance or governance begin to function as fragmented islands of complexity ...

Cloud spending is no longer viewed as a passthrough IT expense, but as a strategic financial lever that directly impacts innovation capacity, profitability and enterprise resilience, according to the CFO Cloud Cost Optimization Report from Azul ...

As AI moves from generating responses to performing actions, the need for trust increases exponentially. And as organizations enlist AI agents for increasingly sophisticated business processes, trust is going to be the single most important theme for spurring adoption. What can organizations do to build trustworthy AI agents? ...

I've spent a lot of time in the channel, and one thing I keep coming back to is this: a partner program is only as good as what it looks like in the field. Many programs look great on paper, but when a partner is in front of a customer navigating a complex hybrid environment or trying to make the case for AI-powered observability, the gap between what a vendor promises and what it actually delivers becomes very clear, very fast ...

Enterprises today operate in a real-time environment where uninterrupted access to trusted data has become a baseline expectation for users, applications and automated systems. Traditional DataOps models, built on manual effort and human triage, cannot keep pace with this always active demand. AI agents are emerging as the operational backbone, ensuring consistent data availability, reinforcing trustworthiness and enabling a level of scale that manual processes cannot achieve ...

For decades, trust in the digital workplace rested on familiar signals. We trusted faces on video calls, voices on the phone, and emails that appeared to come from people we knew. These cues felt human and intuitive. They anchored how decisions were made, approvals were granted, and access was authorized. AI-powered deepfakes have quietly broken that model ...

Cloud migration was supposed to be a one-way door. For most enterprises, it turns out it isn't. Cloud data repatriation is a real and growing trend. A new survey ... finds that 89% of organizations plan to expand their on-premises infrastructure footprint over the next two years — and 75% have already moved at least some workloads back from public cloud in the past 24 months. The findings point to a broad rethinking of where data belongs ...

Over the past few years, large language models (LLMs) have revolutionized the software industry. Given their ability to excel at multi-step reasoning, LLMs have helped enterprises streamline workflows and adapt to the unknown. However, employing such models comes with sky-high costs, latency issues, and limited flexibility. In the realm of IT operations, it is generally wiser to employ smaller, domain-specific models instead ...

Q&A Part Two: IBM Talks About Predictive Analytics

Pete Goldin
APMdigest

In Part Two of APMdigest's exclusive interview, Matthew Ellis, IBM Vice President of Service Availability and Performance, discusses predictive analytics.

Click here to start with Part One of the Q&A with IBM VP Matthew Ellis.

APM: Why is predictive analytics gaining so much momentum recently, especially with respect to APM?

ME: Analytics is important to all phases of operations. In all areas of business it is axiomatic that more data enables better decisions, and operations and application management are no exceptions.

Just as important, however, is sorting that data to identify the critical context for decision makers to act on, and this is where analytics come in.

IBM is investing in analytics very seriously, and from an operations management perspective, we apply analytics in three categories: Simplify Operations Management, Avoid Business Disruption, and Enable Optimization.

Simplify Operations Management is a class of analytics technology that enables our customers to do the work that they do today more easily. This includes historical analysis of data to recommend and establish dynamic thresholds, and trending of performance and capacity data to identify areas that may become bottlenecks based on historical behavior.

Avoid Business Disruption is the key driver for the predictive analytics component. The goal is early identification of environmental changes that indicate a significant change in the behavior of an application or service, and to bring this information to the attention of the operations management team so that problems can be identified and addressed before they ever impact a customer. We have identified emerging problems days before traditional management tools saw signs of trouble and in some situations, discovered problems in unmonitored resources that were affecting the behavior of critical applications.

Enable Optimization is the ability to mine collected data across multiple dimensions enabling insight and optimization of services and applications by enabling rich insight. It is also known as business analytics.

APM: What specific functionality should an organization look for in predictive analytics technology?

ME: At IBM, we believe there are three key capabilities that any analytics solution must have to provide maximum predictive capability:

1. Algorithms: Multivariate Analytic techniques are critical to identifying emerging problems early, while all metric data is still well within their normal range.

The key to this statistical approach is to monitor the relationships of important related data metrics and raise an exception when the relationships of data change in significant ways. Any single metric displays a wide range of variability during a normal day, increasing and decreasing with changing workloads, and daily, weekly and seasonal behavior.

In general, however, related metrics will follow the same pattern all the time in a healthy system. Successfully identifying these relationships, and accurately determining when these relationships diverge in an important way is key to accurate early identification of problems.

Our algorithms are developed and refined by one of the largest private math departments in the world; the same organization that developed Watson to win at Jeopardy.

2. Scalability: Analytics solutions work better when they have more data upon which to base their conclusions. The IBM analytics solutions directly leverage proven data collection technologies that have been in use for most of a decade and have seen continual refinement. This capability is proven to be able to collect millions of data points per second, and deliver that data to the analytics engine with very low latency offering real-time evaluation of very large data streams. We believe that the data collection technology we are using is the most scalable and high performance in the industry.

3. Breadth of Monitored Resources: One of our design requirements was to deliver an easily extensible mediation capability allowing customers (or our services teams) to connect any data source to our data collection solution in a matter of hours or days.

During our pilot, we have worked with many products from non-IBM vendors and our team has found that almost all data integration work can be done in a very short time without ever requiring a visit to the customer site, saving time and money while maximizing data availability for analysis.

APM: How do you see Predictive Analytics evolving over the next few years?

ME: IBM expects that analytics tools, and the organizations that use them, will evolve rapidly over the next few years. IBM is investing heavily in providing highly scalable, flexible, and robust systems for identifying emerging problems as early as possible.

We expect analytics to evolve along multiple dimensions:

1. Improvements in analytics learning and data exchange with existing application and service discovery, topology, and CMDB data to combine the strengths of traditional IT tools with analytics learning solutions. This will accelerate the statistical learning process and allow the learned relationships to be built back into the visible topology of the environment.

2. Apply analytics solutions to additional IT management domains to include Smarter Infrastructures, improved detection of security problems, asset management and maintenance scheduling and additional problems

3. Further improve feedback and integration of learning technologies, process optimization, and analytics in general with operations processes.

About Matthew Ellis

Matthew Ellis is the Vice President of Development for Tivoli's Service Availability & Performance Management product portfolio with IBM. This product suite enables monitoring and modeling the utilization, performance, capacity and energy-use of distributed, mainframe and virtualized platforms and associated application software. Ellis joined IBM in 2006 through the Micromuse acquisition, where he was the Vice President of Software Development.

Click here to read Part One of the Q&A with IBM VP Matthew Ellis.

Hot Topic
The Latest
The Latest 10

The Latest

AI is becoming the operating system of the enterprise. It acts as an invisible coordination layer that understands intent, connects systems, and executes work across complex SaaS environments. Previously, employees had to click through multiple systems — CRM, ERP, support tools, collaboration platforms — to complete a single task. Now, instead of navigating each application manually, they can simply state what they need to accomplish ...

In 2026, the cost of downtime or an outage is no longer just a technical inconvenience; it's a $600 billion wake up call for global businesses. As our digital ecosystems become  more interconnected, each touchpoint introduces new risks and multiplies the consequences when things go wrong. And the data is clear: aggregate downtime costs  for Global 2,000 companies have surged 50% since 2024, reaching a staggering $600 billion ...

Deloitte found that 74% of enterprises expect to deploy agentic AI solutions in the next 24 months. However, the rush to deployment is outpacing foundational work, though. Only 21% of enterprises have fully formed agent governance models in place. The result? AI agents deployed without guidance or governance begin to function as fragmented islands of complexity ...

Cloud spending is no longer viewed as a passthrough IT expense, but as a strategic financial lever that directly impacts innovation capacity, profitability and enterprise resilience, according to the CFO Cloud Cost Optimization Report from Azul ...

As AI moves from generating responses to performing actions, the need for trust increases exponentially. And as organizations enlist AI agents for increasingly sophisticated business processes, trust is going to be the single most important theme for spurring adoption. What can organizations do to build trustworthy AI agents? ...

I've spent a lot of time in the channel, and one thing I keep coming back to is this: a partner program is only as good as what it looks like in the field. Many programs look great on paper, but when a partner is in front of a customer navigating a complex hybrid environment or trying to make the case for AI-powered observability, the gap between what a vendor promises and what it actually delivers becomes very clear, very fast ...

Enterprises today operate in a real-time environment where uninterrupted access to trusted data has become a baseline expectation for users, applications and automated systems. Traditional DataOps models, built on manual effort and human triage, cannot keep pace with this always active demand. AI agents are emerging as the operational backbone, ensuring consistent data availability, reinforcing trustworthiness and enabling a level of scale that manual processes cannot achieve ...

For decades, trust in the digital workplace rested on familiar signals. We trusted faces on video calls, voices on the phone, and emails that appeared to come from people we knew. These cues felt human and intuitive. They anchored how decisions were made, approvals were granted, and access was authorized. AI-powered deepfakes have quietly broken that model ...

Cloud migration was supposed to be a one-way door. For most enterprises, it turns out it isn't. Cloud data repatriation is a real and growing trend. A new survey ... finds that 89% of organizations plan to expand their on-premises infrastructure footprint over the next two years — and 75% have already moved at least some workloads back from public cloud in the past 24 months. The findings point to a broad rethinking of where data belongs ...

Over the past few years, large language models (LLMs) have revolutionized the software industry. Given their ability to excel at multi-step reasoning, LLMs have helped enterprises streamline workflows and adapt to the unknown. However, employing such models comes with sky-high costs, latency issues, and limited flexibility. In the realm of IT operations, it is generally wiser to employ smaller, domain-specific models instead ...