Skip to main content

Q&A Part Two: IBM Talks About Predictive Analytics

Pete Goldin
Editor and Publisher
APMdigest

In Part Two of APMdigest's exclusive interview, Matthew Ellis, IBM Vice President of Service Availability and Performance, discusses predictive analytics.

Click here to start with Part One of the Q&A with IBM VP Matthew Ellis.

APM: Why is predictive analytics gaining so much momentum recently, especially with respect to APM?

ME: Analytics is important to all phases of operations. In all areas of business it is axiomatic that more data enables better decisions, and operations and application management are no exceptions.

Just as important, however, is sorting that data to identify the critical context for decision makers to act on, and this is where analytics come in.

IBM is investing in analytics very seriously, and from an operations management perspective, we apply analytics in three categories: Simplify Operations Management, Avoid Business Disruption, and Enable Optimization.

Simplify Operations Management is a class of analytics technology that enables our customers to do the work that they do today more easily. This includes historical analysis of data to recommend and establish dynamic thresholds, and trending of performance and capacity data to identify areas that may become bottlenecks based on historical behavior.

Avoid Business Disruption is the key driver for the predictive analytics component. The goal is early identification of environmental changes that indicate a significant change in the behavior of an application or service, and to bring this information to the attention of the operations management team so that problems can be identified and addressed before they ever impact a customer. We have identified emerging problems days before traditional management tools saw signs of trouble and in some situations, discovered problems in unmonitored resources that were affecting the behavior of critical applications.

Enable Optimization is the ability to mine collected data across multiple dimensions enabling insight and optimization of services and applications by enabling rich insight. It is also known as business analytics.

APM: What specific functionality should an organization look for in predictive analytics technology?

ME: At IBM, we believe there are three key capabilities that any analytics solution must have to provide maximum predictive capability:

1. Algorithms: Multivariate Analytic techniques are critical to identifying emerging problems early, while all metric data is still well within their normal range.

The key to this statistical approach is to monitor the relationships of important related data metrics and raise an exception when the relationships of data change in significant ways. Any single metric displays a wide range of variability during a normal day, increasing and decreasing with changing workloads, and daily, weekly and seasonal behavior.

In general, however, related metrics will follow the same pattern all the time in a healthy system. Successfully identifying these relationships, and accurately determining when these relationships diverge in an important way is key to accurate early identification of problems.

Our algorithms are developed and refined by one of the largest private math departments in the world; the same organization that developed Watson to win at Jeopardy.

2. Scalability: Analytics solutions work better when they have more data upon which to base their conclusions. The IBM analytics solutions directly leverage proven data collection technologies that have been in use for most of a decade and have seen continual refinement. This capability is proven to be able to collect millions of data points per second, and deliver that data to the analytics engine with very low latency offering real-time evaluation of very large data streams. We believe that the data collection technology we are using is the most scalable and high performance in the industry.

3. Breadth of Monitored Resources: One of our design requirements was to deliver an easily extensible mediation capability allowing customers (or our services teams) to connect any data source to our data collection solution in a matter of hours or days.

During our pilot, we have worked with many products from non-IBM vendors and our team has found that almost all data integration work can be done in a very short time without ever requiring a visit to the customer site, saving time and money while maximizing data availability for analysis.

APM: How do you see Predictive Analytics evolving over the next few years?

ME: IBM expects that analytics tools, and the organizations that use them, will evolve rapidly over the next few years. IBM is investing heavily in providing highly scalable, flexible, and robust systems for identifying emerging problems as early as possible.

We expect analytics to evolve along multiple dimensions:

1. Improvements in analytics learning and data exchange with existing application and service discovery, topology, and CMDB data to combine the strengths of traditional IT tools with analytics learning solutions. This will accelerate the statistical learning process and allow the learned relationships to be built back into the visible topology of the environment.

2. Apply analytics solutions to additional IT management domains to include Smarter Infrastructures, improved detection of security problems, asset management and maintenance scheduling and additional problems

3. Further improve feedback and integration of learning technologies, process optimization, and analytics in general with operations processes.

About Matthew Ellis

Matthew Ellis is the Vice President of Development for Tivoli's Service Availability & Performance Management product portfolio with IBM. This product suite enables monitoring and modeling the utilization, performance, capacity and energy-use of distributed, mainframe and virtualized platforms and associated application software. Ellis joined IBM in 2006 through the Micromuse acquisition, where he was the Vice President of Software Development.

Click here to read Part One of the Q&A with IBM VP Matthew Ellis.

Hot Topic
The Latest
The Latest 10

The Latest

According to Auvik's 2025 IT Trends Report, 60% of IT professionals feel at least moderately burned out on the job, with 43% stating that their workload is contributing to work stress. At the same time, many IT professionals are naming AI and machine learning as key areas they'd most like to upskill ...

Businesses that face downtime or outages risk financial and reputational damage, as well as reducing partner, shareholder, and customer trust. One of the major challenges that enterprises face is implementing a robust business continuity plan. What's the solution? The answer may lie in disaster recovery tactics such as truly immutable storage and regular disaster recovery testing ...

IT spending is expected to jump nearly 10% in 2025, and organizations are now facing pressure to manage costs without slowing down critical functions like observability. To meet the challenge, leaders are turning to smarter, more cost effective business strategies. Enter stage right: OpenTelemetry, the missing piece of the puzzle that is no longer just an option but rather a strategic advantage ...

Amidst the threat of cyberhacks and data breaches, companies install several security measures to keep their business safely afloat. These measures aim to protect businesses, employees, and crucial data. Yet, employees perceive them as burdensome. Frustrated with complex logins, slow access, and constant security checks, workers decide to completely bypass all security set-ups ...

Image
Cloudbrink's Personal SASE services provide last-mile acceleration and reduction in latency

In MEAN TIME TO INSIGHT Episode 13, Shamus McGillicuddy, VP of Research, Network Infrastructure and Operations, at EMA discusses hybrid multi-cloud networking strategy ... 

In high-traffic environments, the sheer volume and unpredictable nature of network incidents can quickly overwhelm even the most skilled teams, hindering their ability to react swiftly and effectively, potentially impacting service availability and overall business performance. This is where closed-loop remediation comes into the picture: an IT management concept designed to address the escalating complexity of modern networks ...

In 2025, enterprise workflows are undergoing a seismic shift. Propelled by breakthroughs in generative AI (GenAI), large language models (LLMs), and natural language processing (NLP), a new paradigm is emerging — agentic AI. This technology is not just automating tasks; it's reimagining how organizations make decisions, engage customers, and operate at scale ...

In the early days of the cloud revolution, business leaders perceived cloud services as a means of sidelining IT organizations. IT was too slow, too expensive, or incapable of supporting new technologies. With a team of developers, line of business managers could deploy new applications and services in the cloud. IT has been fighting to retake control ever since. Today, IT is back in the driver's seat, according to new research by Enterprise Management Associates (EMA) ...

In today's fast-paced and increasingly complex network environments, Network Operations Centers (NOCs) are the backbone of ensuring continuous uptime, smooth service delivery, and rapid issue resolution. However, the challenges faced by NOC teams are only growing. In a recent study, 78% state network complexity has grown significantly over the last few years while 84% regularly learn about network issues from users. It is imperative we adopt a new approach to managing today's network experiences ...

Image
Broadcom

From growing reliance on FinOps teams to the increasing attention on artificial intelligence (AI), and software licensing, the Flexera 2025 State of the Cloud Report digs into how organizations are improving cloud spend efficiency, while tackling the complexities of emerging technologies ...

Q&A Part Two: IBM Talks About Predictive Analytics

Pete Goldin
Editor and Publisher
APMdigest

In Part Two of APMdigest's exclusive interview, Matthew Ellis, IBM Vice President of Service Availability and Performance, discusses predictive analytics.

Click here to start with Part One of the Q&A with IBM VP Matthew Ellis.

APM: Why is predictive analytics gaining so much momentum recently, especially with respect to APM?

ME: Analytics is important to all phases of operations. In all areas of business it is axiomatic that more data enables better decisions, and operations and application management are no exceptions.

Just as important, however, is sorting that data to identify the critical context for decision makers to act on, and this is where analytics come in.

IBM is investing in analytics very seriously, and from an operations management perspective, we apply analytics in three categories: Simplify Operations Management, Avoid Business Disruption, and Enable Optimization.

Simplify Operations Management is a class of analytics technology that enables our customers to do the work that they do today more easily. This includes historical analysis of data to recommend and establish dynamic thresholds, and trending of performance and capacity data to identify areas that may become bottlenecks based on historical behavior.

Avoid Business Disruption is the key driver for the predictive analytics component. The goal is early identification of environmental changes that indicate a significant change in the behavior of an application or service, and to bring this information to the attention of the operations management team so that problems can be identified and addressed before they ever impact a customer. We have identified emerging problems days before traditional management tools saw signs of trouble and in some situations, discovered problems in unmonitored resources that were affecting the behavior of critical applications.

Enable Optimization is the ability to mine collected data across multiple dimensions enabling insight and optimization of services and applications by enabling rich insight. It is also known as business analytics.

APM: What specific functionality should an organization look for in predictive analytics technology?

ME: At IBM, we believe there are three key capabilities that any analytics solution must have to provide maximum predictive capability:

1. Algorithms: Multivariate Analytic techniques are critical to identifying emerging problems early, while all metric data is still well within their normal range.

The key to this statistical approach is to monitor the relationships of important related data metrics and raise an exception when the relationships of data change in significant ways. Any single metric displays a wide range of variability during a normal day, increasing and decreasing with changing workloads, and daily, weekly and seasonal behavior.

In general, however, related metrics will follow the same pattern all the time in a healthy system. Successfully identifying these relationships, and accurately determining when these relationships diverge in an important way is key to accurate early identification of problems.

Our algorithms are developed and refined by one of the largest private math departments in the world; the same organization that developed Watson to win at Jeopardy.

2. Scalability: Analytics solutions work better when they have more data upon which to base their conclusions. The IBM analytics solutions directly leverage proven data collection technologies that have been in use for most of a decade and have seen continual refinement. This capability is proven to be able to collect millions of data points per second, and deliver that data to the analytics engine with very low latency offering real-time evaluation of very large data streams. We believe that the data collection technology we are using is the most scalable and high performance in the industry.

3. Breadth of Monitored Resources: One of our design requirements was to deliver an easily extensible mediation capability allowing customers (or our services teams) to connect any data source to our data collection solution in a matter of hours or days.

During our pilot, we have worked with many products from non-IBM vendors and our team has found that almost all data integration work can be done in a very short time without ever requiring a visit to the customer site, saving time and money while maximizing data availability for analysis.

APM: How do you see Predictive Analytics evolving over the next few years?

ME: IBM expects that analytics tools, and the organizations that use them, will evolve rapidly over the next few years. IBM is investing heavily in providing highly scalable, flexible, and robust systems for identifying emerging problems as early as possible.

We expect analytics to evolve along multiple dimensions:

1. Improvements in analytics learning and data exchange with existing application and service discovery, topology, and CMDB data to combine the strengths of traditional IT tools with analytics learning solutions. This will accelerate the statistical learning process and allow the learned relationships to be built back into the visible topology of the environment.

2. Apply analytics solutions to additional IT management domains to include Smarter Infrastructures, improved detection of security problems, asset management and maintenance scheduling and additional problems

3. Further improve feedback and integration of learning technologies, process optimization, and analytics in general with operations processes.

About Matthew Ellis

Matthew Ellis is the Vice President of Development for Tivoli's Service Availability & Performance Management product portfolio with IBM. This product suite enables monitoring and modeling the utilization, performance, capacity and energy-use of distributed, mainframe and virtualized platforms and associated application software. Ellis joined IBM in 2006 through the Micromuse acquisition, where he was the Vice President of Software Development.

Click here to read Part One of the Q&A with IBM VP Matthew Ellis.

Hot Topic
The Latest
The Latest 10

The Latest

According to Auvik's 2025 IT Trends Report, 60% of IT professionals feel at least moderately burned out on the job, with 43% stating that their workload is contributing to work stress. At the same time, many IT professionals are naming AI and machine learning as key areas they'd most like to upskill ...

Businesses that face downtime or outages risk financial and reputational damage, as well as reducing partner, shareholder, and customer trust. One of the major challenges that enterprises face is implementing a robust business continuity plan. What's the solution? The answer may lie in disaster recovery tactics such as truly immutable storage and regular disaster recovery testing ...

IT spending is expected to jump nearly 10% in 2025, and organizations are now facing pressure to manage costs without slowing down critical functions like observability. To meet the challenge, leaders are turning to smarter, more cost effective business strategies. Enter stage right: OpenTelemetry, the missing piece of the puzzle that is no longer just an option but rather a strategic advantage ...

Amidst the threat of cyberhacks and data breaches, companies install several security measures to keep their business safely afloat. These measures aim to protect businesses, employees, and crucial data. Yet, employees perceive them as burdensome. Frustrated with complex logins, slow access, and constant security checks, workers decide to completely bypass all security set-ups ...

Image
Cloudbrink's Personal SASE services provide last-mile acceleration and reduction in latency

In MEAN TIME TO INSIGHT Episode 13, Shamus McGillicuddy, VP of Research, Network Infrastructure and Operations, at EMA discusses hybrid multi-cloud networking strategy ... 

In high-traffic environments, the sheer volume and unpredictable nature of network incidents can quickly overwhelm even the most skilled teams, hindering their ability to react swiftly and effectively, potentially impacting service availability and overall business performance. This is where closed-loop remediation comes into the picture: an IT management concept designed to address the escalating complexity of modern networks ...

In 2025, enterprise workflows are undergoing a seismic shift. Propelled by breakthroughs in generative AI (GenAI), large language models (LLMs), and natural language processing (NLP), a new paradigm is emerging — agentic AI. This technology is not just automating tasks; it's reimagining how organizations make decisions, engage customers, and operate at scale ...

In the early days of the cloud revolution, business leaders perceived cloud services as a means of sidelining IT organizations. IT was too slow, too expensive, or incapable of supporting new technologies. With a team of developers, line of business managers could deploy new applications and services in the cloud. IT has been fighting to retake control ever since. Today, IT is back in the driver's seat, according to new research by Enterprise Management Associates (EMA) ...

In today's fast-paced and increasingly complex network environments, Network Operations Centers (NOCs) are the backbone of ensuring continuous uptime, smooth service delivery, and rapid issue resolution. However, the challenges faced by NOC teams are only growing. In a recent study, 78% state network complexity has grown significantly over the last few years while 84% regularly learn about network issues from users. It is imperative we adopt a new approach to managing today's network experiences ...

Image
Broadcom

From growing reliance on FinOps teams to the increasing attention on artificial intelligence (AI), and software licensing, the Flexera 2025 State of the Cloud Report digs into how organizations are improving cloud spend efficiency, while tackling the complexities of emerging technologies ...