Using Analytics to Detect Application Performance Anomalies
March 04, 2014
Charley Rich

IT organizations are under more pressure than ever to deliver exceptional business performance. Further complicating the challenge is the evolving nature of information technology (IT). The rise of Big Data, mobile, cloud, and BYOD has added complexity, making it ever more challenging for IT to acquire the visibility it needs to detect anomalies.

Today, an organization’s application infrastructure typically includes Web components, messaging middleware and mainframes. Application performance is impacted by many factors coming from multiple sources—application servers, messaging protocols, virtualized systems, capacity issues and many more. Inevitably, failures in one or more of these systems occur — and IT is left to deal with the result.

Such situations are why Application Performance Management (APM) solutions exist. To be effective, an APM solution must do three things:

- Gain enough visibility to see an entire system

- Track activities through the infrastructure chain as they occur

- Correlate events—many of which might seem unrelated—in order to spot developing trends before users are impacted.

Surprisingly, a number of APM platforms fall short on one or more of these key functions.

Monitoring is Not Enough

To be sure, most APM solutions do a good job of monitoring individual applications. But monitoring is not enough. When problems arise, especially in today's complex topologies, the failure of a single application is rarely the culprit. Performance threats are usually the result of multiple issues, and catching these early with real-time analytics can prevent much larger failures from occurring. Avoiding cascading failures is essential. Ideally, IT specialists should not be in the position of putting out fires; they should be able to make sure the fire never starts. But without the necessary visibility, this is no simple task.

To properly manage today's application environment, organizations must be able to analyze the entire application chain from end to end, understanding the dependencies between the links in the chain. They must also be able to focus on early detection of abnormalities, differentiating symptom from cause rather than simply reacting to an outage. The combination of these two capabilities provides the level of assurance IT needs in its key mission: to reduce the frequency and duration of outages.
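One way to picture how dependency knowledge separates symptom from cause is the minimal sketch below. It assumes a hand-written dependency map and a set of components currently reporting problems; the component names and the `probable_root_causes` helper are hypothetical and are not taken from any APM product. The heuristic treats an unhealthy component as a probable cause only if nothing it depends on is also unhealthy.

```python
def probable_root_causes(depends_on, unhealthy):
    """Return unhealthy components whose own dependencies are all healthy.

    depends_on maps a component to the components it relies on.
    An unhealthy component with an unhealthy dependency is treated
    as a symptom of that dependency rather than a cause.
    """
    unhealthy = set(unhealthy)
    return sorted(
        component
        for component in unhealthy
        if not any(dep in unhealthy for dep in depends_on.get(component, ()))
    )


# Hypothetical application chain, from the Web tier down to the mainframe.
depends_on = {
    "web-frontend": ["app-server"],
    "app-server": ["message-broker", "database"],
    "message-broker": ["mainframe-gateway"],
    "database": [],
    "mainframe-gateway": [],
}

# Three tiers are reporting problems at the same time.
unhealthy = {"web-frontend", "app-server", "message-broker"}

print(probable_root_causes(depends_on, unhealthy))  # ['message-broker']
```

A real environment would discover the dependency map automatically and weigh far more evidence, but the idea is the same: follow the chain until the failures stop.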

End-to-end performance monitoring and analysis must embrace the entire IT environment, from .NET to mainframes. It must cover a wide range of components, from J2EE application servers and Web services to messaging middleware, brokers and even legacy applications. It must also be elastic, able to scale transparently to meet unexpected surges in demand.

Analyzing Situations with Complex Event Processing

Accomplishing the second requirement, proactive analytics rather than reactive response, requires sophisticated technology; one example is Complex Event Processing (CEP). CEP engines, along with business policies, analyze situations, or "business views," composed of multiple events and key performance indicators.
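To make the idea of a "situation" concrete, here is a minimal sketch of a rule that fires only when several different kinds of events are seen within the same time window. The `Event` and `SituationRule` classes, the event kinds and the 60-second window are all illustrative assumptions, not the API of any CEP engine, and the sketch assumes events arrive in timestamp order.

```python
from collections import deque
from dataclasses import dataclass


@dataclass(frozen=True)
class Event:
    ts: float      # seconds since some reference point
    source: str    # component that emitted the event
    kind: str      # what happened


class SituationRule:
    """Fires when every required event kind occurs within `window` seconds."""

    def __init__(self, name, required_kinds, window):
        self.name = name
        self.required = frozenset(required_kinds)
        self.window = window
        self.recent = deque()

    def feed(self, event):
        """Add an event; return the matching events if the situation holds, else None."""
        self.recent.append(event)
        # Drop events that have fallen out of the time window.
        while event.ts - self.recent[0].ts > self.window:
            self.recent.popleft()
        seen = {e.kind for e in self.recent}
        if self.required <= seen:
            return [e for e in self.recent if e.kind in self.required]
        return None


# Hypothetical policy: three individually minor events add up to a situation.
rule = SituationRule(
    "payment-slowdown",
    {"queue_depth_high", "slow_response", "cpu_high"},
    window=60,
)
stream = [
    Event(0, "mq-broker", "queue_depth_high"),
    Event(20, "app-server", "slow_response"),
    Event(45, "db-host", "cpu_high"),
]
for event in stream:
    matched = rule.feed(event)
    if matched:
        print(rule.name, "detected from", [e.source for e in matched])
```

None of the three events would necessarily justify an alert on its own; the rule reports them only when they occur together.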

Instead of raising alerts when individual events pass a threshold, the analytical approach analyzes situations. It compares application behavior against its norms, looking for anomalies that indicate potential problems. Norms are established dynamically using statistical functions such as Bollinger bands, momentum oscillators, standard deviation, velocity, fluctuation and rates of change.
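As an illustration of a dynamically established norm, the sketch below applies a Bollinger-band style check: each new sample is compared with the mean of the preceding window, plus or minus a multiple of that window's standard deviation. The `bollinger_anomalies` function, the window size, the multiplier and the latency figures are assumptions chosen for the example, not values from any product.

```python
from statistics import mean, pstdev


def bollinger_anomalies(samples, window=20, k=2.0):
    """Return (index, value) pairs that fall outside the moving band.

    The band for each sample is the mean of the previous `window`
    samples plus or minus `k` population standard deviations.
    """
    anomalies = []
    for i in range(window, len(samples)):
        recent = samples[i - window:i]
        middle = mean(recent)
        band = k * pstdev(recent)
        if abs(samples[i] - middle) > band:
            anomalies.append((i, samples[i]))
    return anomalies


# Hypothetical response times in milliseconds.
latencies = [100, 102, 98, 101, 99, 103, 97, 100, 250, 101]
print(bollinger_anomalies(latencies, window=5, k=3.0))  # [(8, 250)]
```

Because the band is recomputed from recent data, the same code adapts to a service whose normal latency is 10 ms and to one whose normal latency is 2 seconds, without a fixed threshold for either.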

This approach helps ensure that real problems, rather than transient variations (a.k.a. "false alarms"), are identified, and that readings reflect true real-time performance.
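One simple way to separate a real problem from a transient variation is to require that a condition persist before alerting. The sketch below is only an assumption about how such filtering might be done, not a description of a specific product: the hypothetical `sustained_alerts` helper takes a series of per-interval breach flags and reports only the point at which a breach has lasted for a minimum number of consecutive intervals.

```python
def sustained_alerts(breaches, min_consecutive=3):
    """Return the indices at which a run of breaches reaches min_consecutive.

    Each run produces at most one alert, so a long outage does not
    generate a new alert on every interval.
    """
    alerts = []
    run = 0
    for i, breached in enumerate(breaches):
        run = run + 1 if breached else 0
        if run == min_consecutive:
            alerts.append(i)
    return alerts


# Hypothetical per-interval results: one blip, then a sustained breach.
breaches = [False, True, False, True, True, True, True, False]
print(sustained_alerts(breaches))  # [5]
```

The single breach at index 1 is ignored, while the run starting at index 3 raises one alert once it has persisted for three intervals.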

CEP-based analytics help IT specialists quickly identify root causes instead of merely chasing symptoms. By dynamically analyzing event streams, the CEP approach can differentiate symptoms from cause, even inferring an explanation where there is signal loss.

APM solutions using real-time anomaly detection can maintain SLAs in the most demanding deployments, including payments, EFT, trading, settlement, compliance, patient data, claims processing and retail order management. They not only bring developing situations to the attention of IT staff before users are aware, but also assist in diagnosing and correcting the underlying causes quickly and efficiently.

In an era when business functions are more sophisticated, diverse, integrated and immediate than ever, analytical Application Performance Management plays an essential role for IT professionals and their customers.

Charley Rich is VP Product Management and Marketing at Nastel Technologies.
