5 APM Techniques to Troubleshoot Application Slow Down in Minutes
October 27, 2014

Payal Chakravarty
IBM

Share this

Applications are getting more complex by the day. First you have the various hosting platforms that your app can span across like private cloud, public cloud, your own data center.

Second, you have applications for the web being accessed through different browsers and mobile apps being accessed from several hundred different devices and various device OSs.

Third, the same app is being accessed from around the world, 24X7.

Fourth, the number of users accessing apps have grown significantly requiring rapid scalability of the app's infrastructure.

To top it all, users, today, have very little patience to deal with poor performance.

Application Performance Management (APM) tools have evolved over the last decade to cater to this complexity and yet be able to troubleshoot application performance issues quickly. Let us look at some of the key features and visualization techniques that are enabling quicker troubleshooting:

1. End User Experience Metrics sliced by different dimensions

As an app developer or app owner, the first step to troubleshooting a performance problem is to narrow the scope of it. By comparing how long it is taking a web page to load for a user using your app through Firefox on Mac vs how long it is taking for the same web page to load for a user using Chrome on iOS, you can narrow down which browser and device to troubleshoot on. You could also compare how long the response time is for a user in California vs a user in Australia when accessing the same page and executing the same transaction. By slicing and dicing response time by various dimensions like geography, browser, device, network carrier etc isolation of problem areas have become easier.

2. Code level stack traces

For every business transaction that fails or is slow, you can find out what line of code is causing the slowdown by looking at its stack trace. APM tools today show the class name, method name and exact line of source code (e.g., SQL query, line number of code in a specific browser session trace) that led to a slow request. Further, you can see the pre- and post-code deployment patterns for your apps.

3. Transaction Topologies

Today, APM tools can automatically discover your end-to-end distributed application environment in minutes, showing you a topological view of all the components that your app depends on and hence aid visual detection of bottlenecks. A few of these tools not only show an aggregated transaction topology, but also show the detailed topological mapping for single transaction instances, capturing network hops and sub-transaction nodes to help you see where the time is spent during that instance. With the evolution of big data technologies, it is now possible to capture 100% transactions instead of sampling. This ensures you will not lose out on any key business transactions that may have failed.

4. Log analytics

Searching for errors across application stacks can be a laborious task. Earlier, while troubleshooting, operators, administrators and app owners would have to look through logs from different components independently, in silos. With integrated log analytics, you can now search for errors across log files for any component in your app stack in the context of the application. For example, you can correlate errors in your app server with an error in your database that may be impacting a transaction.

5. One pane-of-glass to view health of all components in the app stack

As opposed to looking at multiple panes of glass to see details of your application's health, today, at a glance in one UI you will be able to visualize the detailed health of all your app components. Spotting the problem area is as easy as spotting a color difference. For example, key metrics — like Garbage collection statistics from your code's runtime, memory usage of your VM, space utilization of your database server, bandwidth utilization of your network, http request response times of your web requests — can all be seen in one user interface.

With the evolution of big data, improved algorithms for search and correlation, smart dashboards/visualization and diagnostic capabilities, APM tools have matured to provide insights that you could never have before, thereby cutting troubleshooting time from days to minutes.

Payal Chakravarty is Senior Product Manager for IBM Application Performance Management.

Share this

The Latest

January 18, 2022

As part of APMdigest's list of 2022 predictions, industry experts offer thoughtful, insightful, and often controversial predictions on how Network Performance Management (NPM) and related technologies will evolve and impact business in 2022 ...

January 13, 2022

Gartner highlighted 6 trends that infrastructure and operations (I&O) leaders must start preparing for in the next 12-18 months ...

January 11, 2022

Technology is now foundational to financial companies' operations with many institutions relying on tech to deliver critical services. As a result, uptime is essential to customer satisfaction and company success, and systems must be subject to continuous monitoring. But modern IT architectures are disparate, complex and interconnected, and the data is too voluminous for the human mind to handle. Enter AIOps ...

January 11, 2022

Having a variety of tools to choose from creates challenges in telemetry data collection. Organizations find themselves managing multiple libraries for logging, metrics, and traces, with each vendor having its own APIs, SDKs, agents, and collectors. An open source, community-driven approach to observability will gain steam in 2022 to remove unnecessary complications by tapping into the latest advancements in observability practice ...

January 10, 2022

These are the trends that will set up your engineers and developers to deliver amazing software that powers amazing digital experiences that fuel your organization's growth in 2022 — and beyond ...

January 06, 2022

In a world where digital services have become a critical part of how we go about our daily lives, the risk of undergoing an outage has become even more significant. Outages can range in severity and impact companies of every size — while outages from larger companies in the social media space or a cloud provider tend to receive a lot of coverage, application downtime from even the most targeted companies can disrupt users' personal and business operations ...

January 05, 2022

Move fast and break things: A phrase that has been a rallying cry for many SREs and DevOps practitioners. After all, these teams are charged with delivering rapid and unceasing innovation to wow customers and keep pace with competitors. But today's society doesn't tolerate broken things (aka downtime). So, what if you can move fast and not break things? Or at least, move fast and rapidly identify or even predict broken things? It's high time to rethink the old rallying cry, and with AI and observability working in tandem, it's possible ...

January 04, 2022

AIOps is still relatively new compared to existing technologies such as enterprise data warehouses, and early on many AIOps projects suffered hiccups, the aftereffects of which are still felt today. That's why, for some IT Ops teams and leaders, the prospect of transforming their IT operations using AIOps is a cause for concern ...

December 16, 2021

This year is the first time APMdigest is posting a separate list of Remote Work Predictions. Due to the drastic changes in the way we work and do business since the COVID pandemic started, and how significantly these changes have impacted IT operations, APMdigest asked industry experts — from analysts and consultants to users and the top vendors — how they think the work from home (WFH) revolution will evolve into 2022, with a special focus on IT operations and performance. Here are some very interesting and insightful predictions that may change what you think about the future of work and IT ...

December 15, 2021

Industry experts offer thoughtful, insightful, and often controversial predictions on how APM, AIOps, Observability, OpenTelemetry, and related technologies will evolve and impact business in 2022. Part 6 covers the user experience ...