Skip to main content

5 APM Techniques to Troubleshoot Application Slow Down in Minutes

Payal Chakravarty

Applications are getting more complex by the day. First you have the various hosting platforms that your app can span across like private cloud, public cloud, your own data center.

Second, you have applications for the web being accessed through different browsers and mobile apps being accessed from several hundred different devices and various device OSs.

Third, the same app is being accessed from around the world, 24X7.

Fourth, the number of users accessing apps have grown significantly requiring rapid scalability of the app's infrastructure.

To top it all, users, today, have very little patience to deal with poor performance.

Application Performance Management (APM) tools have evolved over the last decade to cater to this complexity and yet be able to troubleshoot application performance issues quickly. Let us look at some of the key features and visualization techniques that are enabling quicker troubleshooting:

1. End User Experience Metrics sliced by different dimensions

As an app developer or app owner, the first step to troubleshooting a performance problem is to narrow the scope of it. By comparing how long it is taking a web page to load for a user using your app through Firefox on Mac vs how long it is taking for the same web page to load for a user using Chrome on iOS, you can narrow down which browser and device to troubleshoot on. You could also compare how long the response time is for a user in California vs a user in Australia when accessing the same page and executing the same transaction. By slicing and dicing response time by various dimensions like geography, browser, device, network carrier etc isolation of problem areas have become easier.

2. Code level stack traces

For every business transaction that fails or is slow, you can find out what line of code is causing the slowdown by looking at its stack trace. APM tools today show the class name, method name and exact line of source code (e.g., SQL query, line number of code in a specific browser session trace) that led to a slow request. Further, you can see the pre- and post-code deployment patterns for your apps.

3. Transaction Topologies

Today, APM tools can automatically discover your end-to-end distributed application environment in minutes, showing you a topological view of all the components that your app depends on and hence aid visual detection of bottlenecks. A few of these tools not only show an aggregated transaction topology, but also show the detailed topological mapping for single transaction instances, capturing network hops and sub-transaction nodes to help you see where the time is spent during that instance. With the evolution of big data technologies, it is now possible to capture 100% transactions instead of sampling. This ensures you will not lose out on any key business transactions that may have failed.

4. Log analytics

Searching for errors across application stacks can be a laborious task. Earlier, while troubleshooting, operators, administrators and app owners would have to look through logs from different components independently, in silos. With integrated log analytics, you can now search for errors across log files for any component in your app stack in the context of the application. For example, you can correlate errors in your app server with an error in your database that may be impacting a transaction.

5. One pane-of-glass to view health of all components in the app stack

As opposed to looking at multiple panes of glass to see details of your application's health, today, at a glance in one UI you will be able to visualize the detailed health of all your app components. Spotting the problem area is as easy as spotting a color difference. For example, key metrics — like Garbage collection statistics from your code's runtime, memory usage of your VM, space utilization of your database server, bandwidth utilization of your network, http request response times of your web requests — can all be seen in one user interface.

With the evolution of big data, improved algorithms for search and correlation, smart dashboards/visualization and diagnostic capabilities, APM tools have matured to provide insights that you could never have before, thereby cutting troubleshooting time from days to minutes.

Payal Chakravarty is Senior Product Manager for IBM Application Performance Management.

The Latest

AI is the catalyst for significant investment in data teams as enterprises require higher-quality data to power their AI applications, according to the State of Analytics Engineering Report from dbt Labs ...

Misaligned architecture can lead to business consequences, with 93% of respondents reporting negative outcomes such as service disruptions, high operational costs and security challenges ...

A Gartner analyst recently suggested that GenAI tools could create 25% time savings for network operational teams. Where might these time savings come from? How are GenAI tools helping NetOps teams today, and what other tasks might they take on in the future as models continue improving? In general, these savings come from automating or streamlining manual NetOps tasks ...

IT and line-of-business teams are increasingly aligned in their efforts to close the data gap and drive greater collaboration to alleviate IT bottlenecks and offload growing demands on IT teams, according to The 2025 Automation Benchmark Report: Insights from IT Leaders on Enterprise Automation & the Future of AI-Driven Businesses from Jitterbit ...

A large majority (86%) of data management and AI decision makers cite protecting data privacy as a top concern, with 76% of respondents citing ROI on data privacy and AI initiatives across their organization, according to a new Harris Poll from Collibra ...

According to Gartner, Inc. the following six trends will shape the future of cloud over the next four years, ultimately resulting in new ways of working that are digital in nature and transformative in impact ...

2020 was the equivalent of a wedding with a top-shelf open bar. As businesses scrambled to adjust to remote work, digital transformation accelerated at breakneck speed. New software categories emerged overnight. Tech stacks ballooned with all sorts of SaaS apps solving ALL the problems — often with little oversight or long-term integration planning, and yes frequently a lot of duplicated functionality ... But now the music's faded. The lights are on. Everyone from the CIO to the CFO is checking the bill. Welcome to the Great SaaS Hangover ...

Regardless of OpenShift being a scalable and flexible software, it can be a pain to monitor since complete visibility into the underlying operations is not guaranteed ... To effectively monitor an OpenShift environment, IT administrators should focus on these five key elements and their associated metrics ...

An overwhelming majority of IT leaders (95%) believe the upcoming wave of AI-powered digital transformation is set to be the most impactful and intensive seen thus far, according to The Science of Productivity: AI, Adoption, And Employee Experience, a new report from Nexthink ...

Overall outage frequency and the general level of reported severity continue to decline, according to the Outage Analysis 2025 from Uptime Institute. However, cyber security incidents are on the rise and often have severe, lasting impacts ...

5 APM Techniques to Troubleshoot Application Slow Down in Minutes

Payal Chakravarty

Applications are getting more complex by the day. First you have the various hosting platforms that your app can span across like private cloud, public cloud, your own data center.

Second, you have applications for the web being accessed through different browsers and mobile apps being accessed from several hundred different devices and various device OSs.

Third, the same app is being accessed from around the world, 24X7.

Fourth, the number of users accessing apps have grown significantly requiring rapid scalability of the app's infrastructure.

To top it all, users, today, have very little patience to deal with poor performance.

Application Performance Management (APM) tools have evolved over the last decade to cater to this complexity and yet be able to troubleshoot application performance issues quickly. Let us look at some of the key features and visualization techniques that are enabling quicker troubleshooting:

1. End User Experience Metrics sliced by different dimensions

As an app developer or app owner, the first step to troubleshooting a performance problem is to narrow the scope of it. By comparing how long it is taking a web page to load for a user using your app through Firefox on Mac vs how long it is taking for the same web page to load for a user using Chrome on iOS, you can narrow down which browser and device to troubleshoot on. You could also compare how long the response time is for a user in California vs a user in Australia when accessing the same page and executing the same transaction. By slicing and dicing response time by various dimensions like geography, browser, device, network carrier etc isolation of problem areas have become easier.

2. Code level stack traces

For every business transaction that fails or is slow, you can find out what line of code is causing the slowdown by looking at its stack trace. APM tools today show the class name, method name and exact line of source code (e.g., SQL query, line number of code in a specific browser session trace) that led to a slow request. Further, you can see the pre- and post-code deployment patterns for your apps.

3. Transaction Topologies

Today, APM tools can automatically discover your end-to-end distributed application environment in minutes, showing you a topological view of all the components that your app depends on and hence aid visual detection of bottlenecks. A few of these tools not only show an aggregated transaction topology, but also show the detailed topological mapping for single transaction instances, capturing network hops and sub-transaction nodes to help you see where the time is spent during that instance. With the evolution of big data technologies, it is now possible to capture 100% transactions instead of sampling. This ensures you will not lose out on any key business transactions that may have failed.

4. Log analytics

Searching for errors across application stacks can be a laborious task. Earlier, while troubleshooting, operators, administrators and app owners would have to look through logs from different components independently, in silos. With integrated log analytics, you can now search for errors across log files for any component in your app stack in the context of the application. For example, you can correlate errors in your app server with an error in your database that may be impacting a transaction.

5. One pane-of-glass to view health of all components in the app stack

As opposed to looking at multiple panes of glass to see details of your application's health, today, at a glance in one UI you will be able to visualize the detailed health of all your app components. Spotting the problem area is as easy as spotting a color difference. For example, key metrics — like Garbage collection statistics from your code's runtime, memory usage of your VM, space utilization of your database server, bandwidth utilization of your network, http request response times of your web requests — can all be seen in one user interface.

With the evolution of big data, improved algorithms for search and correlation, smart dashboards/visualization and diagnostic capabilities, APM tools have matured to provide insights that you could never have before, thereby cutting troubleshooting time from days to minutes.

Payal Chakravarty is Senior Product Manager for IBM Application Performance Management.

The Latest

AI is the catalyst for significant investment in data teams as enterprises require higher-quality data to power their AI applications, according to the State of Analytics Engineering Report from dbt Labs ...

Misaligned architecture can lead to business consequences, with 93% of respondents reporting negative outcomes such as service disruptions, high operational costs and security challenges ...

A Gartner analyst recently suggested that GenAI tools could create 25% time savings for network operational teams. Where might these time savings come from? How are GenAI tools helping NetOps teams today, and what other tasks might they take on in the future as models continue improving? In general, these savings come from automating or streamlining manual NetOps tasks ...

IT and line-of-business teams are increasingly aligned in their efforts to close the data gap and drive greater collaboration to alleviate IT bottlenecks and offload growing demands on IT teams, according to The 2025 Automation Benchmark Report: Insights from IT Leaders on Enterprise Automation & the Future of AI-Driven Businesses from Jitterbit ...

A large majority (86%) of data management and AI decision makers cite protecting data privacy as a top concern, with 76% of respondents citing ROI on data privacy and AI initiatives across their organization, according to a new Harris Poll from Collibra ...

According to Gartner, Inc. the following six trends will shape the future of cloud over the next four years, ultimately resulting in new ways of working that are digital in nature and transformative in impact ...

2020 was the equivalent of a wedding with a top-shelf open bar. As businesses scrambled to adjust to remote work, digital transformation accelerated at breakneck speed. New software categories emerged overnight. Tech stacks ballooned with all sorts of SaaS apps solving ALL the problems — often with little oversight or long-term integration planning, and yes frequently a lot of duplicated functionality ... But now the music's faded. The lights are on. Everyone from the CIO to the CFO is checking the bill. Welcome to the Great SaaS Hangover ...

Regardless of OpenShift being a scalable and flexible software, it can be a pain to monitor since complete visibility into the underlying operations is not guaranteed ... To effectively monitor an OpenShift environment, IT administrators should focus on these five key elements and their associated metrics ...

An overwhelming majority of IT leaders (95%) believe the upcoming wave of AI-powered digital transformation is set to be the most impactful and intensive seen thus far, according to The Science of Productivity: AI, Adoption, And Employee Experience, a new report from Nexthink ...

Overall outage frequency and the general level of reported severity continue to decline, according to the Outage Analysis 2025 from Uptime Institute. However, cyber security incidents are on the rise and often have severe, lasting impacts ...