5 APM Techniques to Troubleshoot Application Slow Down in Minutes
October 27, 2014

Payal Chakravarty
IBM

Share this

Applications are getting more complex by the day. First you have the various hosting platforms that your app can span across like private cloud, public cloud, your own data center.

Second, you have applications for the web being accessed through different browsers and mobile apps being accessed from several hundred different devices and various device OSs.

Third, the same app is being accessed from around the world, 24X7.

Fourth, the number of users accessing apps have grown significantly requiring rapid scalability of the app's infrastructure.

To top it all, users, today, have very little patience to deal with poor performance.

Application Performance Management (APM) tools have evolved over the last decade to cater to this complexity and yet be able to troubleshoot application performance issues quickly. Let us look at some of the key features and visualization techniques that are enabling quicker troubleshooting:

1. End User Experience Metrics sliced by different dimensions

As an app developer or app owner, the first step to troubleshooting a performance problem is to narrow the scope of it. By comparing how long it is taking a web page to load for a user using your app through Firefox on Mac vs how long it is taking for the same web page to load for a user using Chrome on iOS, you can narrow down which browser and device to troubleshoot on. You could also compare how long the response time is for a user in California vs a user in Australia when accessing the same page and executing the same transaction. By slicing and dicing response time by various dimensions like geography, browser, device, network carrier etc isolation of problem areas have become easier.

2. Code level stack traces

For every business transaction that fails or is slow, you can find out what line of code is causing the slowdown by looking at its stack trace. APM tools today show the class name, method name and exact line of source code (e.g., SQL query, line number of code in a specific browser session trace) that led to a slow request. Further, you can see the pre- and post-code deployment patterns for your apps.

3. Transaction Topologies

Today, APM tools can automatically discover your end-to-end distributed application environment in minutes, showing you a topological view of all the components that your app depends on and hence aid visual detection of bottlenecks. A few of these tools not only show an aggregated transaction topology, but also show the detailed topological mapping for single transaction instances, capturing network hops and sub-transaction nodes to help you see where the time is spent during that instance. With the evolution of big data technologies, it is now possible to capture 100% transactions instead of sampling. This ensures you will not lose out on any key business transactions that may have failed.

4. Log analytics

Searching for errors across application stacks can be a laborious task. Earlier, while troubleshooting, operators, administrators and app owners would have to look through logs from different components independently, in silos. With integrated log analytics, you can now search for errors across log files for any component in your app stack in the context of the application. For example, you can correlate errors in your app server with an error in your database that may be impacting a transaction.

5. One pane-of-glass to view health of all components in the app stack

As opposed to looking at multiple panes of glass to see details of your application's health, today, at a glance in one UI you will be able to visualize the detailed health of all your app components. Spotting the problem area is as easy as spotting a color difference. For example, key metrics — like Garbage collection statistics from your code's runtime, memory usage of your VM, space utilization of your database server, bandwidth utilization of your network, http request response times of your web requests — can all be seen in one user interface.

With the evolution of big data, improved algorithms for search and correlation, smart dashboards/visualization and diagnostic capabilities, APM tools have matured to provide insights that you could never have before, thereby cutting troubleshooting time from days to minutes.

Payal Chakravarty is Senior Product Manager for IBM Application Performance Management.

Share this

The Latest

November 06, 2024

Artificial intelligence (AI) is rapidly reshaping industries around the world. From optimizing business processes to unlocking new levels of innovation, AI is a critical driver of success for modern enterprises. As a result, business leaders — from DevOps engineers to CTOs — are under pressure to incorporate AI into their workflows to stay competitive. But the question isn't whether AI should be adopted — it's how ...

November 05, 2024

The mobile app industry continues to grow in size, complexity, and competition. Also not slowing down? Consumer expectations are rising exponentially along with the use of mobile apps. To meet these expectations, mobile teams need to take a comprehensive, holistic approach to their app experience ...

November 04, 2024

Users have become digital hoarders, saving everything they handle, including outdated reports, duplicate files and irrelevant documents that make it difficult to find critical information, slowing down systems and productivity. In digital terms, they have simply shoved the mess off their desks and into the virtual storage bins ...

November 01, 2024

Today we could be witnessing the dawn of a new age in software development, transformed by Artificial Intelligence (AI). But is AI a gateway or a precipice? Is AI in software development transformative, just the latest helpful tool, or a bunch of hype? To help with this assessment, DEVOPSdigest invited experts across the industry to comment on how AI can support the SDLC. In this epic multi-part series to be posted over the next several weeks, DEVOPSdigest will explore the advantages and disadvantages; the current state of maturity and adoption; and how AI will impact the processes, the developers, and the future of software development ...

October 31, 2024

Half of all employees are using Shadow AI (i.e. non-company issued AI tools), according to a new report by Software AG ...

October 30, 2024

On their digital transformation journey, companies are migrating more workloads to the cloud, which can incur higher costs during the process due to the higher volume of cloud resources needed ... Here are four critical components of a cloud governance framework that can help keep cloud costs under control ...

October 29, 2024

Operational resilience is an organization's ability to predict, respond to, and prevent unplanned work to drive reliable customer experiences and protect revenue. This doesn't just apply to downtime; it also covers service degradation due to latency or other factors. But make no mistake — when things go sideways, the bottom line and the customer are impacted ...

October 28, 2024

Organizations continue to struggle to generate business value with AI. Despite increased investments in AI, only 34% of AI professionals feel fully equipped with the tools necessary to meet their organization's AI goals, according to The Unmet AI Needs Surveywas conducted by DataRobot ...

October 24, 2024

High-business-impact outages are costly, and a fast MTTx (mean-time-to-detect (MTTD) and mean-time-to-resolve (MTTR)) is crucial, with 62% of businesses reporting a loss of at least $1 million per hour of downtime ...

October 23, 2024

Organizations recognize the benefits of generative AI (GenAI) yet need help to implement the infrastructure necessary to deploy it, according to The Future of AI in IT Operations: Benefits and Challenges, a new report commissioned by ScienceLogic ...