In the world of Application Performance Management (APM) it is always better to enlist more than one entity to help solve the mystery of performance problems.
It's kind of like arriving at the scene of the crime on foreign soil, being blindfolded, shoved out the door, and then asked to help solve the injustice without any insight. All you can do is begin by asking people in the vicinity, providing you speak their language, for information on what they have seen (i.e. end-user-experience).
Gathering facts related to a crime is essential, and can be likened to utilizing an APM solution for solving application performance problems. The more information about an application’s behavior that you can obtain, along with understanding its idiosyncrasies within the environment, the more likely you will be able to pinpoint root causes of performance issues.
The Three People You Need
Wouldn't it be helpful if there was an eye witness you could interview, a watchman who was on duty during the time of the incident, and an agent you could hire to translate the native tongue and provide insight into the culture?
In much the same way, a smart APM strategy enlists the help from these three entities: the Witness, the Watchman, and the Agent. You start by listening to the testimony from the eye witness (aka. wire data), collecting the observations from the watchman (aka. web robots), and analyzing details from the agent (aka. code level instrumentation).
Passive monitoring, wire-data analytics
The Witness reports what they see within their field of vision, (aka. passive monitoring, wire-data analytics). The Witness is watching everything in their purview and sees things as they happen, which corresponds to what is coming across "the wire" in front of them.
The Witness will tell you how many people were involved, if anyone was injured, and what time the event occurred, (e.g. user names, packet loss, timelines, etc.). She can tell you what doors the people went through, how wide the aisles were, and how fast people were traveling, (e.g. network port listeners, realized bandwidth, round-trip-time, etc.).
Active monitoring - synthetic transactions
The Watchman (aka. web robot) is actively checking and is always on patrol, methodically taking the same path every time. He will tell you what doors are locked and monitor the ones that are open, collecting measurements along the way on how long it takes to complete his rounds, (i.e. synthetic transactions).
The Watchman will report the status of the rooms and buildings on his patrol and will note if anything happens to him along the way, (e.g. application availability, transaction errors, timeouts, etc.).
Application code instrumentation
The Agent you hire is critical for solving the crime within the territory you're operating in. The Agent will watch activity from specific vantage points throughout the environment and report back his findings. It's crucial he speaks the local language, (e.g. Java, .Net, PHP) and can easily translate for you.
His approach will be to deploy probes on rooftops and inside the buildings for monitoring all conversations and actions in the environment, (aka. application code instrumentation). He will also tap the communication systems, (i.e. script injection) when appropriate and capture specific measurements from each conversation and record them.
Going from Red to Green
Identifying an application that has gone catatonic is one thing, but assessing the insidious slow performance of a complex multi-tiered application and fixing it, can be very time consuming and costly. Enlisting all three entities described above to assist is a thoughtful strategy for any IT Leader to consider.
Based on eye witness testimony, the forensics collected, and the conversations recorded, you will be well on your way to providing an accurate account of what has transpired and why, (i.e. root cause analysis).
Remember, the end-user is the supreme judge in this case and if performance is chronically slow, your sentence could be harsh. Either directly by inundating you with complaints creating bad press or indirectly by abandoning your site in favor of one that is much faster and more intuitive to use.
Embracing a smart but simple APM Methodology within your environment may be the only thing that exonerates you when the verdict for your slow application is "guilty as charged."
In order to properly sort through all monitoring noise and identify true problems, their causes, and to prioritize them for response by the IT team, they have created and built a revolutionary new system using a meta-cognitive model ...
As we shift further into a digital-first world, where having a reliable online experience becomes more essential, Site Reliability Engineers remain in-demand among organizations of all sizes ... This diverse set of skills and values can be difficult to interview for. In this blog, we'll get you started with some example questions and processes to find your ideal SRE ...
US government agencies are bringing more of their employees back into the office and implementing hybrid work schedules, but federal workers are worried that their agencies' IT architectures aren't built to handle the "new normal." They fear that the reactive, manual methods used by the current systems in dealing with user, IT architecture and application problems will degrade the user experience and negatively affect productivity. In fact, according to a recent survey, many federal employees are concerned that they won't work as effectively back in the office as they did at home ...
Users today expect a seamless, uninterrupted experience when interacting with their web and mobile apps. Their expectations have continued to grow in tandem with their appetite for new features and consistent updates. Mobile apps have responded by increasing their release cadence by up to 40%, releasing a new full version of their app every 4-5 days, as determined in this year's SmartBear State of Software Quality | Application Stability Index report ...
In this second part of the blog series, we look at how adopting AIOps capabilities can drive business value for an organization ...
ITOPS and DevOps is in the midst of a surge of innovation. New devices and new systems are appearing at an unprecedented rate. There are many drivers of this phenomenon, from virtualization and containerization of applications and services to the need for improved security and the proliferation of 5G and IOT devices. The interconnectedness and the interdependencies of these technologies also greatly increase systems complexity and therefore increase the sheer volume of things that need to be integrated, monitored, and maintained ...
IT talent acquisition challenges are now heavily influencing technology investment decisions, according to new research from Salesforce's MuleSoft. The 2022 IT Leaders Pulse Report reveals that almost three quarters (73%) of senior IT leaders agree that acquiring IT talent has never been harder, and nearly all (98%) respondents say attracting IT talent influences their organization's technology investment choices ...
The findings of the 2022 Observability Forecast offer a detailed view of how this practice is shaping engineering and the technologies of the future. Here are 10 key takeaways from the forecast ...
Data professionals are spending 40% of their time evaluating or checking data quality and that poor data quality impacts 26% of their companies' revenue, according to The State of Data Quality 2022, a report commissioned by Monte Carlo and conducted by Wakefield Research ...