APMdigest asked experts from across the IT industry for their opinions on what IT departments should be monitoring to ensure digital performance. Part 3 covers the development side.
Code-level issues are a common cause of application slowness and have fueled the need for distributed transaction tracing, which can help isolate the exact line of code with errors. This type of monitoring can also be effectively applied in both pre- and post-production environments, enabling us to prevent performance issues before they impact end users as well as help isolate them when they do occur.
When this type of application monitoring is done in the context of infrastructure dependencies, it helps determine whether other issues are affecting application code processing, such as a bottleneck in the application server, long-running database queries, slow third-party calls, or other problems in the application ecosystem. Applications are the heart of IT workloads, and application performance monitoring is critical to effectively ensure the performance of digital services.
Director, Product Marketing, eG Innovations
Digital performance is complex and can be measured in many ways, but one critical consideration is how well does the application do what it is supposed to do? Is it meeting a functional performance metric for customer expectations? To ensure this, organizations need to look at the "fingerprint" of each error in code to discern its importance as well as look at the number of critical errors per release. This dictates the overall functional reliability of the code. It also requires you to be code-aware, monitoring from inside the application at runtime, not surrounding it or listening to the exhaust.
CTO and Co-Founder, OverOps
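One way to picture an error "fingerprint" is as a stable hash that groups repeated occurrences of the same failure. The sketch below is only an illustration under assumed rules, not any vendor's method: the `error_fingerprint` and `count_by_fingerprint` helpers are hypothetical, and hashing the exception type plus the file and function of each stack frame is an assumption made for the example.

```python
import hashlib
import traceback
from collections import Counter


def error_fingerprint(exc: BaseException) -> str:
    """Hash the exception type plus the (file, function) of each stack frame.

    Messages and line numbers are left out on purpose so that the same
    failure with different input values maps to the same fingerprint.
    """
    frames = traceback.extract_tb(exc.__traceback__)
    parts = [type(exc).__name__] + [f"{f.filename}:{f.name}" for f in frames]
    return hashlib.sha256("|".join(parts).encode()).hexdigest()[:12]


def parse_quantity(raw: str) -> int:
    """Hypothetical application function that fails on bad input."""
    return int(raw)


def count_by_fingerprint(inputs) -> Counter:
    """Run the function over the inputs and count errors per fingerprint."""
    counts: Counter = Counter()
    for raw in inputs:
        try:
            parse_quantity(raw)
        except ValueError as exc:
            counts[error_fingerprint(exc)] += 1
    return counts


counts = count_by_fingerprint(["x", "y", "3", "z"])
```

Here three bad inputs collapse into a single fingerprint with a count of three, which is the kind of grouping that makes it possible to track critical errors per release.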
Most people already know to monitor the obvious things, like total latency to response. But my favorite monitor comes from Anatoly Mikhaylov's talk at DASH this year. He spoke about finding massive infrastructure costs hidden in error codes. Adding APM monitoring to the errors in your endpoints can show costs you wouldn't otherwise see.
APM Developer Advocate, Datadog
When automating your application release, it's important to remember what you need to monitor. This will allow you to go as fast as possible, but also make sure you are doing it efficiently. Monitoring your lead time, success vs. failure rate, and mean time to recovery will ensure you focus on value rather than on effort.
Co-Founder and CTO, DBmaestro
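The three release metrics named above reduce to simple arithmetic once each release is recorded with a few timestamps. The following is a minimal sketch; the `Release` record and its field names are assumptions made for illustration, not part of any particular release tool.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Optional


@dataclass
class Release:
    # Hypothetical record; field names are assumptions for this example.
    committed_at: datetime
    deployed_at: datetime
    succeeded: bool
    recovered_at: Optional[datetime] = None  # set only for failed releases


def summarize(releases):
    """Return mean lead time, success rate, and mean time to recovery."""
    if not releases:
        raise ValueError("at least one release is required")
    lead_times = [r.deployed_at - r.committed_at for r in releases]
    failures = [r for r in releases if not r.succeeded]
    recoveries = [r.recovered_at - r.deployed_at for r in failures if r.recovered_at]
    return {
        "mean_lead_time": sum(lead_times, timedelta()) / len(lead_times),
        "success_rate": 1 - len(failures) / len(releases),
        "mean_time_to_recovery": (
            sum(recoveries, timedelta()) / len(recoveries) if recoveries else None
        ),
    }


t0 = datetime(2020, 1, 1, 9, 0)
summary = summarize([
    Release(t0, t0 + timedelta(hours=2), succeeded=True),
    Release(t0, t0 + timedelta(hours=4), succeeded=False,
            recovered_at=t0 + timedelta(hours=4, minutes=30)),
])
```

With these two sample releases, the mean lead time is three hours, the success rate is 0.5, and the mean time to recovery is 30 minutes.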
One key area to make sure you monitor: API calls. There aren't many applications I come across these days that do not include some 3rd-party API, be it for authentication, analytics, storage, or customer relationship management. Such API calls can so greatly impact digital performance that not monitoring them to identify things such as performance slowdowns and dependencies is a prescription for pain.
Senior Consultant and Founder of RootPerformance
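As a rough illustration of monitoring third-party API calls, the sketch below wraps each outbound call in a timer and records its duration and outcome per dependency. The `track_call` helper, the in-memory `call_log`, and the `auth-provider` and `crm` names are all hypothetical; a real setup would ship these measurements to a monitoring backend rather than keep them in a dictionary.

```python
import time
from collections import defaultdict
from contextlib import contextmanager

# Dependency name -> list of (duration in seconds, succeeded) tuples.
call_log = defaultdict(list)


@contextmanager
def track_call(dependency: str):
    """Time the wrapped block and record whether it raised an exception."""
    start = time.perf_counter()
    ok = True
    try:
        yield
    except Exception:
        ok = False
        raise  # re-raise so the caller still sees the failure
    finally:
        call_log[dependency].append((time.perf_counter() - start, ok))


def fake_auth_lookup():
    """Stand-in for a real third-party request."""
    time.sleep(0.01)


def fake_crm_update():
    """Stand-in for a third-party request that fails."""
    raise TimeoutError("simulated timeout")


with track_call("auth-provider"):
    fake_auth_lookup()

try:
    with track_call("crm"):
        fake_crm_update()
except TimeoutError:
    pass  # the failure is already recorded in call_log
```

Recording failures alongside durations matters because a slow or failing dependency often shows up first as a change in one of these two numbers.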
Cloud, containers and microservices are creating increasingly ephemeral, modular and volatile IT environments. In these dynamic environments, traditional monitoring approaches fail. A modern monitoring approach is required to provide complete visibility into the applications, containers, hosts and underlying supporting infrastructure. This includes having visibility into the performance of, and data returning from, APIs, which have become a key component of any microservices architecture. A modern monitoring approach includes the analytics and intelligence to understand how changes might impact the overall user experience, as well as flexible monitoring techniques that don't overload the containerized application environment.
Director, Product Marketing, CA Technologies
Finding a tool that fits seamlessly into your workflows, setting performance benchmarks, validating payloads, and getting visibility into the performance of API transactions are critical to helping teams rapidly identify and fix issues in production so that the delivered digital experience matches the vision for end users.
VP of Product, AlertSite UXM, SmartBear
APIs are the fundamental building block of modern software. While engineering teams have built extensive monitoring systems to check the health of code execution paths, they have little visibility into what's going on with APIs. An API failure can bring down systems and without proper monitoring in place, it can be very hard to debug what's going on.
The nature of development means systems are going to spring into existence and then back out again often, and that this rapid change is OK, which means your monitoring needs to be OK with it. The ability to monitor containers, ephemeral services, and the like, is a must.
Head Geek, SolarWinds
Let's go to the extreme and say you could only monitor one thing: that one thing would be microservice response time. In this brave new world, it's actually quite difficult to understand how well your revenue-critical application is performing. While traditional metrics still matter (CPU, memory, disk, etc.), your response time on a microservice-by-microservice basis is the thing that matters the most. This single metric will tell you more about the customer experience than anything else. It will indicate downtime or more subtle performance problems in your application. While this metric alone will not tell you "why" something is going on, it will tell you "what" is happening and allow you to quickly isolate a problem to a handful of services or some set of underlying infrastructure.
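To make the per-service view concrete, here is a minimal sketch that groups response times by microservice and flags the services whose 95th percentile exceeds a threshold. The choice of the 95th percentile (nearest-rank method), the 500 ms threshold, and the service names are assumptions for illustration only.

```python
import math
from collections import defaultdict


def p95(samples):
    """95th percentile using the nearest-rank method."""
    ordered = sorted(samples)
    rank = math.ceil(0.95 * len(ordered))
    return ordered[rank - 1]


def slow_services(timings_ms, threshold_ms):
    """Return {service: p95} for services whose p95 exceeds the threshold.

    timings_ms is an iterable of (service name, response time in ms) pairs.
    """
    by_service = defaultdict(list)
    for service, ms in timings_ms:
        by_service[service].append(ms)
    result = {}
    for service, samples in by_service.items():
        value = p95(samples)
        if value > threshold_ms:
            result[service] = value
    return result


# Hypothetical measurements for two services.
timings = [("cart", 40), ("cart", 45), ("checkout", 120), ("checkout", 900)]
flagged = slow_services(timings, threshold_ms=500)
```

In this sample only `checkout` is flagged, which narrows the investigation to one service without yet explaining why it is slow.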
As you evolve and enhance your company's hybrid data center infrastructure to keep pace with your industry, understanding your unique workload I/O DNA is paramount to success. Real-time monitoring of the I/O path – from the virtual server to the storage array – is essential to ensuring digital performance. For mission-critical applications, understanding the performance of each and every transaction is the cornerstone of customer satisfaction and revenue assurance.
CMO, Virtual Instruments
Read Len Rosenthal's new blog on APMdigest: Infrastructure Monitoring for Digital Performance Assurance.
Read What You Should Be Monitoring to Ensure Digital Performance - Part 4, covering the infrastructure, including the cloud and the network.