APMdigest asked experts from across the IT industry for their opinions on what IT departments should be monitoring to ensure digital performance. Part 3 covers the development side.
Code-level issues are a common cause of application slowness and have fueled the need for distributed transaction tracing, which can help isolate the exact line of code with errors. This type of monitoring can also be effectively applied in both pre- and post-production environments, enabling us to prevent performance issues before they impact end users as well as help isolate them when they do occur.
When this type of application monitoring is done in context of infrastructure dependencies, it helps distinguish if there are other issues affecting application code processing, such as a bottleneck in the application server, long-running database queries, slow third-party calls, or other issues that may be associated with the application ecosystem. Applications are the heart of IT workloads, and application performance monitoring is critical to effectively ensure the performance of digital services.
Director, Product Marketing, eG Innovations
Digital performance is complex and can be measured in many ways, but one critical consideration is how well does the application do what it is supposed to do? Is it meeting a functional performance metric for customer expectations? To ensure this, organizations need to look at the "fingerprint" of each error in code to discern its importance as well as look at the number of critical errors per release. This dictates the overall functional reliability of the code. It also requires you to be code-aware, monitoring from inside the application at runtime, not surrounding it or listening to the exhaust.
CTO and Co-Founder, OverOps
Most people already know to monitor the obvious things, like total latency to response. But my favorite monitor comes from Anatoly Mikhaylov's talk at DASH this year. He spoke about finding massive infrastructure costs hidden in error codes. Adding APM monitoring to the errors in your endpoints can show costs you wouldn't otherwise see.
APM Developer Advocate, Datadog
When automating you application release, it's important to remember what you need to monitor. This will allow you to go as fast as possible, but also make sure you are doing it efficiently. Monitor your lead time, success vs failure rate and mean time to recovery will ensure you focus on value rather than on effort.
Co-Founder and CTO, DBmaestro
One key area to make sure you monitor: API calls. There aren't many applications I come across these days that do not include some 3rd-party API, be it for authentication, analytics, storage, or customer relationship management. Such API calls can so greatly impact digital performance that not monitoring them to identify things such as performance slowdowns and dependencies is a prescription for pain.
Senior Consultant and Founder of RootPerformance
Cloud, containers and microservices are creating increasingly ephemeral, modular and volatile IT environments. In these dynamic environments, traditional monitoring approaches fail. A modern monitoring approach is required to provide complete visibility into the applications, containers, host and underlying supporting infrastructure. This includes having visibility into the performance of and data returning from APIs which have become a key component to any microservices architecture. A modern monitoring approach includes the analytics and intelligence to understand how changes might impact the overall user experience and flexible monitoring techniques that don't overload the containerized application environment.
Director, Product Marketing, CA Technologies
Finding a tool that fits seamlessly into your workflows, setting performance benchmarks, validating payloads, and getting visibility into the performance of API transactions is critical to help teams get rapidly identify and fix issues in production so that the delivered digital experience matches the vision for end-users.
VP of Product, AlertSite UXM, SmartBear
APIs are the fundamental building block of modern software. While engineering teams have built extensive monitoring systems to check the health of code execution paths, they have little visibility into what's going on with APIs. An API failure can bring down systems and without proper monitoring in place, it can be very hard to debug what's going on.
The nature of development means systems are going to spring into existence and then back out again often, and that this rapid change is OK, which means your monitoring needs to be OK with it. The ability to monitor containers, ephemeral services, and the like, is a must.
Head Geek, SolarWinds
Let's go to the extreme and say you could only monitor one thing — that one thing would be microservice response time. In this brave new world, it's actually quite difficult to understand how well your revenue-critical application is performing. While traditional metrics still matter (CPU, memory, disk, etc), your response time on a microservice-by-microservice basis is the thing that matters the most. This single metric will tell you more about the customer experience than anything else. It will indicate downtime or more subtle performance problems in your application. While this metric alone will not tell you "why" something is going on, it will tell you "what" is happening and allow you to quickly isolate a problem to a handful of services or some set of underlying infrastructure.
As you evolve and enhance your company's hybrid data center infrastructure to keep pace with your industry, understanding your unique workload I/O DNA is paramount to success. Real-time monitoring of the I/O path – from the virtual server to the storage array – is essential to ensuring digital performance. For mission-critical applications, understanding the performance of each and every transaction is the cornerstone of customer satisfaction and revenue assurance.
CMO, Virtual Instruments
Read Len Rosenthal's new blog on APMdigest: Infrastructure Monitoring for Digital Performance Assurance.
Read What You Should Be Monitoring to Ensure Digital Performance - Part 4, covering the infrastructure, including the cloud and the network.
Retail companies typically start planning and testing in August and freeze code in September, but — according to a new survey commissioned by Catchpoint — due to COVID-19, most respondents (58%) are starting their planning and testing earlier than before ...
The outsourcing of IT infrastructure to a dedicated provider can make it difficult for organizations to understand where and how their operations are running and can become a breeding ground for misunderstanding and myths. To help clear up some of these myths, I've put together a guide to support organizations in the decision-making process and help them understand whether moving to the cloud is the right option for their business ...
Rapid adoption of cloud services, widespread use of SaaS applications, and reliance on the Internet has created business continuity risks for enterprises, according to the 2020 Internet Performance Report: COVID-19 Impact Edition from ThousandEyes ...
In Episode 2, Jonah Kowall, CTO of Logz.io and former Gartner Research VP, joins the AI+ITOPS Podcast to discuss some of the hottest topics in ITOps today, including AIOps, Open Telemetry, Observability, and the challenges of Big Data in AI ...
Dennis Drogseth, VP at EMA, on the AI+ITOPS Podcast: "Digital transformation ... and the need for IT to enable digital business outcomes, is greater than ever, and all the tools including AIOps and automation ... are critical in making the difference ..."
Application or network downtime is expensive, and given the growing numbers and types of high-availability and mission-critical applications, systems and networks — and our increasing reliance on them — ensuring consistent access to mission-critical applications is essential for ensuring customer loyalty and keeping employees productive. Businesses must recognize that applications availability depends on the network and implement a strategy to ensure network-aware application performance monitoring ...
Business leaders around the world are comfortable and optimistic about the broad scale shift toward remote work, according to the Riverbed Future of Work Global Survey 2020 that included 700 business decision makers, however findings also indicate the vast majority of organizations were not well prepared when the COVID-19 pandemic began ...
APMdigest and The Field CTO joined forces to launch the AI+ITOPS Podcast. The mission of the podcast is to discuss the struggles faced by ITOps — such as digital transformation and the need to keep IT services "always on" — and explore how AI/ML, AIOps, APM and other ITOps and DevOps technologies can help. Episode 1 features guest Dennis Drogseth, VP at Enterprise Management Associates (EMA) ...
One of the most frustrating experiences for website visitors is a slow, unresponsive website. Worst-case scenario, a web bounce causes prospects to permanently bounce from your company. In an effort to help companies improve web performance, Google launched the Web Vitals initiative in May and announced three new search engine ranking factors ...