APMdigest asked experts from across the IT industry for their opinions on what IT departments should be monitoring to ensure digital performance. Part 3 covers the development side.
Code-level issues are a common cause of application slowness and have fueled the need for distributed transaction tracing, which can help isolate the exact line of code with errors. This type of monitoring can also be effectively applied in both pre- and post-production environments, enabling us to prevent performance issues before they impact end users as well as help isolate them when they do occur.
When this type of application monitoring is done in context of infrastructure dependencies, it helps distinguish if there are other issues affecting application code processing, such as a bottleneck in the application server, long-running database queries, slow third-party calls, or other issues that may be associated with the application ecosystem. Applications are the heart of IT workloads, and application performance monitoring is critical to effectively ensure the performance of digital services.
Director, Product Marketing, eG Innovations
Digital performance is complex and can be measured in many ways, but one critical consideration is how well does the application do what it is supposed to do? Is it meeting a functional performance metric for customer expectations? To ensure this, organizations need to look at the "fingerprint" of each error in code to discern its importance as well as look at the number of critical errors per release. This dictates the overall functional reliability of the code. It also requires you to be code-aware, monitoring from inside the application at runtime, not surrounding it or listening to the exhaust.
CTO and Co-Founder, OverOps
Most people already know to monitor the obvious things, like total latency to response. But my favorite monitor comes from Anatoly Mikhaylov's talk at DASH this year. He spoke about finding massive infrastructure costs hidden in error codes. Adding APM monitoring to the errors in your endpoints can show costs you wouldn't otherwise see.
APM Developer Advocate, Datadog
When automating you application release, it's important to remember what you need to monitor. This will allow you to go as fast as possible, but also make sure you are doing it efficiently. Monitor your lead time, success vs failure rate and mean time to recovery will ensure you focus on value rather than on effort.
Co-Founder and CTO, DBmaestro
One key area to make sure you monitor: API calls. There aren't many applications I come across these days that do not include some 3rd-party API, be it for authentication, analytics, storage, or customer relationship management. Such API calls can so greatly impact digital performance that not monitoring them to identify things such as performance slowdowns and dependencies is a prescription for pain.
Senior Consultant and Founder of RootPerformance
Cloud, containers and microservices are creating increasingly ephemeral, modular and volatile IT environments. In these dynamic environments, traditional monitoring approaches fail. A modern monitoring approach is required to provide complete visibility into the applications, containers, host and underlying supporting infrastructure. This includes having visibility into the performance of and data returning from APIs which have become a key component to any microservices architecture. A modern monitoring approach includes the analytics and intelligence to understand how changes might impact the overall user experience and flexible monitoring techniques that don't overload the containerized application environment.
Director, Product Marketing, CA Technologies
Finding a tool that fits seamlessly into your workflows, setting performance benchmarks, validating payloads, and getting visibility into the performance of API transactions is critical to help teams get rapidly identify and fix issues in production so that the delivered digital experience matches the vision for end-users.
VP of Product, AlertSite UXM, SmartBear
APIs are the fundamental building block of modern software. While engineering teams have built extensive monitoring systems to check the health of code execution paths, they have little visibility into what's going on with APIs. An API failure can bring down systems and without proper monitoring in place, it can be very hard to debug what's going on.
The nature of development means systems are going to spring into existence and then back out again often, and that this rapid change is OK, which means your monitoring needs to be OK with it. The ability to monitor containers, ephemeral services, and the like, is a must.
Head Geek, SolarWinds
Let's go to the extreme and say you could only monitor one thing — that one thing would be microservice response time. In this brave new world, it's actually quite difficult to understand how well your revenue-critical application is performing. While traditional metrics still matter (CPU, memory, disk, etc), your response time on a microservice-by-microservice basis is the thing that matters the most. This single metric will tell you more about the customer experience than anything else. It will indicate downtime or more subtle performance problems in your application. While this metric alone will not tell you "why" something is going on, it will tell you "what" is happening and allow you to quickly isolate a problem to a handful of services or some set of underlying infrastructure.
As you evolve and enhance your company's hybrid data center infrastructure to keep pace with your industry, understanding your unique workload I/O DNA is paramount to success. Real-time monitoring of the I/O path – from the virtual server to the storage array – is essential to ensuring digital performance. For mission-critical applications, understanding the performance of each and every transaction is the cornerstone of customer satisfaction and revenue assurance.
CMO, Virtual Instruments
Read Len Rosenthal's new blog on APMdigest: Infrastructure Monitoring for Digital Performance Assurance.
Read What You Should Be Monitoring to Ensure Digital Performance - Part 4, covering the infrastructure, including the cloud and the network.
Business leaders are in the unique position of having immediate access to huge amounts of data in today's smartphone and laptop-dominated world. They are also under pressure to make data-driven decisions and mobile business intelligence can one of the most valuable decision making tools in their arsenal ...
Unlike some AI initiatives, AIOps doesn't always necessitate the use of a data scientist, so don't let hiring expenses put your AIOps initiatives on hold. It is always nice to have IT team members with AI skills, but this becomes less critical as more intelligent solutions come into prominence that offer AIOps features out of the box, a readily deployable option for IT ...
AIOps is rapidly becoming a de-facto option for enterprises' IT strategies, with nearly immeasurable benefits to be provided. However, AIOps is still a relatively new discipline and misconceptions surrounding the technology's capabilities and uses have caused bottlenecks and roadblocks in its widespread adoption. So, what should organizations expect from AIOps? How can organizations that want to digitally transform their IT pursue AIOps for maximum benefit? ...
In response to the global pandemic, companies have given their workforce the tools they need to work remote. And research shows it has increased their engagement and productivity. But these gains are on the brink of being wiped out. According to a new study from Citrix Systems, Inc., employees feel they've been given too many tools and not enough efficient ways to execute. And it's hindering their ability to get things done ...
The third installment of Aptum's four-part Cloud Impact Study, A Bright Forecast on Cloud, presents data showing the benefits organizations gain from cloud computing, as well as mistakes to avoid during migration. As organizations migrate workloads to different cloud platforms, they often run into unexpected challenges due to a lack of proactive planning. Here are a few key findings from Part 3 of the Cloud Impact Study ...
Currently, (and most likely well into the future) the overwhelming majority of organizations still need to monitor and maintain enterprise applications. Moreover, where these are complex systems developed, debugged and refined over years, often decades, around a business's core processes, there can also be very strong practical arguments for viewing them as classics. They can offer a valuable legacy, one best left where it is, doing what it does, how it always has done ...
Anti-patterns involve realizing a problem and implementing a non-optimal solution that is broadly embraced as the go-to method for solving that problem. This solution sounds good in theory, but for one reason or another it is not the best means of solving the problem. Anti-patterns are common across IT as well, especially around application monitoring and observability. One that is particularly prevalent is in response to the increasing complexity of cloud-native infrastructure and applications ...
The introduction of the latest technology — such as AI and machine learning — can be seen as a way for organizations to accelerate growth, increase efficiency, and improve customer service. However, the truth is that the technology alone will do little to deliver on these business outcomes. AI for IT operations (AIOps) is one area where the application of technology, if not matched with organizational maturity readiness, will fail to deliver all the promised benefits ...
SREs that fail to deliver customer value run the risk of being stuck in an operational toil rut. Conversely, businesses failing to recognize the bi-modal nature and importance of SRE activities run the risk of losing talented employees and their competitive edge ...
As part of digital transformation initiatives, IT teams are quickly adopting AIOps solutions to accommodate a new multifaceted infrastructure. However, there are still several roadblocks IT leaders must overcome when adopting AIOps — namely, understanding how to showcase ROI and changing their team's cultural mindset around adopting a new strategy ...