
4 Key Resources to Monitor in the Cloud

Good application performance monitoring in the cloud involves repeatedly testing a few key areas that behave differently in most cloud environments than they do in traditional environments. Recording the resulting values over time lets you identify normal usage patterns and trends, and establish a baseline for the normal behavior of your provider's resources.

Valuable resources to monitor in the cloud include:

1. Network Latency

If your application depends on access to a network resource, such as DNS for reverse lookups that map IP addresses to host names, the application should test that resource regularly, and your monitoring system should record the results in an easily visualized format. The access time to the hosted application should also be checked and tracked from both cloud and non-cloud locations. This allows differential latency comparisons that help reduce uncertainty about the root cause of slow response times. For instance, if the application is fast from within the cloud and slow from outside it, is there a network issue on the cloud provider's Internet-facing systems?
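
As a minimal sketch of the differential comparison described above, the following Python snippet times a reverse DNS lookup and a TCP connection, and compares median latency measured from two vantage points. The function names and the idea of collecting samples into plain lists are assumptions made for illustration, not part of any particular monitoring product.

```python
import socket
import time
from statistics import median


def time_call(fn, *args):
    """Return elapsed seconds for one call, or None if it raised OSError."""
    start = time.perf_counter()
    try:
        fn(*args)
    except OSError:  # covers DNS failures, refused connections, and timeouts
        return None
    return time.perf_counter() - start


def reverse_dns_latency(ip):
    """Seconds taken to resolve an IP address back to a host name."""
    return time_call(socket.gethostbyaddr, ip)


def tcp_connect_latency(host, port, timeout=3.0):
    """Seconds taken to open (and immediately close) a TCP connection."""
    def connect():
        with socket.create_connection((host, port), timeout=timeout):
            pass
    return time_call(connect)


def differential(inside_samples, outside_samples):
    """Median latency from outside the cloud minus median from inside."""
    return median(outside_samples) - median(inside_samples)
```

A probe running inside the cloud and another running outside it could each call `tcp_connect_latency` on a schedule and report their samples to the monitoring system. A large positive value from `differential` would then point toward the path between the outside world and the provider rather than toward the application itself.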

2. Cloud API Feature Availability

If your application is dynamic and needs features of the cloud vendor's API to function, you should script and test those functions to ensure that they are available and that they perform fast enough to meet your needs. Operations such as launching an instance, taking a volume snapshot, or adding a new volume to a running instance are good candidates for periodic testing.
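
One way to script such checks is a small probe wrapper that records both whether an operation succeeded and whether it finished within a time budget. The sketch below is illustrative only: `probe`, `ProbeResult`, and the budget value are hypothetical names, and the operation passed in stands for whatever call your cloud vendor's SDK provides for launching an instance or taking a snapshot.

```python
import time
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class ProbeResult:
    name: str
    ok: bool                 # the operation completed without raising
    seconds: float           # wall-clock duration of the attempt
    within_budget: bool      # succeeded AND finished inside the budget
    error: Optional[str] = None


def probe(name: str, operation: Callable[[], object],
          budget_seconds: float) -> ProbeResult:
    """Run one API operation, timing it and capturing any failure."""
    start = time.perf_counter()
    error = None
    try:
        operation()
    except Exception as exc:  # a probe should report failures, not crash
        error = f"{type(exc).__name__}: {exc}"
    elapsed = time.perf_counter() - start
    ok = error is None
    return ProbeResult(name, ok, elapsed, ok and elapsed <= budget_seconds, error)
```

Keeping availability (`ok`) separate from speed (`within_budget`) lets the monitoring system distinguish an API that is down from one that is merely slow, which usually call for different responses.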

3. Virtualization Overhead

Differential monitoring of instances in the cloud versus instances on physical hardware can help you determine the overall virtualization overhead for your application. Knowing the relative performance will help you size the instances you launch, and lets you calculate the cost of operating on cloud infrastructure versus in-house. This makes cost-benefit analysis and cost-based justification for using cloud systems possible.
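
The sizing and cost arithmetic can be sketched in a few lines of Python. Everything here is an assumption for illustration: throughput is taken to be requests per second from the same benchmark run on both platforms, and the hourly rate and the 730-hour month are placeholder figures rather than real pricing.

```python
import math


def overhead_fraction(hardware_throughput, cloud_throughput):
    """Fraction of performance lost on the cloud instance (0.2 means 20%)."""
    return 1.0 - cloud_throughput / hardware_throughput


def instances_needed(target_throughput, per_instance_throughput):
    """Smallest whole number of instances that meets the target load."""
    return math.ceil(target_throughput / per_instance_throughput)


def monthly_cost(instances, hourly_rate, hours=730):
    """Cost of running the instances continuously for one month."""
    return instances * hourly_rate * hours
```

For example, if a benchmark reaches 1000 requests per second on physical hardware and 800 on a cloud instance, the overhead is about 20%. Serving 5000 requests per second would then take 7 cloud instances, and at a hypothetical rate of 0.10 per instance-hour that comes to roughly 511 per month, a figure that can be set against the in-house cost.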

4. Configuration Tracking

Many of the failures experienced by computing infrastructures are the result of improperly managed configuration changes. Knowing when a configuration was last changed is therefore a critical piece of information in root cause analysis. At a minimum, the monitoring system should have a record of boot time (often associated with updates or other configuration changes), and ideally it will also have some indication of the nature of the change.
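
A minimal sketch of both pieces, assuming a Linux host: boot time is derived from `/proc/uptime`, and configuration changes are detected by comparing SHA-256 fingerprints of a chosen set of files between monitoring runs. The function names and the choice of which files to watch are hypothetical.

```python
import hashlib
import time
from pathlib import Path


def boot_time_linux():
    """Boot time as epoch seconds. Linux only: reads /proc/uptime."""
    uptime_seconds = float(Path("/proc/uptime").read_text().split()[0])
    return time.time() - uptime_seconds


def fingerprint(paths):
    """Map each path to the SHA-256 of its contents, or None if missing."""
    result = {}
    for p in paths:
        path = Path(p)
        if path.is_file():
            result[str(path)] = hashlib.sha256(path.read_bytes()).hexdigest()
        else:
            result[str(path)] = None
    return result


def changed_paths(previous, current):
    """Sorted list of paths whose fingerprint differs between two runs."""
    all_paths = previous.keys() | current.keys()
    return sorted(p for p in all_paths if previous.get(p) != current.get(p))
```

Storing each run's fingerprints alongside a timestamp gives the monitoring system the time of the last change, and the list returned by `changed_paths` gives a first indication of what changed.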

While moving to the cloud can be cost-effective in the abstract, as with any technology project it is important to validate the assumptions you make when deciding what to move, and to verify what the cost savings actually turn out to be.

About Roger Ruttiman

Roger Ruttiman, VP of Engineering & Quality at GroundWork, has 18 years of software development and leadership experience. Ruttiman is the lead architect responsible for product architecture, building and managing local and offshore teams. Before joining GroundWork, Ruttiman was a lead engineer at Advent Software in San Francisco, and at Autodesk in the US and Europe.

