Maintaining Application Performance with Distributed Users
November 30, 2021

Nadeem Zahid
cPacket Networks

Thanks to pandemic-related work-from-home (WFH) and digital/mobile customer experience initiatives, employees and users are more distributed than ever. At the same time, organizations everywhere are adopting a cloud-first or cloud-smart architecture, distributing their business applications across private and public cloud infrastructures. Private data centers continue to be consolidated, while more and more branch offices are connecting to data centers and the public cloud simultaneously. Maintaining application performance for distributed users in this increasingly hybrid environment is a significant challenge for IT teams.

Application performance depends on network performance. Networks connect end users and IoT devices with applications, and they connect application components such as application servers, database servers, and microservices to one another. Whether users are internal employees or external customers, their experience with enterprise, web-based, and SaaS applications directly affects an organization's success, through either sales and revenue or employee productivity. Monitoring and troubleshooting both the network and the applications helps the business keep its mission-critical applications performing well.

IT faces many new challenges when trying to do this for a distributed user base, including:

No visibility into WFH and SaaS traffic: IT no longer has full visibility into traffic from users who work from home or other remote locations and use SaaS applications across the public internet. IT teams are blind to issues on those paths and are forced to rely on user complaints to diagnose problems, which is not a recipe for success.

Tapping the public cloud: The cloud is often a major blind spot to the Application Operations (AppOps) team. How can they measure, much less assure, application performance and dependencies for traffic they can't see? Cloud-native monitoring tools can help observe infrastructure and application layers, but they come with significant limitations. They are vendor-specific, often lack features and visibility compared to on-premises tools, and typically do not integrate well with those on-premises tools.

Troubleshooting without control: Remote employees might be working from a variety of locations, including home, public networks, branch offices, or headquarters, and key applications may be virtualized, in the cloud, or located on premises. Traffic between these locations that does not pass through a physical switch or firewall is invisible to traditional network traffic collection and analysis tools. The pressure on IT to ensure a good experience for users in all these scenarios has increased, but its control and ability to troubleshoot have gone down.

To ensure application performance for distributed users, IT must reliably monitor traffic across physical, virtual and cloud-native elements deployed across data centers, branch offices, and multi-cloud environments. Here are some techniques for accomplishing this:

Getting the Right Data

The first step toward ensuring application performance for distributed users is data mining. This starts with tapping strategic points in the network across physical, virtual, and cloud infrastructure. IT must collect data from all critical locations, including north-south traffic into and out of data centers and the cloud, as well as east-west traffic between virtual machines and between the application and database components of a software-defined data center. Speeds and feeds, scale, and cost matter at this stage. Then IT needs an analysis tool to make sense of the accumulated packet, flow, and metadata records. This quickly gets complicated, but in general, IT should be able to measure baselines for application and network performance (latency and connection errors, for example), set thresholds for normal behavior, map dependencies, and generate alerts for service level monitoring. This last part is vital: alerting when performance deviates from a normal range allows IT to proactively investigate and fix issues before users complain.
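To make the baseline-and-threshold idea concrete, here is a minimal sketch in Python. It assumes latency samples have already been extracted from the captured traffic, and it uses a simple "mean plus or minus k standard deviations" rule as the normal range. The function names, the sample values, and the choice of k = 3 are illustrative assumptions, not part of any particular monitoring product.

```python
from statistics import mean, stdev

def build_baseline(samples_ms):
    """Summarize historical latency samples as (mean, standard deviation)."""
    return mean(samples_ms), stdev(samples_ms)

def deviates(value_ms, baseline, k=3.0):
    """Return True when a sample falls outside mean +/- k standard deviations."""
    mu, sigma = baseline
    return abs(value_ms - mu) > k * sigma

def check_samples(new_samples_ms, baseline, k=3.0):
    """Return the samples that should raise an alert."""
    return [v for v in new_samples_ms if deviates(v, baseline, k)]

# Hypothetical latency measurements (milliseconds) from a normal period.
history = [20, 22, 19, 21, 23, 20, 22, 21]
baseline = build_baseline(history)

# New measurements: only the 95 ms sample falls outside the normal range.
alerts = check_samples([21, 24, 95], baseline)
print(alerts)
```

A real deployment would likely track separate baselines per application, site, and time of day, since "normal" latency for a branch office at 9 a.m. differs from normal latency in a data center at midnight.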

Tapping the Cloud

One successful approach to collecting, consolidating, and analyzing traffic in the cloud involves a software-only solution natively integrated with leading Virtual Private Cloud (VPC) traffic-mirroring services. Advanced functions such as filtering, load balancing, slicing, etc. can be applied to the cloud application workloads. This not only enables seamless access to the VPC's network data, but it also reduces complexity and cost. By natively replicating and monitoring network traffic to tools within their VPC, IT teams can avoid using forwarding agents or container-based sensors.
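The advanced functions mentioned above (filtering, slicing, and load balancing) can be illustrated with a small Python sketch. This is a simplified model operating on in-memory records, not the API of any cloud traffic-mirroring service; the `Packet` class, the function names, and the choice of a CRC32 hash are all assumptions made for illustration.

```python
import zlib
from dataclasses import dataclass

@dataclass(frozen=True)
class Packet:
    """Hypothetical, simplified view of a mirrored packet."""
    src: str
    dst: str
    sport: int
    dport: int
    payload: bytes

def keep(pkt, ports):
    """Filtering: keep only traffic to or from the ports of interest."""
    return pkt.sport in ports or pkt.dport in ports

def slice_packet(pkt, max_bytes):
    """Slicing: truncate the payload so tools receive less data per packet."""
    return Packet(pkt.src, pkt.dst, pkt.sport, pkt.dport, pkt.payload[:max_bytes])

def pick_tool(pkt, n_tools):
    """Load balancing: hash the flow so both directions reach the same tool."""
    endpoints = sorted([(pkt.src, pkt.sport), (pkt.dst, pkt.dport)])
    return zlib.crc32(repr(endpoints).encode()) % n_tools

request = Packet("10.0.0.5", "10.0.1.9", 51514, 443, b"x" * 1500)
reply = Packet("10.0.1.9", "10.0.0.5", 443, 51514, b"y" * 900)

for pkt in (request, reply):
    if keep(pkt, ports={443}):
        trimmed = slice_packet(pkt, max_bytes=128)
        print(pick_tool(trimmed, n_tools=4), len(trimmed.payload))
```

Sorting the two endpoints before hashing makes the flow key symmetric, so a request and its reply land on the same analysis tool, which matters for any tool that measures round-trip behavior.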

By monitoring application traffic before a cloud migration, IT can build a baseline of normal performance. During and after the migration, they can continue monitoring to see if performance deviates, and proactively identify issues before they affect users.
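One way to compare pre-migration and post-migration performance is to look at a high percentile of latency rather than the average, since tail latency is usually what users notice. The Python sketch below assumes latency samples are available for both periods; the nearest-rank percentile method, the 95th percentile, the 20% tolerance, and the sample values are illustrative assumptions.

```python
import math

def percentile(samples, pct):
    """Nearest-rank percentile of a non-empty list of samples."""
    ordered = sorted(samples)
    rank = max(1, math.ceil(pct / 100 * len(ordered)))
    return ordered[rank - 1]

def migration_regressed(before_ms, after_ms, pct=95, tolerance=0.20):
    """Compare tail latency before and after a migration.

    Returns (regressed, baseline_value, current_value), where regressed is
    True when the post-migration percentile exceeds the baseline by more
    than the given tolerance.
    """
    base = percentile(before_ms, pct)
    current = percentile(after_ms, pct)
    return current > base * (1 + tolerance), base, current

# Hypothetical latency samples in milliseconds.
before = [30, 32, 31, 35, 33, 34, 36, 32, 31, 40]
after = [45, 50, 48, 52, 47, 49, 51, 46, 50, 70]

regressed, base, current = migration_regressed(before, after)
print(regressed, base, current)
```

Running the same comparison continuously during the migration, instead of once at the end, gives IT a chance to catch a regression while a rollback is still cheap.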

Distributed user bases are here to stay, thanks to hybrid work schedules, cloud migrations, virtualization and data center consolidation. IT must adapt to this new reality and ensure their monitoring capabilities can proactively identify linked network and application issues and reduce cost and complexity no matter where users are located.

Nadeem Zahid is VP of Product Management & Marketing at cPacket Networks
