Pepperdata Introduces Observability and Optimization for GPUs Running Big Data Applications
August 17, 2021

Pepperdata announced that the Pepperdata product portfolio now includes the ability to monitor Graphics Processing Units (GPUs) running big data applications like Spark on Kubernetes.

Workloads that harness tremendous amounts of data, such as machine learning (ML) and artificial intelligence (AI) applications, often rely on GPUs, processors originally designed to accelerate graphics rendering. That extra processing power comes with a high price tag, and GPU usage requires near-constant monitoring for resource waste to get the best performance at the lowest possible cost.

Pepperdata now monitors GPU performance, providing visibility into Spark applications that run on Kubernetes and use the processing power of GPUs. With this new visibility, companies can improve the performance of their Spark apps running on those GPUs and manage costs at a granular level.

Unlike traditional infrastructure monitoring, which is limited to platform-level metrics, the Pepperdata solution provides visibility into GPU resource utilization at the application level. Pepperdata also provides instant recommendations for optimization. Features include:

- Visibility into GPU memory usage and waste

- Fine-tuning of GPU usage through end-user recommendations

- Ability to attribute usage and cost to specific end-users

“Spark on Kubernetes is quickly becoming a dominant part of the compute infrastructure as data-intensive ML and AI applications proliferate,” said Ash Munshi, CEO, Pepperdata. “GPUs can handle these workloads, but they are expensive to buy and are power-intensive. Until now, there hasn’t been a way to view and manage the infrastructure and applications, which can lead to unnecessary waste and overspending for big data workloads. With Pepperdata, organizations can properly size their GPU hardware investments and have the confidence that they are utilizing them well.”

There are products on the market for monitoring GPUs, but they typically lack long-term storage and the ability to scale, and they often do not correlate infrastructure metrics to applications. Pepperdata addresses these gaps with insight for data center operators, data scientists, and ML/AI developers. They can now understand who is using what resources, tune and prioritize jobs to eliminate waste, and make sure costs are assigned to the right users or groups across the enterprise.
