Pepperdata Introduces Observability and Optimization for GPUs Running Big Data Applications
August 17, 2021
Share this

Pepperdata announced that the Pepperdata product portfolio now includes the ability to monitor Graphics Processing Units (GPUs) running big data applications like Spark on Kubernetes.

Workloads that harness tremendous amounts of data, such as machine learning (ML) and artificial intelligence (AI) applications, require GPUs, which were originally designed to accelerate graphics rendering. That extra processing power comes with a high price tag, and it requires near-constant monitoring for resource waste to get the best performance at the lowest possible cost.

Pepperdata now monitors GPU performance, providing the visibility needed for Spark applications running on Kubernetes and utilizing the processing power of GPUs. With this new visibility, companies can improve the performance of their Spark apps running on those GPUs and manage costs at a granular level.

Unlike traditional infrastructure monitoring, which is limited to the platform, the Pepperdata solution provides visibility into GPU resource utilization at the application level. Pepperdata also provides instant recommendations for optimization. Features include:

- Visibility into GPU memory usage and waste

- Fine-tuning of GPU usage through end-user recommendations

- Ability to attribute usage and cost to specific end-users

“Spark on Kubernetes is quickly becoming a dominant part of the compute infrastructure as data-intensive ML and AI applications proliferate,” said Ash Munshi, CEO, Pepperdata. “GPUs can handle these workloads, but they are expensive to buy and are power-intensive. Until now, there hasn’t been a way to view and manage the infrastructure and applications, which can lead to unnecessary waste and overspending for big data workloads. With Pepperdata, organizations can properly size their GPU hardware investments and have the confidence that they are utilizing them well.”

There are products on the market for monitoring GPUs, but they typically lack long-term storage, the ability to scale, and often do not correlate infrastructure metrics to applications. Pepperdata solves these problems with insight for data center operators, data scientists, and ML/AI developers. They can now understand who is using what resources, optimize to eliminate waste so jobs can be tuned and prioritized, and make sure costs are assigned appropriately to the right users or groups across the enterprise.

Share this

The Latest

April 15, 2024

Organizations recognize the value of observability, but only 10% of them are actually practicing full observability of their applications and infrastructure. This is among the key findings from the recently completed Logz.io 2024 Observability Pulse Survey and Report ...

April 11, 2024

Businesses must adopt a comprehensive Internet Performance Monitoring (IPM) strategy, says Enterprise Management Associates (EMA), a leading IT analyst research firm. This strategy is crucial to bridge the significant observability gap within today's complex IT infrastructures. The recommendation is particularly timely, given that 99% of enterprises are expanding their use of the Internet as a primary connectivity conduit while facing challenges due to the inefficiency of multiple, disjointed monitoring tools, according to Modern Enterprises Must Boost Observability with Internet Performance Monitoring, a new report from EMA and Catchpoint ...

April 10, 2024

Choosing the right approach is critical with cloud monitoring in hybrid environments. Otherwise, you may drive up costs with features you don’t need and risk diminishing the visibility of your on-premises IT ...

April 09, 2024

Consumers ranked the marketing strategies and missteps that most significantly impact brand trust, which 73% say is their biggest motivator to share first-party data, according to The Rules of the Marketing Game, a 2023 report from Pantheon ...

April 08, 2024

Digital experience monitoring is the practice of monitoring and analyzing the complete digital user journey of your applications, websites, APIs, and other digital services. It involves tracking the performance of your web application from the perspective of the end user, providing detailed insights on user experience, app performance, and customer satisfaction ...

April 04, 2024
Modern organizations race to launch their high-quality cloud applications as soon as possible. On the other hand, time to market also plays an essential role in determining the application's success. However, without effective testing, it's hard to be confident in the final product ...
April 03, 2024

Enterprises are experiencing a 13% year-over-year increase in customer-facing incidents, reflecting rising levels of complexity and risk as businesses drive operational transformation at scale, according to the 2024 State of Digital Operations study from PagerDuty ...

April 02, 2024

According to Grafana Labs' 2024 Observability Survey, it doesn't matter what industry a company is in or the number of employees they have, the truth is: the more mature their observability practices are, the more time and money they save. From AI to OpenTelemetry — here are four key takeaways from this year's report ...

April 01, 2024

In an age where technology evolves at a breakneck pace, it's crucial to explore how AI assistants can revolutionize our work processes and daily lives, ultimately enhancing overall performance ...

March 28, 2024

Nearly all (99%) globa IT decision makers, regardless of region or industry, recognize generative AI's (GenAI) transformative potential to influence change within their organizations, according to The Elastic Generative AI Report ...