Improving Application Performance with NVMe Storage - Part 3
NVMe Storage Use Cases and Summary: Benefits of NVMe storage for AI/ML
May 01, 2019

Zivan Ori
E8 Storage

Share this

Start with Part 1: The Rise of AI and ML Driving Parallel Computing Requirements

Start with Part 2: Local versus Shared Storage for Artificial Intelligence (AI) and Machine Learning (ML)

NVMe Storage Use Cases

NVMe storage's strong performance, combined with the capacity and data availability benefits of shared NVMe storage over local SSD, makes it a strong solution for AI / ML infrastructures of any size. There are several AI / ML focused use cases to highlight.

■ Financial Analytics – Financial services and financial technology (FinTech) are increasingly turning to automation and artificial intelligence to fuel their decision making processes for investments. Using a mix of historical data and financial modeling, one platform can provide the horsepower required for predicting future investment strategies for their financial customers.

■ Image Recognition in Manufacturing – Manufacturing has long used automation in their production lines to increase the output capacity of their production systems, scaling from hundreds of units to thousands or even millions of units per hour. The financial impact of a quality issue on the production line can be devastating if not caught in a timely manner. Real-time image recognition of photos of manufactured parts is essential to determining whether a part meets the quality standards required, as well as capturing systematic quality issues in real-time.

■ Car Services – Ride sharing apps have given rise to a new paradigm in public transit, allowing users and drivers to connect quickly and easily as needed. Ride sharing companies use AI / ML for traffic modeling to position drivers where they are most needed based on both past and current ride sharing requests. This increases the drivers' potential revenue by reducing drive times as well as increases customer satisfaction through reduced wait times, both of which improve the revenue potential for the ride sharing company.

Beyond AI / ML, one vendor also provides more generalized computing services for their customers. They provide storage capacity for cloud services, using OpenStack and Kubernetes in conjunction with NVMe storage for high performance storage. In addition, they also leverage NVMe storage for big data analytics, using spark applications to perform multiple types of data analytics tasks, such as SQL, data mining and more.

Summary: Benefits of NVMe storage for AI/ML

NVMe storage is an ideal solution for countless AI / ML workloads, especially machine learning for multiple applications. With NVMe storage, you can:

■ Create and manage larger shared data-sets for training – By separating out storage capacity from the compute nodes, data-sets for machine learning training can scale up to 1PB. As the data-set grows and more NVMe storage is brought online, performance grows as well, rather than being limited by legacy storage controller bottlenecks.

■ Overcome the capacity limitations of local SSDs in GPU nodes – With limited space for SSD media, GPU nodes have limited capacity to manage larger datasets. With NVMe storage, NVMe volumes can be dynamically provisioned over high performance Ethernet or InfiniBand networks.

■ Accelerate epoch time of machine learning by as much as 10x – By leveraging high performance NVMe-oF, NVMe storage eliminates the latency bottlenecks of older storage protocols and unleashes the parallelism inherent to the NVMe protocol. Every GPU node has direct, parallel access to the media at the lowest possible latency.

■ Improve the utilization of GPUs – Having GPUs rest idle due to slow access to data for processing is costly. By offloading storage access to the idle CPUs, and delivering storage performance at the speed of local SSD, NVMe storage ensures that the GPU-nodes are kept busy with fast access to data.

Zivan Ori is CEO and Co-Founder of E8 Storage
Share this

The Latest

October 16, 2019

Modern enterprises are generating data at an unprecedented rate but aren't taking advantage of all the data available to them in order to drive real-time, actionable insights. According to a recent study commissioned by Actian, more than half of enterprises today are unable to efficiently manage nor effectively use data to drive decision-making ...

October 15, 2019

According to a study by Forrester Research, an enhanced UX design can increase the conversion rate by 400%. If UX has become the ultimate arbiter in determining the success or failure of a product or service, let us first understand what UX is all about ...

October 10, 2019

The requirements of an APM tool are now much more complex than they've ever been. Not only do they need to trace a user transaction across numerous microservices on the same system, but they also need to happen pretty fast ...

October 09, 2019

Performance monitoring is an old problem. As technology has advanced, we've had to evolve how we monitor applications. Initially, performance monitoring largely involved sending ICMP messages to start troubleshooting a down or slow application. Applications have gotten much more complex, so this is no longer enough. Now we need to know not just whether an application is broken, but why it broke. So APM has had to evolve over the years for us to get there. But how did this evolution take place, and what happens next? Let's find out ...

October 08, 2019

There are some IT organizations that are using DevOps methodology but are wary of getting bogged down in ITSM procedures. But without at least some ITSM controls in place, organizations lose their focus on systematic customer engagement, making it harder for them to scale ...

October 07, 2019
OK, I admit it. "Service modeling" is an awkward term, especially when you're trying to frame three rather controversial acronyms in the same overall place: CMDB, CMS and DDM. Nevertheless, that's exactly what we did in EMA's most recent research: <span style="font-style: italic;">Service Modeling in the Age of Cloud and Containers</span>. The goal was to establish a more holistic context for looking at the synergies and differences across all these areas ...
October 03, 2019

If you have deployed a Java application in production, you've probably encountered a situation where the application suddenly starts to take up a large amount of CPU. When this happens, application response becomes sluggish and users begin to complain about slow response. Often the solution to this problem is to restart the application and, lo and behold, the problem goes away — only to reappear a few days later. A key question then is: how to troubleshoot high CPU usage of a Java application? ...

October 02, 2019

Operations are no longer tethered tightly to a main office, as the headquarters-centric model has been retired in favor of a more decentralized enterprise structure. Rather than focus the business around a single location, enterprises are now comprised of a web of remote offices and individuals, where network connectivity has broken down the geographic barriers that in the past limited the availability of talent and resources. Key to the success of the decentralized enterprise model is a new generation of collaboration and communication tools ...

October 01, 2019

To better understand the AI maturity of businesses, Dotscience conducted a survey of 500 industry professionals. Research findings indicate that although enterprises are dedicating significant time and resources towards their AI deployments, many data science and ML teams don't have the adequate tools needed to properly collaborate on, build and deploy AI models efficiently ...

September 30, 2019

Digital transformation, migration to the enterprise cloud and increasing customer demands are creating a surge in IT complexity and the associated costs of managing it. Technical leaders around the world are concerned about the effect this has on IT performance and ultimately, their business according to a new report from Dynatrace, based on an independent global survey of 800 CIOs, Top Challenges for CIOs in a Software-Driven, Hybrid, Multi-Cloud World ...