Connect with us

Hi, what are you looking for?

Biz Solutions

Mellanox works with VMware and NVIDIA to enable virtualized machine learning solutions

Benchmarks demonstrate that the NVIDIA vComputeServer (vCS) for virtualized GPUs achieve two times better efficiency by using VMware’s paravirtualized RDMA (PVRDMA) technology than when using traditional networking protocols.

Mellanox Technologies announced that its RDMA (Remote Direct Memory Access) networking solutions for VMware vSphere enable virtualized Machine Learning solutions that achieve higher GPU utilization and efficiency.

Benchmarks demonstrate that the NVIDIA vComputeServer (vCS) for virtualized GPUs achieve two times better efficiency by using VMware’s paravirtualized RDMA (PVRDMA) technology than when using traditional networking protocols. The benchmark was performed on a four-node cluster running vSphere 6.7 equipped with NVIDIA T4 GPUs with vCS software and Mellanox ConnectX-5 100 GbE SmartNICs, all connected by a Mellanox Spectrum SN2700 100 GbE switch.

The PVRDMA Ethernet solution enables VM-to-VM communication over RDMA, which boosts data communication performance in virtualized environments while achieving significantly higher efficiency compared with legacy TCP/IP transports. Additionally, PVRDMA retains core virtual machine capabilities such as vMotion. This translates to real-world customer advantages including optimized server and GPU utilization, reduce machine learning training time and improved scalability. Using PVRDMA also reduces backup times, improves data center simplicity, simplifies consolidation, lowers power consumption and reduces total cost of ownership.

“As Moore’s Law has slowed, traditional CPU and networking technologies are no longer sufficient to support the emerging machine learning workloads,” said Kevin Deierling, senior vice president marketing, Mellanox Technologies. “Using hardware compute accelerators such as NVIDIA T4 GPUs and Mellanox’s RDMA networking solutions has proven to boost application performance in virtualized deployments.”

Advertisement. Scroll to continue reading.

NVIDIA T4 GPUs supercharge the world’s most trusted mainstream servers, easily fitting into standard data center infrastructures. Their low-profile, 70-watt design is powered by NVIDIA Turing™ Tensor Cores, delivering revolutionary multi-precision performance to accelerate a wide range of modern applications, including machine learning, deep learning, and virtual desktops. With the latest vComputeServer software for GPU virtualization, it also provides maximum performance and manageability for AI, ML and data science workloads in a virtualized server environment.

“Machine learning has become extremely important and every company, regardless of size, must leverage its power to remain competitive,” said Bob Pette, vice president, Professional Visualization NVIDIA. “Our collaboration with VMware and Mellanox creates a high-performance GPU platform that enables acceleration for compute-intensive workloads in the most efficient way.”

Machine learning workloads are extremely resource intensive, often relying on hardware acceleration to achieve the performance necessary to solve large, complex problems in a timely manner. Interconnect acceleration – special hardware that delivers extremely high bandwidth and low latency, and compute acceleration – often delivered through exploitation of very highly-parallel GPU compute engines, are the most common forms of such acceleration. While both types of acceleration have long been available on vSphere, it is now possible with vSphere to combine these technologies to support advanced machine learning applications that allow applications to combine the compute power of NVIDIA GPUs with the high-performance data transfer capabilities of Mellanox RDMA capable adapters, enabling linear scalability.

“Modern data center infrastructures need to keep pace with the compute and efficiency requirements for the exceedingly complex machine learning computational models,” said Sudhanshu (Suds) Jain, Product Management, Cloud Platform Business Units, VMware. “The ability to virtualize GPUs using the latest NVIDIA vComputeServer product and Mellanox’s high-speed networking solutions over vSphere makes it possible to meet those requirements while keeping the cost intact.”

VMware vSphere is fully qualified with Mellanox ConnectX 10/25/40/50/100G adapters today. All Mellanox adapters support PVRDMA over RoCE (RDMA over Converged Ethernet), enabling advanced capabilities like GPU virtualization, and making data center infrastructure RoCE-Ready as new technologies over RDMA become generally available. PVRDMA will also be supported by the latest ConnectX-6 Dx and BlueField-2 SmartNICs.

Advertisement. Scroll to continue reading.

Advertisement
Advertisement
Advertisement

Like Us On Facebook

You May Also Like

HEADLINES

Available to customers today, the new AI agents help supply chain professionals transform the way they work and achieve new levels of productivity by...

HEADLINES

This in-vehicle visual communication kit is designed to bridge the communication gap between passengers and deaf driver-partners, ensuring a safe, smooth, and comfortable ride. 

HEADLINES

This collaboration leverages GCash’s extensive reach and digital platform to make CCP’s offerings more accessible to a wider audience.

HEADLINES

Already available for e-tailers operating in some Asia Pacific markets such as China, Hong Kong SAR, and Japan, the service now empowers Philippine-based e-tailers...

HEADLINES

The security evaluation was performed by Brightsight in Delft, the Netherlands, and the certificate was issued by TrustCB.

HEADLINES

Now, customers using Oracle ECP to power Oracle’s extensive portfolio of industry applications can connect from almost anywhere, including some of the world’s most...

White Papers

7 out of 10 surveyed companies with a high degree of security platformization report their cybersecurity investments have helped business outcomes such as operational...

HEADLINES

The telco leader empowers its vendors to adopt sustainability practices via capacity-building initiatives, enabling them to enhance their competitive edge while meeting evolving consumer...

Advertisement