Connect with us

Hi, what are you looking for?

Biz Solutions

Mellanox works with VMware and NVIDIA to enable virtualized machine learning solutions

Benchmarks demonstrate that the NVIDIA vComputeServer (vCS) for virtualized GPUs achieve two times better efficiency by using VMware’s paravirtualized RDMA (PVRDMA) technology than when using traditional networking protocols.

Mellanox Technologies announced that its RDMA (Remote Direct Memory Access) networking solutions for VMware vSphere enable virtualized Machine Learning solutions that achieve higher GPU utilization and efficiency.

Benchmarks demonstrate that the NVIDIA vComputeServer (vCS) for virtualized GPUs achieve two times better efficiency by using VMware’s paravirtualized RDMA (PVRDMA) technology than when using traditional networking protocols. The benchmark was performed on a four-node cluster running vSphere 6.7 equipped with NVIDIA T4 GPUs with vCS software and Mellanox ConnectX-5 100 GbE SmartNICs, all connected by a Mellanox Spectrum SN2700 100 GbE switch.

The PVRDMA Ethernet solution enables VM-to-VM communication over RDMA, which boosts data communication performance in virtualized environments while achieving significantly higher efficiency compared with legacy TCP/IP transports. Additionally, PVRDMA retains core virtual machine capabilities such as vMotion. This translates to real-world customer advantages including optimized server and GPU utilization, reduce machine learning training time and improved scalability. Using PVRDMA also reduces backup times, improves data center simplicity, simplifies consolidation, lowers power consumption and reduces total cost of ownership.

“As Moore’s Law has slowed, traditional CPU and networking technologies are no longer sufficient to support the emerging machine learning workloads,” said Kevin Deierling, senior vice president marketing, Mellanox Technologies. “Using hardware compute accelerators such as NVIDIA T4 GPUs and Mellanox’s RDMA networking solutions has proven to boost application performance in virtualized deployments.”

Advertisement. Scroll to continue reading.

NVIDIA T4 GPUs supercharge the world’s most trusted mainstream servers, easily fitting into standard data center infrastructures. Their low-profile, 70-watt design is powered by NVIDIA Turing™ Tensor Cores, delivering revolutionary multi-precision performance to accelerate a wide range of modern applications, including machine learning, deep learning, and virtual desktops. With the latest vComputeServer software for GPU virtualization, it also provides maximum performance and manageability for AI, ML and data science workloads in a virtualized server environment.

“Machine learning has become extremely important and every company, regardless of size, must leverage its power to remain competitive,” said Bob Pette, vice president, Professional Visualization NVIDIA. “Our collaboration with VMware and Mellanox creates a high-performance GPU platform that enables acceleration for compute-intensive workloads in the most efficient way.”

Machine learning workloads are extremely resource intensive, often relying on hardware acceleration to achieve the performance necessary to solve large, complex problems in a timely manner. Interconnect acceleration – special hardware that delivers extremely high bandwidth and low latency, and compute acceleration – often delivered through exploitation of very highly-parallel GPU compute engines, are the most common forms of such acceleration. While both types of acceleration have long been available on vSphere, it is now possible with vSphere to combine these technologies to support advanced machine learning applications that allow applications to combine the compute power of NVIDIA GPUs with the high-performance data transfer capabilities of Mellanox RDMA capable adapters, enabling linear scalability.

“Modern data center infrastructures need to keep pace with the compute and efficiency requirements for the exceedingly complex machine learning computational models,” said Sudhanshu (Suds) Jain, Product Management, Cloud Platform Business Units, VMware. “The ability to virtualize GPUs using the latest NVIDIA vComputeServer product and Mellanox’s high-speed networking solutions over vSphere makes it possible to meet those requirements while keeping the cost intact.”

VMware vSphere is fully qualified with Mellanox ConnectX 10/25/40/50/100G adapters today. All Mellanox adapters support PVRDMA over RoCE (RDMA over Converged Ethernet), enabling advanced capabilities like GPU virtualization, and making data center infrastructure RoCE-Ready as new technologies over RDMA become generally available. PVRDMA will also be supported by the latest ConnectX-6 Dx and BlueField-2 SmartNICs.

Advertisement. Scroll to continue reading.

Advertisement
Advertisement
Advertisement

Like Us On Facebook

You May Also Like

HEADLINES

NutriAsia has demonstrated how local businesses can effectively engage with online audiences as part of TikTok Shop's Buy Local, Shop Local campaign.

HEADLINES

Building on the legacy of the flagship Dimensity 9400, the MediaTek Dimensity 8400 brings an All Big Core design to the premium smartphone market...

HEADLINES

In collaboration with PLDT Enterprise, SGS has implemented e-waste bins in its facilities, ensuring the proper disposal of outdated and broken electronic devices. This...

HEADLINES

Running until January 15, 2025, every eligible transaction you make using PalawanPay gives you a chance to win investment-grade gold bars from the exclusive...

HEADLINES

In partnership with Caritas Manila, Converge has been supporting the Unang Yakap Program that aims to provide healthcare and nutritional assistance to underprivileged pregnant...

HEADLINES

This collaboration marks a significant milestone in enhancing mobile connectivity and capacity across the country.

HEADLINES

This recognition underscores HUAWEI’s influential success and dedication to revolutionize the wearables technology market.

HEADLINES

While over three-quarters of chief financial officers (CFOs) and tax leaders (SEA 82%, global 87%) say GenAI will drive increased efficiency and effectiveness, up...

Advertisement