HEADLINES

Google Cloud and NVIDIA expand partnership to scale AI development

Published

April 24, 2024

Google Cloud and NVIDIA announced a deepened partnership to equip the machine learning (ML) community with technology that accelerates its efforts to build, scale and manage generative AI applications.

To continue bringing AI breakthroughs to its products and developers, Google announced its adoption of the new NVIDIA Grace Blackwell AI computing platform, as well as the NVIDIA DGX Cloud service on Google Cloud. Additionally, the NVIDIA H100-powered DGX Cloud platform is now generally available on Google Cloud.

Building on its recent collaboration with NVIDIA to optimize the Gemma family of open models, Google will also adopt NVIDIA NIM inference microservices to provide developers with an open, flexible platform to train and deploy using their preferred tools and frameworks. The companies also announced support for JAX on NVIDIA GPUs and Vertex AI instances powered by NVIDIA H100 and L4 Tensor Core GPUs.

“The strength of our long-lasting partnership with NVIDIA begins at the hardware level and extends across our portfolio – from state-of-the-art GPU accelerators, to the software ecosystem, to our managed Vertex AI platform,” said Google Cloud CEO Thomas Kurian. “Together with NVIDIA, our team is committed to providing a highly accessible, open and comprehensive AI platform for ML developers.”


“Enterprises are looking for solutions that empower them to take full advantage of generative AI in weeks and months instead of years,” said Jensen Huang, founder and CEO of NVIDIA. “With expanded infrastructure offerings and new integrations with NVIDIA’s full-stack AI, Google Cloud continues to provide customers with an open, flexible platform to easily scale generative AI applications.”

The new integrations between NVIDIA and Google Cloud build on the companies’ longstanding commitment to providing the AI community with leading capabilities at every layer of the AI stack. Key components of the partnership expansion include:

Adoption of NVIDIA Grace Blackwell: The new Grace Blackwell platform enables organizations to build and run real-time inference on trillion-parameter large language models. Google is adopting the platform for various internal deployments and will be one of the first cloud providers to offer Blackwell-powered instances.
Grace Blackwell-powered DGX Cloud coming to Google Cloud: Google will bring NVIDIA GB200 NVL72 systems, which combine 72 Blackwell GPUs and 36 Grace CPUs interconnected by fifth-generation NVLink®, to its highly scalable and performant cloud infrastructure. Designed for energy-efficient training and inference in an era of trillion-parameter LLMs, NVIDIA GB200 NVL72 systems will be available via DGX Cloud, an AI platform offering a serverless experience for enterprise developers building and serving LLMs. DGX Cloud is now generally available on Google Cloud A3 VM instances powered by NVIDIA H100 Tensor Core GPUs.
Support for JAX on GPUs: Google Cloud and NVIDIA collaborated to bring the advantages of JAX to NVIDIA GPUs, widening access to large-scale LLM training among the broader ML community. JAX is a framework for high-performance machine learning that is compiler-oriented and Python-native, making it one of the easiest to use and most performant frameworks for LLM training. AI practitioners can now use JAX with NVIDIA H100 GPUs on Google Cloud through MaxText and Accelerated Processing Kit (XPK).
NVIDIA NIM on Google Kubernetes Engine (GKE): NVIDIA NIM inference microservices, a part of the NVIDIA AI Enterprise software platform, will be integrated into GKE. Built on inference engines including TensorRT-LLM, NIM helps speed up generative AI deployment in enterprises, supports a wide range of leading AI models and ensures seamless, scalable AI inferencing.
Support for NVIDIA NeMo: Google Cloud has made it easier to deploy the NVIDIA NeMo framework across its platform via Google Kubernetes Engine (GKE) and Google Cloud HPC Toolkit. This enables developers to automate and scale the training and serving of generative AI models, and it allows them to rapidly deploy turnkey environments through customizable blueprints that jump-start the development process. NVIDIA NeMo, part of NVIDIA AI Enterprise, is also available in the Google Marketplace, providing customers with another way to easily access NeMo and other frameworks to accelerate AI development.
Vertex AI and Dataflow expand support for NVIDIA GPUs: To advance data science and analytics, Vertex AI now supports Google Cloud A3 VMs powered by NVIDIA H100 GPUs and G2 VMs powered by NVIDIA L4 Tensor Core GPUs. This provides MLOps teams with scalable infrastructure and tooling to confidently manage and deploy AI applications. Dataflow has also expanded support for accelerated data processing on NVIDIA GPUs.

Google Cloud has long offered GPU VM instances powered by NVIDIA’s cutting-edge hardware coupled with leading Google innovations. NVIDIA GPUs are a core component of the Google Cloud AI Hypercomputer – a supercomputing architecture that unifies performance-optimized hardware, open software and flexible consumption models. The holistic partnership enables AI researchers, scientists and developers to train, fine-tune and serve the largest and most sophisticated AI models – now with even more of their favorite tools and frameworks jointly optimized and available on Google Cloud.
