Cisco announced a breakthrough AI cluster solution with NVIDIA for the data center that transforms how customers build, manage and optimize infrastructure and software.
Delivering on the Cisco Networking Cloud vision to simplify networking, Cisco is bringing to market a new enterprise-ready, end-to-end infrastructure solution to scale generative AI workloads. The Cisco Nexus HyperFabric AI cluster solution combines Cisco AI-native networking with NVIDIA accelerated computing and AI software, and a robust VAST data store. It is designed to enable customers to focus on AI-driven innovation and new revenue opportunities rather than IT management.
According to Cisco’s recent Global Networking Trends Report, 60% of IT leaders and professionals expect to deploy AI-enabled predictive network automation across all domains in the next two years to better manage NetOps. Additionally, 75% plan to deploy tools that offer end-to-end visibility, via a single console, into different network domains including campus and branch, WAN, data center, internet, public clouds and industrial networks.
“While the promise of AI is clear, the path forward for many just starting out is not. Customers often face economic and operational challenges to get an AI stack up and running,” said Jonathan Davidson, Executive Vice President and General Manager, Cisco Networking. “Cisco is committed to making the deployment and operation of AI infrastructure simpler. Together with NVIDIA, we are delivering a simple-to-deploy, cloud-operated AI-stack solution for on-premises deployments that builds on our Cisco Networking Cloud platform vision for automation and simplicity.”
“Generative AI requires purpose-built infrastructure and software that enables enterprises to securely turn their data into fuel for business transformation,” said Kevin Deierling, Senior Vice President of Networking at NVIDIA. “NVIDIA and Cisco are providing an enterprise-ready AI platform and control plane to simplify deployment of the accelerated computing, networking and software needed for generative AI workloads.”
At Cisco Live, the company is demonstrating how it is committed to helping its customers quickly deploy AI infrastructure. Cisco is also putting the right tools in the hands of its customers to build intuitive AI-native networks, anticipate failures and quickly diagnose and remediate problems.
How Cisco Nexus HyperFabric AI Cluster Works
The on-premises solution provides a single place to design, deploy, monitor and assure AI pods and data center workloads. It guides users from design through validated deployment to monitoring and assurance for enterprise-ready AI infrastructure. With its cloud management capabilities, customers can deploy and manage large-scale fabrics across data centers, colocation facilities and edge sites.
The Cisco Nexus HyperFabric AI cluster solution offers automated, cloud-managed operations across a unified compute and networking fabric. It combines Cisco’s Ethernet switching expertise, founded on Cisco Silicon One, with NVIDIA’s accelerated computing and NVIDIA AI Enterprise software and VAST’s data storage platform. The solution will include:
- Cisco cloud management capabilities to simplify IT operations across all phases of the workflow.
- Cisco Nexus 6000 series switches for spine and leaf that deliver 400G and 800G Ethernet fabric performance.
- Cisco Optics family of QSFP-DD modules to offer customer choice and deliver super high densities.
- NVIDIA AI Enterprise software to streamline the development and deployment of production-grade generative AI workloads.
- NVIDIA NIM inference microservices, available with NVIDIA AI Enterprise, that accelerate the deployment of foundation models while ensuring data security.
- NVIDIA Tensor Core GPUs starting with the NVIDIA H200 NVL, designed from the ground up to supercharge generative AI workloads with game-changing performance and memory capabilities.
- NVIDIA BlueField-3 data processing unit (DPU) and BlueField-3 SuperNIC for accelerating AI compute networking, data access and security workloads.
- Enterprise reference design for AI built on NVIDIA MGX, a modular and flexible server architecture.
- The VAST Data Platform, which offers unified storage, database and a data-driven function engine built for AI.
Availability
Select customers may have early trial access to this AI solution in Q4 of CY 2024, with general availability expected shortly thereafter.