VMware and NVIDIA are joining forces to prepare enterprises for the age of generative AI. With their expanded strategic partnership, they aim to assist the hundreds of thousands of enterprises running on VMware’s cloud infrastructure in leveraging generative AI. The collaboration will provide a complete solution, including generative AI software and accelerated computing from NVIDIA, built on VMware Cloud Foundation and optimized for AI.
The VMware Private AI Foundation with NVIDIA enables enterprises to customize models and deploy generative AI applications such as intelligent chatbots, assistants, search engines, and summarization tools. This platform will empower enterprises to run their generative AI workloads alongside their data, addressing corporate data privacy, security, and control concerns.
Enterprises are eagerly integrating generative AI into their businesses to reap the benefits. McKinsey estimates that generative AI could contribute up to $4.4 trillion annually to the global economy. To expedite the realization of these benefits, enterprises are looking for ways to streamline the development, testing, and deployment of generative AI applications.
The VMware Private AI Foundation with NVIDIA allows enterprises to do just that. It provides the capability to customize large language models, produce secure and private models for internal use, offer generative AI as a service to users, and securely run inference workloads at scale. The platform will include integrated AI tools that enable enterprises to run proven models trained on their private data in a cost-efficient manner. It will be built on VMware Cloud Foundation and NVIDIA AI Enterprise software, delivering various benefits, including privacy, choice, performance, data-center scale, lower costs, accelerated storage and networking, rapid deployment, and time to value.
The platform will feature NVIDIA NeMo, an end-to-end, cloud-native framework included in NVIDIA AI Enterprise, allowing enterprises to build, customize, and deploy generative AI models virtually anywhere. NeMo combines customization frameworks, guardrail toolkits, data curation tools, and pretrained models to offer enterprises an easy, cost-effective, and fast way to adopt generative AI.
To support the VMware Private AI Foundation with NVIDIA, Dell Technologies, Hewlett Packard Enterprise, and Lenovo will be among the first to offer systems equipped with NVIDIA L40S GPUs, NVIDIA BlueField-3 DPUs, and NVIDIA ConnectX-7 SmartNICs. These technologies will supercharge enterprise large language model customization and inference workloads.
This collaborative effort between VMware and NVIDIA builds upon their decade-long partnership. Their co-engineering work has optimized VMware’s cloud infrastructure to run NVIDIA AI Enterprise with performance comparable to bare metal. Mutual customers will continue to benefit from the resource and infrastructure management and flexibility provided by VMware Cloud Foundation.
VMware plans to release the VMware Private AI Foundation with NVIDIA in early 2024, heralding an exciting new era for enterprises ready to embrace generative AI.