AI Infrastructure Milestone Achieved
In a pivotal moment for artificial intelligence, Microsoft has achieved a significant technological feat by becoming the world's inaugural cloud provider
to successfully install and validate NVIDIA's Vera Rubin NVL72 system. This momentous occasion underscores Microsoft's commitment to advancing the very foundation of AI, a critical element in the ongoing global competition to build superior AI capabilities. The Vera Rubin NVL72, introduced by NVIDIA CEO Jensen Huang, promises a dramatic enhancement in AI inference performance, boasting up to five times the speed and a tenfold reduction in cost per token compared to NVIDIA's current top-tier Blackwell chip. Microsoft's CEO, Satya Nadella, shared this exciting development, emphasizing it as another crucial step in building the next generation of AI infrastructure in collaboration with NVIDIA. This strategic move positions Microsoft at the forefront of AI innovation, enabling them to offer unparalleled processing power and efficiency to their clients.
Deepened NVIDIA Partnership
The integration of the Vera Rubin NVL72 system builds upon a long-standing and robust collaboration between Microsoft and NVIDIA, aimed at accelerating AI-driven industrial advancements. Their partnership, previously strengthened in October of the previous year, has consistently delivered sophisticated supercomputing power to the cloud, facilitating the creation of advanced AI models and making AI more accessible to a broad spectrum of organizations. This latest development further solidifies their joint efforts. NVIDIA's enhanced support for the Nvidia RTX PRO 6000 Blackwell Server Edition on Azure Local is a testament to this, enabling customers to manage distributed AI and visual computing workloads with cloud-like ease. Furthermore, the introduction of new NVIDIA Nemotron and NVIDIA Cosmos models within Azure AI Foundry provides businesses with a comprehensive platform for developing, deploying, and scaling AI applications and agents. The integration of NVIDIA Run:ai on Azure is also set to optimize GPU utilization for enterprises, streamlining operations and speeding up AI initiatives. Ultimately, Microsoft's pioneering deployment of the NVIDIA GB300 NVL72 marks a redefined landscape for AI infrastructure.
Understanding Vera Rubin NVL72
The Vera Rubin NVL72 represents NVIDIA's most ambitious design for AI data center architecture to date. Described by NVIDIA CEO Jensen Huang, it is the culmination of what the company terms 'extreme co-design,' a process that seamlessly integrates six distinct chip types into a single, unified system. These integral components include the Vera CPU, the Rubin GPU, the NVLink 6 switch, the ConnectX-9 SuperNIC, the BlueField-4 data processing unit, and the Spectrum-6 Ethernet switch. Collectively, these elements form the fundamental building blocks of the Vera Rubin NVL72 rack. This singular unit of AI computing infrastructure surpasses any previous creation from NVIDIA in terms of sheer power and integrated functionality, setting a new benchmark for what is achievable in AI hardware.













