Jensen Huang (NVIDIA Co-Founder) – Siggraph 2023 (Aug 2023)

The canonical use case of the future is a large language model on the front end of just about everything. Every single application, every single database, whenever you interact with a computer, you will likely be first engaging a large language model. That large language model will figure out what is your intention, what is your desire, what are you trying to do, given the context, and present the information to you in the best possible way.

– Huang @ 20:24

Chapters

00:26 Evolution of Computer Graphics and GPUs: A Glimpse into NVIDIA's Journey
07:04 The Era of Generative AI
19:26 Evolution and Impact of Accelerated Computing
24:30 Omniverse
33:48 Hugging Face Integration & AI Workbench
39:54 NVIDIA AI Enterprise and Ada Lovelace GPU
48:12 OpenUSD and Its Impact on 3D Industry
54:03 Connecting OpenUSD, Omniverse, and ChatUSD
01:05:58 Largest Opportunity for Software: Digitizing Heavy Industries
01:13:24 Industrial Digitization & Recap

Abstract

At the 2023 SIGGRAPH conference, NVIDIA founder and CEO Jensen Huang provided a comprehensive look into the company’s technological advancements over the past several years, offering insights into their vision to transform industries through graphics processing units (GPUs), artificial intelligence (AI), and simulation platforms like Omniverse.

The Evolution of GPUs – From Programmable Shading to Integrating AI

Huang reflected on NVIDIA’s 20-year journey beginning with the introduction of the world’s first programmable shading GPU. This laid the foundation for the eventual realization of real-time ray tracing via the launch of their RTX platform in 2018, which required a complete redesign of the GPU architecture to incorporate specialized ray tracing accelerators.

To achieve the vision of RTX, NVIDIA also had to bridge computer graphics and AI, as exemplified in their debut demo of the Turing GPU called “Star Wars Reflections.” This 720p 30fps demo utilized ray tracing and some rasterization, with effects like ambient occlusion, area lights, and specular reflections. It was enhanced to 4K using the DLSS AI super sampling technology.

Within five years, rapid advancements were made evident through the new “Racer RTX” demo which boasted 250 million polygons rendered entirely with path tracing at 1080p, and upscaled to 4K using DLSS. Compared to just 2.5 million polygons previously, this monumental improvement highlights the immense progress in real-time computer graphics and NVIDIA’s pioneering efforts.

Reinventing GPU Architecture for the AI Era

While revolutionizing computer graphics by harnessing AI, NVIDIA also transformed the fundamental GPU architecture to optimize performance for AI workloads. Evolving from the Turing GPU presented five years prior, the new Hopper GPU represents the culmination of this evolution with its 1 trillion transistors, 35,000 components, and manufacturing precision comparable to electric vehicles.

With over a decade dedicated to advancing AI, NVIDIA heralded the onset of the generative AI era which combines large language models and generative networks to produce intelligent content. The profound potential of this technology has garnered tremendous attention, spurring extensive research and funding globally.

Pushing the Boundaries with Next-Generation Hardware

The newly announced Grace Hopper Superchip GH200 exemplifies the tailored approach for the AI era. Designed for giant-scale cloud infrastructure, it replaces generic CPUs with specialized components like Arm cores, boosting both performance and efficiency. NVIDIA also highlighted the scalability of GH200 systems, with configurations of up to 1200 nodes promised to deliver unmatched capabilities.

Alongside the Hopper GPU and GH200, NVIDIA unveiled the latest Ada Lovelace GPUs and enterprise servers packed with L40S GPUs. While not meant for training colossal models, these servers can efficiently fine-tune mainstream AI models, offering up to 1.5x higher throughput over previous hardware generations. Paired with NVIDIA’s comprehensive software stack encompassing AI Enterprise and AI Workbench, users can tap into state-of-the-art tools to engage with generative AI.

Pioneering the Next Computing Platform – Large Language Models

Huang emphasized the rise of large language models as the new computing platform, with generative AI representing the ultimate application. The astounding capabilities of models like the 40 billion parameter GPT-3 which can understand user intentions, facilitate searches, and transform experiences by acting as intermediaries between users and data/applications, highlight this monumental shift.

With human language becoming the programming paradigm, these models promise to reshape computing by acting as intermediaries between users and data or applications. Their ability to understand natural language queries and intentions will transform user experiences across industries.

Revolutionizing 3D – The OpenUSD Vision

Huang likened the potential impact of the new OpenUSD standard as akin to the birth of HTML for the 2D web. By unifying diverse tools under a common interchange format for constructing 3D worlds, it tackles workflow inefficiencies stemming from fragmented tools and serialized processes. With growing traction across sectors, NVIDIA envisions OpenUSD establishing a parallel “spoke and hub” workflow with centralized USD data assets.

Omniverse, NVIDIA’s platform for connecting tools, serves as the vehicle for realizing the OpenUSD vision. Built from the ground up around USD, Omniverse facilitates efficient live collaboration, interchange, and physics-based simulation. NVIDIA has contributed extensively to OpenUSD’s evolution, ensuring assets are physics-accurate and hyperscale-ready.

Infusing AI into Applications – From Avatars to Robotics

NVIDIA incorporated AI capabilities like facial animation from speech and generating virtual environments by querying language models into Omniverse through new APIs. These innovations enable smart avatars, automated repetitive tasks, and quick scene prototyping.

The fusion of simulation and AI also allows robots to be trained virtually within Omniverse before real-world deployment. Companies like BMW, Mercedes, and Boeing leverage Omniverse for design collaboration and digital twins. Huang envisions future robot programming through natural language prompts and examples rather than explicit coding.

The Future of Industrial Digitalization

Omniverse, combined with AI, can propel the digital transformation of heavy industries seeking enhanced productivity, reduced waste, and minimized physical prototyping. Adoption is already underway at automotive companies, while NVIDIA is creating a digital twin of Earth’s climate. OpenUSD and simulation hold the key to unifying data and tools across complex industrial workflows.

The Legacy of Computer Graphics

In closing, Huang paid homage to the field of computer graphics, emphasizing its importance to NVIDIA. The company has dedicated three decades to advancing this sector, reflecting its deep commitment. He credited the SIGGRAPH community for enabling innovations like Omniverse and OpenUSD. Huang left the audience with a whimsical note about the cost-saving advantages of accelerated computing architectures like Grace Hopper, and the promise of generative AI.


Notes by: empiricist