NVIDIA GENERATIVE AI

NVIDIA Brings Generative AI to Windows PCs and Workstations

NVIDIA, a leading technology company, is bringing powerful artificial intelligence (AI) capabilities to Windows PCs and workstations. Its Tensor Cores, integrated into RTX GPUs, are driving the development and deployment of generative AI models. With upcoming advancements like Max-Q low-power AI inferencing, NVIDIA is improving the efficiency and performance of AI applications.

Generative AI: A New Era of Computing

Generative AI is transforming various fields, including productivity, content creation, and gaming. This technology uses neural networks to recognize patterns and structures in existing data and generate new, original content. NVIDIA's NeMo, DLSS 3 Frame Generation, Meta's LLaMA, ChatGPT, Adobe Firefly, and Stable Diffusion are just a few examples of powerful generative AI models and applications.

The Power of Tensor Cores

Tensor Cores are specialized units on certain GPUs that accelerate artificial intelligence and machine learning workloads. They are designed to perform matrix multiplications much faster than general-purpose processor cores. Matrix multiplications underpin many AI and machine learning tasks, such as recognizing objects in images, understanding spoken language, and making predictions from data. By speeding up these matrix operations, Tensor Cores help AI programs run more quickly and efficiently.
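To make the idea concrete, here is a minimal NumPy sketch (illustrative only, not NVIDIA code) of the matrix multiplication at the heart of a neural-network layer; this is exactly the kind of operation Tensor Cores accelerate in hardware:

```python
import numpy as np

# A dense (fully connected) layer is, at its core, one matrix multiply.
# The shapes and random values here are purely illustrative.
rng = np.random.default_rng(0)
x = rng.standard_normal((1, 4))   # one input sample with 4 features
W = rng.standard_normal((4, 3))   # weights of a 4-in, 3-out layer
b = np.zeros(3)                   # bias vector

y = x @ W + b                     # the matrix multiply the layer performs
print(y.shape)                    # -> (1, 3): one sample, 3 outputs
```

A real model chains thousands of such multiplications over much larger matrices, which is why dedicated matrix-multiply hardware yields such large speedups.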

NVIDIA’s GeForce RTX and NVIDIA RTX GPUs, featuring Tensor Cores, provide up to 1,400 Tensor TFLOPS for AI inferencing. When optimized, generative AI models can run up to 5 times faster than on competing devices. Tensor Cores are dedicated hardware specifically designed to accelerate AI calculations, resulting in improved performance. Regular software updates further enhance the capabilities of Tensor Cores, as demonstrated by the recent DirectML optimizations presented at the Microsoft Build conference.

Efficient AI Workloads with Max-Q

As AI inferencing increasingly takes place on local devices, powerful and efficient hardware becomes essential. To address this need, RTX GPUs will introduce Max-Q low-power inferencing for AI workloads. This feature allows GPUs to operate at lower power levels for lighter tasks while scaling up performance for heavy generative AI workloads.

Complete AI Development Stack

NVIDIA offers developers a comprehensive RTX-accelerated AI development stack on Windows 11, simplifying the creation, training, and deployment of advanced AI models. The development process begins with optimizing and fine-tuning models using deep learning frameworks available via Windows Subsystem for Linux. Developers can seamlessly transition to the cloud for training using the same NVIDIA AI stack, available from major cloud service providers. Finally, they can optimize their models for fast inferencing using tools like Microsoft Olive and deploy AI-enabled applications to a vast user base of over 100 million AI-optimized RTX PCs and workstations.
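As a rough sketch of the deployment step, an application typically loads its optimized model through ONNX Runtime and picks the best available execution provider, such as DirectML on RTX-equipped Windows machines. The provider names below are real ONNX Runtime identifiers, but the helper function itself is a hypothetical illustration, not part of any NVIDIA or Microsoft API:

```python
# Provider names are real ONNX Runtime identifiers; the preference
# ordering and this helper are an illustrative assumption.
PREFERRED = ["DmlExecutionProvider",   # DirectML (GPU on Windows)
             "CUDAExecutionProvider",  # CUDA (NVIDIA GPU)
             "CPUExecutionProvider"]   # always-available fallback

def pick_providers(available):
    """Return the preference-ordered subset of providers present
    on this machine, falling back to CPU if none match."""
    chosen = [p for p in PREFERRED if p in available]
    return chosen or ["CPUExecutionProvider"]

# In a real deployment you would pass the result to ONNX Runtime, e.g.:
#   import onnxruntime as ort
#   session = ort.InferenceSession(
#       "model.onnx",
#       providers=pick_providers(ort.get_available_providers()))
```

Falling back to the CPU provider keeps the same application code running on machines without an AI-capable GPU, at reduced speed.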

The Future of AI Innovation

Microsoft’s Pavan Davuluri acknowledges that AI will drive significant innovation for Windows users in the years to come. Through close collaboration with NVIDIA, hardware and software optimizations equip developers with a transformative, high-performance, and easy-to-deploy AI experience. Over 400 RTX AI-accelerated apps and games have already been released, with more to come.

NVIDIA Avatar Cloud Engine (ACE) for Games

During COMPUTEX 2023, NVIDIA’s CEO, Jensen Huang, introduced the NVIDIA Avatar Cloud Engine (ACE) for Games. This custom AI model foundry service brings intelligence to non-playable characters in games, enabling natural language interactions. Game developers can utilize ACE for Games to build and deploy customized speech, conversation, and animation AI models, revolutionizing the gaming experience.

Generative AI on RTX, Anywhere

Generative AI powered by RTX GPUs is accessible across various platforms, from servers to the cloud and devices. NVIDIA’s accelerated AI computing approach spans the entire technology stack, leveraging optimized hardware and software, including fourth-generation Tensor Cores. NVIDIA’s commitment to AI optimization ensures peak performance, with recent driver updates, Olive-optimized models, and enhancements to DirectML delivering significant speedups for developers on Windows 11.

The Future of AI on the Go

With the latest generation of RTX laptops and mobile workstations based on NVIDIA’s Ada Lovelace architecture, generative AI is more portable than ever. These next-gen mobile platforms offer exceptional performance in compact sizes, enabling users to harness the power of AI wherever they go. Leading manufacturers such as Dell, HP, Lenovo, and ASUS are pushing the boundaries of the generative AI era with their RTX GPU-powered devices, supported by Tensor Cores.

These manufacturers emphasize the transformative nature of generative AI and its impact across sectors, crediting NVIDIA's RTX GPU-powered PCs with enhancing user experiences, powering AI workloads, and fostering innovation.

NVIDIA’s Tensor Cores integrated into RTX GPUs have revolutionized the AI landscape, enabling faster and more efficient generative AI models. By optimizing hardware and software, NVIDIA empowers developers with a complete AI development stack, while advancements like Max-Q further enhance AI workloads on local devices. With the widespread adoption of generative AI and the continuous advancements made by NVIDIA, the future of AI innovation is brighter than ever.
