
NVIDIA Accelerates Google DeepMind’s DiffusionGemma for Local AI

DiffusionGemma is an open model from Google DeepMind for fast text generation. NVIDIA has optimized it to run faster on GeForce RTX GPUs, the RTX PRO platform, and DGX Spark systems, targeting low-latency single-user workloads. The model generates multiple words in parallel, which opens new possibilities for developers running text generation models locally and in the cloud.

NVIDIA AI · Jun 10, 2026
