kapynAI / Models

DiffusionGemma

DiffusionGemma is an open-weight model for faster text generation. This Apache 2.0 licensed model, based on Gemma, offers speeds of over 500 tokens per second and is available through NVIDIA's free cloud API.

Simon Willison·Jun 10, 2026

Opening Kapyn…