kapynAI / Models

Introducing Gemma 4 12B: a unified, encoder-free multimodal model

Gemma 4 12B is a new unified, encoder-free multimodal model. It offers strong performance across vision and language tasks without requiring a separate vision encoder. This advancement streamlines multimodal AI development and deployment.

Google DeepMind·Jun 9, 2026

Opening Kapyn…