ByteDance's "iLLaDA" is a diffusion language model that keeps up with Qwen2.5
ByteDance unveils iLLaDA, an 8B diffusion language model competitive with Qwen2.5. This new model generates text via a diffusion process, achieving comparable base performance to Qwen2.5 but lagging in fine-tuned capabilities.