Residual Context Diffusion (RCD) enhances diffusion large language models by reusing the computation spent on discarded tokens. Rather than throwing that work away, this new module retains contextual information from less confident tokens and carries it into subsequent decoding iterations, reducing the computation wasted during parallel token generation.
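To make the idea concrete, here is a minimal toy sketch of one decoding step. It is not the RCD implementation: the summary above does not specify how the retained context is computed or injected, so everything here is an assumption for illustration. In particular, the names `expected_embedding` and `decode_step`, the `threshold` and `residual_weight` parameters, and the choice to blend a probability-weighted average of token embeddings into the mask embedding are all hypothetical.

```python
from typing import List, Optional, Tuple

Vector = List[float]


def expected_embedding(probs: List[float], embeddings: List[Vector]) -> Vector:
    """Probability-weighted average of the vocabulary embeddings (a "soft" token)."""
    dim = len(embeddings[0])
    return [sum(p * emb[d] for p, emb in zip(probs, embeddings)) for d in range(dim)]


def decode_step(
    probs_per_pos: List[List[float]],
    embeddings: List[Vector],
    mask_embedding: Vector,
    threshold: float = 0.9,        # hypothetical confidence cutoff
    residual_weight: float = 0.5,  # hypothetical mixing weight
) -> Tuple[List[Optional[int]], List[Vector]]:
    """One toy parallel decoding step.

    Confident positions are committed to their top token. Low-confidence
    positions stay masked, but instead of feeding back a plain mask embedding
    (which would discard the prediction), the mask embedding is blended with
    the soft embedding of the discarded prediction.
    """
    committed: List[Optional[int]] = []
    next_inputs: List[Vector] = []
    for probs in probs_per_pos:
        best = max(range(len(probs)), key=probs.__getitem__)
        if probs[best] >= threshold:
            committed.append(best)
            next_inputs.append(list(embeddings[best]))
        else:
            committed.append(None)  # position remains masked for the next step
            soft = expected_embedding(probs, embeddings)
            next_inputs.append(
                [(1 - residual_weight) * m + residual_weight * s
                 for m, s in zip(mask_embedding, soft)]
            )
    return committed, next_inputs
```

In this sketch, a position whose top probability falls below `threshold` still contributes to the next iteration through its blended input vector, which is the sense in which the computation behind a discarded prediction is reused rather than wasted.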