Mirage, a video world model, learns persistent spatial memory directly in latent space for consistent long camera moves. This approach significantly reduces compute time and graphics memory compared to pixel-based methods. While impressive for scene consistency, it still struggles with reliably tracking moving objects across segments.
Opening Kapyn…