Prediction: diffusion-based media models (images, video, music) are a dead end, and whoever cracks reasoning-based media models will win. We already saw a glimpse of this with OpenAI’s partially autoregressive native image generation – the level of control is far beyond what is
Reasoning Models Will Surpass Diffusion-Based Media Generation
By
–