Sequence models have skyrocketed in popularity for their ability to analyze data & predict what to do next.
— MIT CSAIL (@MIT_CSAIL) 17 octobre 2024
MIT’s "Diffusion Forcing" method combines the strengths of next-token prediction (like w/ChatGPT) & video diffusion (like w/Sora), training neural networks to handle… pic.twitter.com/u9CPIGq0tj
Sequence models have skyrocketed in popularity for their ability to analyze data & predict what to do next. MIT’s "Diffusion Forcing" method combines the strengths of next-token prediction (like w/ChatGPT) & video diffusion (like w/Sora), training neural networks to handle