Can AI finally create multi-shot videos with seamless narrative flow and cinematic flair? Researchers from Beijing University of Posts and Telecommunications and Peking University present STAGE. This new method rethinks video generation by planning full shot-by-shot storyboards using start-to-end frame pairs. It employs smart memory and clever encoding to keep characters and scenes consistent, ensuring smooth visuals within and between shots. STAGE significantly outperforms existing methods, achieving superior narrative control and unparalleled visual consistency across cinematic sequences. STAGE: Storyboard-Anchored Generation for Cinematic Multi-shot Narrative Paper: arxiv.org/abs/2512.12372 Code: github.com/escapistmost/Stor… Our report: mp.weixin.qq.com/s/rmeF2tbIu… 📬 #PapersAccepted by Jiqizhixin
→ View original post on X — @jiqizhixin, 2026-04-04 18:55 UTC








