AI Dynamics

Global AI News Aggregator

Real-time world model generation from text and image prompts

In our research lab, we are building “real-time dreaming” – the ability to generate fully playable video worlds prompted from any text or image. Our real-time, action conditioned world model (currently running internally at 16fps at 832x480p) is trained on a combination of data, including proprietary Roblox 3D avatar/world interaction data. World models are different from multiplayer engines in that they store state and memory in video latents. Roblox is multiplayer, and we are actively researching optimal ways to simultaneously store state for thousands of players, and keep them in sync with their environment. Our world model leverages database technology which stores all user interactions on Roblox in a vector format that can be used to re-render video and interaction from any camera angle. We see several immediate uses for our Roblox world model. We will use it side-by-side text, image and video prompts as a way to launch auto-generation of immersive worlds. In Roblox Studio, a creator could walk around and use prompts to “paint” a world and then convert it into a 3D representation or direct to Roblox native as a way for many people to play simultaneously. All of this comes alive as we explore the notion of a “Dream Theater” – where one user is dreaming, while others watch and prompt them. 2/4 Community note: The beginning of the video has stolen assets/design from Clair Obscur: Expedition 33 by Sandfall Interactive. The environment is the Flying Waters area and the girl in the video is Maelle. store.steampowered.com/app/1903340/Cl… expedition33.wiki.fextralife.com/Flying+Waters clair-obscur.fandom.com/wiki/Maelle

→ View original post on X — @shiqi_yang_147, 2026-02-05 01:28 UTC

Commentaires

Leave a Reply

Your email address will not be published. Required fields are marked *