Real-time world model generation from text and image prompts

AI Dynamics

Global AI News Aggregator

Real-time world model generation from text and image prompts

–

05 February 2026 2h28

In our research lab, we are building “real-time dreaming” – the ability to generate fully playable video worlds prompted from any text or image.

Our real-time, action conditioned world model (currently running internally at 16fps at 832x480p) is trained on a combination of data,… pic.twitter.com/R5CKJkFuh2
— Roblox (@Roblox) 5 février 2026

In our research lab, we are building “real-time dreaming” – the ability to generate fully playable video worlds prompted from any text or image. Our real-time, action conditioned world model (currently running internally at 16fps at 832x480p) is trained on a combination of data, including proprietary Roblox 3D avatar/world interaction data. World models are different from multiplayer engines in that they store state and memory in video latents. Roblox is multiplayer, and we are actively researching optimal ways to simultaneously store state for thousands of players, and keep them in sync with their environment. Our world model leverages database technology which stores all user interactions on Roblox in a vector format that can be used to re-render video and interaction from any camera angle. We see several immediate uses for our Roblox world model. We will use it side-by-side text, image and video prompts as a way to launch auto-generation of immersive worlds. In Roblox Studio, a creator could walk around and use prompts to “paint” a world and then convert it into a 3D representation or direct to Roblox native as a way for many people to play simultaneously. All of this comes alive as we explore the notion of a “Dream Theater” – where one user is dreaming, while others watch and prompt them. 2/4 Community note: The beginning of the video has stolen assets/design from Clair Obscur: Expedition 33 by Sandfall Interactive. The environment is the Flying Waters area and the girl in the video is Maelle. store.steampowered.com/app/1903340/Cl… expedition33.wiki.fextralife.com/Flying+Waters clair-obscur.fandom.com/wiki/Maelle

→ View original post on X — @shiqi_yang_147, 2026-02-05 01:28 UTC

5 February 2026

AI CREATIVE AI GENERATIVE AI INNOVATION MACHINE LEARNING MULTIMODAL AI RESEARCH

Real-time world model generation from text and image prompts

MORE ARTICLES

Paper praised for executing Gato idea with humanoid; more work desired

Skild Brain AI enables robots to handle unfamiliar environments

Proposal to replace Google Search with Gemini

Using video to learn control representations, touch important