Google DeepMind has introduced Genie 3, its most advanced “world model” to date. The system can create navigable, fully interactive 3D environments from a single text prompt, rendering them in real time at 720p and 24 frames per second. Genie 3 keeps track of prior frames for roughly one minute, allowing objects, lighting and physics to remain coherent for several minutes of exploration. Users can also trigger “promptable world events”—such as changing the weather or adding new characters—without restarting the simulation. The model marks a substantial leap over December’s Genie 2, which was limited to 360p output and about 10-to-20 seconds of stable play. In an internal test, DeepMind’s generalist SIMA agent successfully executed instructions inside a Genie-generated warehouse, underscoring the system’s potential for training embodied AI. DeepMind says the technology could streamline game development, provide infinite training grounds for robots and serve as a step toward artificial general intelligence. For now, Genie 3 is available only in a restricted research preview to select academics and creators while the company studies safety, scalability and longer-duration performance.
Hear me out: a Genie 3 world that gets processed live into a Gaussian splat as you move, so exploring actively builds the 3D environment. No more 1-minute history limit, just tangible exploration that stays consistent. https://t.co/asa1V8t8Ri
Is Genie 3 a larger step towards AGI than GPT-5? 🤔 https://t.co/rA7HhDZ8jQ
Using a text prompt, Google's latest Genie 3 model can create interactive 3D worlds that can be navigated with a mouse and keyboard. https://t.co/NqhCHyvdo5