Genie 3 icon

Genie 3

Genie 3 is Google DeepMind’s world model for generating interactive, photorealistic environments from text or images via Project Genie.

Genie 3

Overview

Genie 3 is Google DeepMind’s general-purpose world model for generating interactive, photorealistic environments from simple text descriptions. The page frames it as a real-time system for creating worlds that can be explored, with a focus on physical simulation, controllable interaction, and visual consistency.

The product is surfaced through Project Genie, an experimental research prototype, and a prompt guide that explains how to shape a world before entering it. DeepMind positions Genie 3 as useful for modelling the physical world, simulating nature, and producing animation or fiction-like scenes, while also describing it as a step toward more capable AI agents that can reason and act in complex environments.

Capabilities

Text-to-world generation

Genie 3 is presented as a general-purpose world model that turns simple text descriptions into photorealistic environments that can be explored in real time.

Real-time interaction

The page says Genie 3 supports fluid interaction at 20-24 frames per second, allowing users and agents to move through generated worlds without long pauses between actions.

720p photorealistic output

The model renders output at 720p resolution, giving generated scenes enough visual detail for exploratory use and agent training scenarios.

World consistency over time

The homepage says previously seen details are recalled when revisited and that environments can sustain interaction without degrading as quickly.

Street View grounding

Genie is grounded in Street View data from Google Maps, which the site says helps anchor new worlds in reality while still allowing unexpected combinations.

Prompt-guided world building

The prompt guide explains that users can build worlds with text or images, then refine the environment, character, and preview before entering the world.

Use Cases

  • Build custom exploratory worlds

    Describe a landscape, character, and movement pattern to create a controllable world for interactive exploration or prototype testing.

  • Model physical environments

    Use the model’s real-time, photorealistic output to model deserts, seas, weather, and other physical environments for simulation-style exploration.

  • Simulate natural scenes

    Generate vibrant ecosystems or other nature-inspired scenes when the goal is to explore animal behavior, plant life, and environmental detail.

  • Produce fictional worlds

    Create imaginary settings, fantastical scenarios, or expressive animated characters for fiction and animation-oriented experimentation.

  • Iterate on prompt ideas

    Follow the prompt guide to sketch, upload, and remix worlds before entering them, which is useful when iterating on a scene concept or reference image.

Pros and Cons

Pros

  • Generates interactive worlds from short text prompts.
  • Supports real-time exploration with photorealistic 720p output.
  • Revisits previously seen details with better consistency than a short-lived simulation.
  • Includes a prompt guide with environment, character, and world sketch guidance.
  • Can also be driven with images through Project Genie and the prompt workflow.

Cons

  • The public materials do not include pricing, plan details, or a standard self-serve onboarding flow.
  • The page does not publish a full list of constraints, supported devices, or output/export options.

FAQ

How do you access Genie 3?

Genie 3 is accessed through Project Genie, which the page describes as an experimental research prototype for creating and exploring worlds. The homepage also links to a prompt guide for learning how to prompt it.

What kinds of prompts does Genie 3 use?

The prompt guide says Genie 3 can be prompted with text or images. It uses three elements in prompting: the environment, the character, and a world sketch preview generated by Nano Banana Pro, which can also be adjusted with a user-uploaded image.

What is the workflow for creating a world?

The homepage says Genie 3 generates photorealistic environments that can be explored in real time. The prompt guide adds that the workflow includes world sketching, image upload, and world remixing.

Is there public pricing for Genie 3?

The source describes Genie 3 as a general-purpose world model with real-time interaction, 720p output, and 20-24 frames per second. It is presented as a research model rather than a public consumer product with listed plans.

What are Genie 3's main limitations in the public materials?

The source highlights strengths in real-time interaction, photorealistic output, world consistency, and grounding in Street View data. It does not provide a full public list of product limits, onboarding steps, or team features.

Quick Facts

Product
Genie 3
Category
World model
Primary access
Project Genie
Source domain
deepmind.google
Output
Photorealistic interactive worlds
Pricing
Not publicly listed