    Google presents Genie: Text to Video Game AI

    Google has introduced Genie, which it describes as the first generative interactive environment trained in an unsupervised manner from unlabeled internet videos. The model can generate a wide variety of controllable virtual worlds from text, synthetic images, photographs, and even sketches. With 11 billion parameters, Genie serves as a foundation world model made up of three components: a spatiotemporal video tokenizer, an autoregressive dynamics model, and a simple, scalable latent action model. Although it is trained without ground-truth action labels or the other domain-specific requirements typical of the world model literature, Genie lets users act in the generated environments on a frame-by-frame basis. In addition, the learned latent action space makes it possible to train agents to imitate behaviors from unseen videos, which opens a path toward more general agents.
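
To make the frame-by-frame interaction concrete, here is a minimal toy sketch of how the three components could fit together. It is not Genie's implementation: the function names, the vocabulary size, the number of latent actions, and the hand-written update rule are all assumptions chosen for illustration, and the real tokenizer and dynamics model are learned networks.

```python
from typing import List

# All constants below are illustrative assumptions, not values from the article.
NUM_LATENT_ACTIONS = 8  # assumed size of the discrete latent action set
VOCAB_SIZE = 16         # hypothetical number of distinct frame tokens

Frame = List[int]  # a frame represented as a list of discrete token ids


def tokenize(pixels: List[float]) -> Frame:
    """Stand-in for the video tokenizer: quantize values in [0, 1) to token ids."""
    return [min(int(p * VOCAB_SIZE), VOCAB_SIZE - 1) for p in pixels]


def dynamics_step(history: List[Frame], action: int) -> Frame:
    """Stand-in for the autoregressive dynamics model.

    A learned model would condition on every past frame and the chosen
    latent action. This toy rule only shifts the most recent frame's
    tokens by (action + 1), so the effect of an action is easy to see.
    """
    last = history[-1]
    return [(t + action + 1) % VOCAB_SIZE for t in last]


def rollout(prompt_pixels: List[float], actions: List[int]) -> List[Frame]:
    """Generate one new frame per user-chosen latent action."""
    history = [tokenize(prompt_pixels)]
    for a in actions:
        if not 0 <= a < NUM_LATENT_ACTIONS:
            raise ValueError(f"latent action {a} is out of range")
        # Each new frame is appended to the history and fed back in,
        # which is what makes the generation autoregressive.
        history.append(dynamics_step(history, a))
    return history


frames = rollout([0.0, 0.25, 0.5, 0.75], actions=[0, 2])
print(frames)
```

The loop mirrors the description above: a prompt image is tokenized once, then each user action produces exactly one new frame, which becomes part of the context for the next step.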


    • Genie
    • Generative interactive environments
    • Unsupervised training
    • Virtual worlds
    • Latent action model
    • Behavior imitation
    • Generalist agents


    1. What is Genie? Genie is a generative interactive environment developed by Google and trained in an unsupervised manner on unlabeled internet videos. It can create controllable virtual worlds from several kinds of prompt, including text, synthetic images, photographs, and sketches.

    2. What sets Genie apart from other models? Genie is trained without the ground-truth action labels or domain-specific requirements that world models typically depend on. Even so, users can interact with the generated environments on a frame-by-frame basis.

    3. How does Genie facilitate agent training? The latent action space learned by Genie allows for the training of agents to imitate behaviors from videos that were not part of the training data. This capability opens up possibilities for developing generalist agents with a wide range of skills and abilities.
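
The idea behind the third answer can be sketched with a toy example: label an action-free video with inferred latent actions, then fit a simple policy that imitates them. Everything here is an assumption made for illustration, including the function names, the brute-force action inference, and the lookup-table policy. A learned latent action model and a trained policy network would take their place in practice.

```python
from collections import Counter, defaultdict
from typing import Dict, List, Tuple

# Illustrative assumptions, not values from the article.
NUM_LATENT_ACTIONS = 8
VOCAB_SIZE = 16

Frame = Tuple[int, ...]  # a frame as a tuple of discrete token ids


def infer_latent_action(prev: Frame, nxt: Frame) -> int:
    """Stand-in for a latent action model.

    Picks the action id that best explains the transition under a toy
    rule (next token = previous token + action + 1, modulo VOCAB_SIZE).
    """
    def mismatch(action: int) -> int:
        return sum((p + action + 1) % VOCAB_SIZE != n for p, n in zip(prev, nxt))

    return min(range(NUM_LATENT_ACTIONS), key=mismatch)


def label_video(frames: List[Frame]) -> List[Tuple[Frame, int]]:
    """Turn an unlabeled video into (observation, latent action) pairs."""
    return [
        (frames[i], infer_latent_action(frames[i], frames[i + 1]))
        for i in range(len(frames) - 1)
    ]


def fit_policy(pairs: List[Tuple[Frame, int]]) -> Dict[Frame, int]:
    """Toy behavior cloning: remember the most common action per observation."""
    votes: Dict[Frame, Counter] = defaultdict(Counter)
    for obs, action in pairs:
        votes[obs][action] += 1
    return {obs: counts.most_common(1)[0][0] for obs, counts in votes.items()}


# A short video with no action labels attached.
video = [(0, 4, 8, 12), (1, 5, 9, 13), (4, 8, 12, 0)]
policy = fit_policy(label_video(video))
print(policy)
```

The point of the sketch is the data flow: the video itself carries no action labels, yet the inferred latent actions give an agent something to imitate.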
