# Creating videos with GenAI

Turn ideas or photos into cinematic, holographic-ready videos using [**Google AI Studio**](https://aistudio.google.com/) and Gemini’s **Nano-Banana** (AKA 2.5 Flash Image) and **Veo 3.1** models.\
This guide walks you through how to:

* Adapt a **photo** into an HLD-ready video
* Create a **new image entirely from text** then animate that image into a short, realistic video for the HLD.

While we'll be using [Google's AI Studio](https://aistudio.google.com/) for this tutorial, the same process is possible in a variety of other tools if you adapt the core logic.

<figure><img src="https://804846835-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FE9AhVJoaDxXfUQueIzAL%2Fuploads%2FBDEw7bAttVznZdHBC18O%2Fdivider.png?alt=media&#x26;token=fc4ff755-dfd1-4ec2-9088-9164e0c2447c" alt=""><figcaption></figcaption></figure>

## Overview

To create an HLD-style video, use **Gemini’s image and video generation tools** in [Google AI Studio](https://aistudio.google.com/) in this order:

1. Style or image generation (Nano-Banana)
   * Upload a subject image *or* describe it in text
   * Include our reference image to guide lighting and composition
2. Video generation (Veo 3.1)
   * Animate your resulting image into an HLD-ready video

The output is:

* 1080×1920 (9:16 portrait)
* 24 fps with native audio
* MP4

<figure><img src="https://804846835-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FE9AhVJoaDxXfUQueIzAL%2Fuploads%2FBDEw7bAttVznZdHBC18O%2Fdivider.png?alt=media&#x26;token=fc4ff755-dfd1-4ec2-9088-9164e0c2447c" alt=""><figcaption></figcaption></figure>

## What you’ll need

* A **Google account**
* Access to **Google AI Studio**
* Either:
  * A **subject image** (person or product)
  * **or** a **text prompt** describing what you’d like generated
* Our **reference image** — defines the environment, lighting, or style you want the subject to match

{% columns %}
{% column %}
Reference image if you're making a product video for the HLD:

{% file src="<https://804846835-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FE9AhVJoaDxXfUQueIzAL%2Fuploads%2FPqFnTeIyIW6QzJ5goA1D%2Fprodref.jpg?alt=media&token=263ba15d-b78f-4410-98ad-c86d0bb0afe4>" %}
{% endcolumn %}

{% column %}
Reference image if you're making a person or character video for the HLD:

{% file src="<https://804846835-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FE9AhVJoaDxXfUQueIzAL%2Fuploads%2FVc9oW4h9uf1XqREjR7N0%2Fpersref.jpg?alt=media&token=a60ad116-f4f4-4ed9-9dd6-a33eb0aaa431>" %}
{% endcolumn %}
{% endcolumns %}

<figure><img src="https://804846835-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FE9AhVJoaDxXfUQueIzAL%2Fuploads%2FBDEw7bAttVznZdHBC18O%2Fdivider.png?alt=media&#x26;token=fc4ff755-dfd1-4ec2-9088-9164e0c2447c" alt=""><figcaption></figcaption></figure>

## Step 0: Set up Google AI Studio

Before you generate images or videos, make sure your Google account has access to **Google AI Studio** and (optionally) that billing is enabled for **pay-as-you-go** usage of the Gemini API.

1. Choose the account you’ll use
   1. **Personal Google accounts**: You can open AI Studio and start prototyping immediately. No admin approval is required.
   2. **Google Workspace (work/school) accounts**: AI Studio is **on by default**, but your Workspace admin can turn it on/off per OU or group. If you see an “access disabled” message, contact your admin.

{% hint style="info" %}
**Admin guidance (Workspace):** In the Admin console, go to **Apps → Additional Google services → AI Studio → Service status** to enable it for everyone, an OU, or an access group. (Domain verification is required to manage additional services.) See more on [Google Help](https://support.google.com/a/answer/15004095).
{% endhint %}

2. Check regional & age availability
   1. AI Studio and the Gemini API are available [in most countries/regions](https://ai.google.dev/gemini-api/docs/available-regions); if you’re outside a supported region, use Vertex AI instead. Minimum age is 18.
3. Sign in to AI Studio & create an API key (optional but recommended)
   1. To connect your projects and tasks to your billing setup, you need an API key. You can create one in the [Google AI Studio dashboard](https://aistudio.google.com/u/1/api-keys).
4. (Optional) Enable billing for pay-as-you-go & higher limits
   1. You can build on the **free tier** or upgrade to **pay-as-you-go** for higher rate limits and different data-use terms.
   2. Open **Settings → Plan information** and click **Set up Billing** for your project; you’ll be guided to link a **Google Cloud Billing** account.

<figure><img src="https://804846835-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FE9AhVJoaDxXfUQueIzAL%2Fuploads%2FBDEw7bAttVznZdHBC18O%2Fdivider.png?alt=media&#x26;token=fc4ff755-dfd1-4ec2-9088-9164e0c2447c" alt=""><figcaption></figcaption></figure>

## Step 1: Generate or adapt an image with Nano-Banana

Once you're signed in and set up in Google AI Studio, you can start creating content.

### Option A: Adapt an existing image

1. Open image generation
   1. Go to AI Studio.
   2. Select **`Try Nano Banana`**.

<figure><img src="https://804846835-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FE9AhVJoaDxXfUQueIzAL%2Fuploads%2Fp6C3Mg7mi5poUdnxmz1V%2FGenAI-GoogleDashboard.png?alt=media&#x26;token=b0292c9a-b2f3-4a07-8527-775a7bcf06b6" alt=""><figcaption></figcaption></figure>

2. Upload your files
   1. First image: your subject (person/character or product)
   2. Second image: the reference image
      1. Use the bear for a person/character
      2. Use the headphones for a product
3. Add a prompt
   1. For a person/character, use our default prompt:

      **🧍** Person Flow

      > Composite the person/character from the first image into the scene from the second image. The subject should replace the second image's subject, match their height and feet placement, and the shadow and reflections should be updated to match the new subject.

      **📦** Product Flow

      > Composite the object from the first image into the scene from the second image. The object should be replace the second image's object, and the shadow should be updated to match the new object.

<figure><img src="https://804846835-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FE9AhVJoaDxXfUQueIzAL%2Fuploads%2FdLK5UhFcNho2Aqgx7s3R%2FGenAI-GooglePersonUpload.png?alt=media&#x26;token=e28c96ea-e9a5-4d4f-835f-c9b3a00510db" alt=""><figcaption></figcaption></figure>

4. Generate and download
   1. Click **Run** and wait about 5–30 seconds.
   2. Download your **style-adapted image** — this will be the base for your video.

<figure><img src="https://804846835-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FE9AhVJoaDxXfUQueIzAL%2Fuploads%2FTypU7HHyLK2qcJzr9QM6%2FGenAI-GoogleDownloadPerson.png?alt=media&#x26;token=aad19a91-a5ac-4d2b-ba14-a65fcc42e001" alt=""><figcaption></figcaption></figure>

### Option B: Generate a new image from text

You can also skip uploading a subject image and describe what you want instead.

1. Open image generation
   1. Go to AI Studio.
   2. Select **`Try Nano Banana`**.

<figure><img src="https://804846835-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FE9AhVJoaDxXfUQueIzAL%2Fuploads%2Fp6C3Mg7mi5poUdnxmz1V%2FGenAI-GoogleDashboard.png?alt=media&#x26;token=b0292c9a-b2f3-4a07-8527-775a7bcf06b6" alt=""><figcaption></figcaption></figure>

2. Upload only the reference image
   1. This sets the background, lighting, and style.
3. Enter a text prompt

   1. Example:

   > Replace the subject with a luxury handbag placed naturally in the reference scene. Match the shadows so they feel physically accurate.

<figure><img src="https://804846835-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FE9AhVJoaDxXfUQueIzAL%2Fuploads%2FJERKRoephZ3x7hCt09hg%2FGenAI-GoogleTextProduct.png?alt=media&#x26;token=3e4de5b6-00db-4814-b7e2-f5f9e13b6f15" alt=""><figcaption></figcaption></figure>

4. Generate and download
   1. Gemini will create a **new image** based on your description, blending it into the reference scene. Download this image — it’s now ready for video generation.

<figure><img src="https://804846835-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FE9AhVJoaDxXfUQueIzAL%2Fuploads%2F37CQXVTcWK0sbce6i5Vk%2FGenAI-GoogleTextProdDownload.png?alt=media&#x26;token=51b90af5-41af-4665-a2ea-b81a3c122d32" alt=""><figcaption></figcaption></figure>

<figure><img src="https://804846835-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FE9AhVJoaDxXfUQueIzAL%2Fuploads%2FBDEw7bAttVznZdHBC18O%2Fdivider.png?alt=media&#x26;token=fc4ff755-dfd1-4ec2-9088-9164e0c2447c" alt=""><figcaption></figcaption></figure>

## Step 2: Animate with Veo 3.1

1. Open video generation
   1. In AI Studio, select the "Home" button.
   2. Choose the **`Veo 3.1`** from the home screen menu.

<figure><img src="https://804846835-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FE9AhVJoaDxXfUQueIzAL%2Fuploads%2Fe75dA4GdhNWFarbVrOXf%2FGenAI-GoogleSelectVeo.png?alt=media&#x26;token=2b3922c9-7966-4885-a937-727fb1b7363b" alt=""><figcaption></figcaption></figure>

2. Select "Frames to Video" instead of the default "Text to Video"

<figure><img src="https://804846835-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FE9AhVJoaDxXfUQueIzAL%2Fuploads%2FMDo6kJcunsG4WLtnHfnX%2FGenAI-GoogleVeoFrames.png?alt=media&#x26;token=f1aa2eb2-43b5-4cd6-b8bc-72d802268d66" alt=""><figcaption></figcaption></figure>

3. Upload the style-adapted or generated image from Step 1
   1. This is the image you just created with Nano-Banana.
   2. If you're doing a video of a **product**, we recommend setting the image as the **start and end frame.**
   3. If you're doing a video of a **person**, just upload the image as the start frame.
4. Select settings
   1. Model: `veo-3.1-generate-preview` for highest quality results
   2. Aspect Ratio: Portrait (9:16)
   3. Resolution: 1080p for best results
5. Add your prompt

   **🧍** Person flow

   > A natural, engaging video of the person speaking directly to camera. Subtle movements and natural expressions, static camera, realistic lighting.

   **📦** Product flow

   > A smooth, professional 360-degree rotation of the product. The object slowly spins clockwise when viewed from the top, showcasing it from all angles. The product remains centered and in perfect focus throughout the rotation, and the camera doesn't move.

<figure><img src="https://804846835-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FE9AhVJoaDxXfUQueIzAL%2Fuploads%2FIqLgeKRNo9nH5YjACBqn%2FGenAI-GoogleVeoSetup.png?alt=media&#x26;token=2481a849-7c45-4b83-81d6-a1f678f9a104" alt=""><figcaption></figcaption></figure>

6. Generate and download
   1. Click **Run**. Veo 3.1 will render the video in 1–5 minutes.
   2. Download the **MP4** — it’s ready for playback on HLD displays.

<figure><img src="https://804846835-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FE9AhVJoaDxXfUQueIzAL%2Fuploads%2F85HQHSFf4YYHlZLz9Pus%2FGenAI-GoogleVeoDownload.png?alt=media&#x26;token=73789170-07e1-4a9e-ab14-02c988594020" alt=""><figcaption></figcaption></figure>

<figure><img src="https://804846835-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FE9AhVJoaDxXfUQueIzAL%2Fuploads%2FBDEw7bAttVznZdHBC18O%2Fdivider.png?alt=media&#x26;token=fc4ff755-dfd1-4ec2-9088-9164e0c2447c" alt=""><figcaption></figcaption></figure>

## Previewing and exporting for HLD playback

Once you’ve exported your video, we recommend opening it in [**HLD Author**](https://hlddocs.lookingglassfactory.com/apps-and-tools/hld-author).

HLD Author lets you:

* Preview how your content will appear on a Hololuminescent Display
* Automatically export the video in the correct format for playback on the included [HLD Demo Media Player](https://hlddocs.lookingglassfactory.com/managing-content/using-the-demo-media-player)

This is the easiest way to confirm framing, depth, and orientation before deploying content to a device.

<figure><img src="https://804846835-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FE9AhVJoaDxXfUQueIzAL%2Fuploads%2FBDEw7bAttVznZdHBC18O%2Fdivider.png?alt=media&#x26;token=fc4ff755-dfd1-4ec2-9088-9164e0c2447c" alt=""><figcaption></figcaption></figure>

## Output specs

| Property         | Value                   |
| ---------------- | ----------------------- |
| **Aspect Ratio** | 9:16 (portrait)         |
| **Resolution**   | 1080 × 1920             |
| **Duration**     | 8 seconds               |
| **Frame Rate**   | 24 fps                  |
| **Audio**        | Native (auto-generated) |
| **Format**       | MP4                     |
| **Watermark**    | SynthID (automatic)     |

<figure><img src="https://804846835-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FE9AhVJoaDxXfUQueIzAL%2Fuploads%2FBDEw7bAttVznZdHBC18O%2Fdivider.png?alt=media&#x26;token=fc4ff755-dfd1-4ec2-9088-9164e0c2447c" alt=""><figcaption></figcaption></figure>

## Tips for Best Results

* Use **bright, evenly lit** subject photos.
* Match the **lighting direction** between subject and reference, if possible.
* Use **9:16 images** whenever possible.
* Keep prompts **concise** but **specific** — mention the mood, camera behavior, and lighting style.
* **Avoid cluttered backgrounds** in the input image.
* Use images with **clear separation** between foreground and background.
