Reference to Video AI Generator
What Is Reference to Video AI?
Reference to video AI is a source-guided generation workflow. Instead of prompting a scene from scratch, you upload references that define who or what should stay stable, then describe the new shot you want the model to create.
Stable Identity
Use reference images to lock face identity, outfit details, product shape, packaging, or environment styling across multiple clips instead of rebuilding those details in every prompt.

Mixed Inputs
This workflow can use image, video, and audio references when the selected model supports them, giving you more control over look, motion, rhythm, and scene continuity.

Scene Continuity
Reference to video works well when the next clip needs to feel connected to the last one. It is useful for follow-up shots, product sequences, campaign variations, and recurring characters.

Clear Scope
Reference to video is for source-guided generation. Text to video is for prompt-first creation, and image to video is for animating a lead frame or frame pair.
How to Use Reference to Video
Pick a reference-capable model, upload the files that should control continuity, and write a prompt that explains the new shot you want to generate.
Pick a Model
Select a model on this page that supports source-guided video generation. The current reference workflow supports Veo 3.1 Fast, Seedance 2.0, Seedance 2.0 Fast, Gemini Omni Video, Happy Horse 1.0, Kling 2.6 Motion Control, and Kling 3 Motion Control.
Upload References
Use each file for one job: identity, product fidelity, motion example, scene styling, or audio timing. Cleaner references usually create more stable results.
Describe the Shot
Let your references carry continuity, then use the prompt to define what changes in the new clip, such as action, framing, camera movement, mood, or pacing.
More AI Tools & Effects
Discover more tools and effects to power your creative workflow.
Why Reference to Video Improves Continuity
Stable Characters
Reference files reduce identity drift, so the same face, outfit, and visual persona stay more recognizable across related clips.
Better Product Fidelity
Use reference assets to keep packaging, materials, logos, and visual styling consistent in product demos, ads, and campaign variations.
Motion Guidance
Video references can provide movement cues that help the next shot follow a clearer pose, rhythm, or motion pattern when the model supports it.
Audio Timing
Audio references can help define cadence, energy, or timing for music-driven and rhythm-sensitive generations on supported models.
Cleaner Prompts
Once references define identity and continuity, the prompt can stay focused on the new shot instead of repeating static subject details.
Predictable Iteration
Small prompt adjustments produce clearer A/B variants when references already anchor the subject, scene logic, and style direction.
Reference to Video AI FAQ
Practical answers for source-guided AI video generation.

