PixVerse C1 Reference-to-Video

pixverse/pixverse-c1/reference-to-video

PixVerse C1 Reference-to-Video generates videos from reference images while keeping subjects and backgrounds consistent. Use @ref_name in prompts to reference uploaded images. It supports resolutions from 360p to 1080p, durations of 1 to 15 seconds, multiple aspect ratios, and optional audio generation. The model is served through a ready-to-use REST inference API with no cold starts.

PixVerse C1 Reference-to-Video generates cinematic video guided by reference images. Upload up to 7 reference images as characters, objects, or backgrounds, then describe the scene in your prompt, using @ref_name to refer to each reference. The model produces a cohesive video that keeps the identity of each reference consistent.

Why Choose This?

  • Multi-reference image support: Upload 1 to 7 reference images (characters, objects, or backgrounds) and combine them into a single generated scene.
  • Subject and background control: Tag each reference as subject (character or object) or background (scene or environment) for more precise compositing.
  • @ref_name prompt referencing: Reference specific images directly in your prompt using @ref_name for precise control over which element appears where.
  • Character-consistent output: The model preserves the visual identity of referenced subjects throughout the generated clip.
  • Optional native audio generation: Enable generate_audio_switch to produce synchronized ambient sound alongside the video.
  • Four resolution tiers: Generate at 360p, 540p, 720p, or 1080p to match your quality and delivery requirements.

Parameters

| Parameter | Required | Description |
| --- | --- | --- |
| prompt | Yes | Text description of the scene. Use @ref_name to reference specific images. |
| images | Yes | List of 1–7 reference images. Each entry requires image_url, type, and ref_name. |
| aspect_ratio | No | Output aspect ratio. Options: 16:9 (default), 4:3, 1:1, 3:4, 9:16, 2:3, 3:2, 21:9. |
| resolution | No | Output resolution: 360p, 540p, 720p (default), or 1080p. |
| duration | No | Clip length in seconds. Range: 1–15. Default: 5. |
| generate_audio_switch | No | Whether to generate native audio for the video. Default: off. |
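
As a rough illustration of the optional parameters above, the following Python sketch fills in the documented defaults and rejects values outside the documented options. The helper name build_options is hypothetical, and two details are assumptions rather than documented behavior: that duration is a whole number of seconds, and that generate_audio_switch is sent as a boolean.

```python
# Documented options, copied from the parameter table.
ASPECT_RATIOS = {"16:9", "4:3", "1:1", "3:4", "9:16", "2:3", "3:2", "21:9"}
RESOLUTIONS = {"360p", "540p", "720p", "1080p"}


def build_options(aspect_ratio="16:9", resolution="720p", duration=5,
                  generate_audio_switch=False):
    """Return the optional request fields, using the documented defaults.

    Hypothetical helper: the integer-duration and boolean-audio checks
    are assumptions, not confirmed API behavior.
    """
    if aspect_ratio not in ASPECT_RATIOS:
        raise ValueError(f"unsupported aspect_ratio: {aspect_ratio!r}")
    if resolution not in RESOLUTIONS:
        raise ValueError(f"unsupported resolution: {resolution!r}")
    # bool is a subclass of int in Python, so exclude it explicitly.
    if isinstance(duration, bool) or not isinstance(duration, int):
        raise ValueError("duration must be a whole number of seconds")
    if not 1 <= duration <= 15:
        raise ValueError("duration must be between 1 and 15 seconds")
    return {
        "aspect_ratio": aspect_ratio,
        "resolution": resolution,
        "duration": duration,
        "generate_audio_switch": bool(generate_audio_switch),
    }


if __name__ == "__main__":
    print(build_options())
    print(build_options(aspect_ratio="9:16", resolution="1080p", duration=8))
```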

Image Entry Fields

Each image in the images list requires:

| Field | Description |
| --- | --- |
| image_url | URL of the reference image. |
| type | Reference type: subject (character or object) or background (scene or environment). |
| ref_name | A short name for this reference. Use @ref_name in your prompt to refer to this image. |
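
To make the entry format concrete, here is a minimal Python sketch that checks an images list against the rules stated above: 1 to 7 entries, each with image_url, type, and ref_name, and type limited to subject or background. The function name validate_images and the example.com URLs are hypothetical, and the real API may apply further checks.

```python
VALID_TYPES = {"subject", "background"}
REQUIRED_FIELDS = ("image_url", "type", "ref_name")


def validate_images(images):
    """Return a list of problems found; an empty list means the entries look valid."""
    problems = []
    if not 1 <= len(images) <= 7:
        problems.append(f"expected 1-7 images, got {len(images)}")
    for i, entry in enumerate(images):
        for field in REQUIRED_FIELDS:
            if not entry.get(field):
                problems.append(f"images[{i}] is missing {field}")
        if entry.get("type") and entry["type"] not in VALID_TYPES:
            problems.append(f"images[{i}] has unknown type {entry['type']!r}")
    return problems


# Placeholder entries; the URLs are illustrative only.
images = [
    {"image_url": "https://example.com/hero.png", "type": "subject", "ref_name": "hero"},
    {"image_url": "https://example.com/city.png", "type": "background", "ref_name": "city_bg"},
]

if __name__ == "__main__":
    print(validate_images(images))
```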

How to Use

  1. Write your prompt — describe the scene and use @ref_name to reference specific images (e.g. "@hero is running through @city_bg at night.").
  2. Add reference images — provide 1 to 7 images, each with an image_url, type, and ref_name.
  3. Select aspect ratio — choose the format that fits your target platform.
  4. Select resolution — 360p for fastest/lowest cost, 720p for standard, 1080p for highest quality.
  5. Set duration — choose between 1 and 15 seconds.
  6. Enable audio (optional) — check generate_audio_switch to generate synchronized native audio.
  7. Submit — generate, preview, and download your video.
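
Steps 1 and 2 depend on the @ref_name tokens in the prompt matching the ref_name values of the uploaded images. The Python sketch below shows one way to check this before submitting. The function names are hypothetical, and the pattern assumes ref_names contain only letters, digits, and underscores, which the source does not specify. Whether every image must be mentioned in the prompt is also not documented, so the "unused" list is informational only.

```python
import re


def referenced_names(prompt):
    """Collect the names that follow an @ sign in the prompt."""
    # Assumption: a ref_name is a run of letters, digits, or underscores.
    return set(re.findall(r"@(\w+)", prompt))


def check_refs(prompt, images):
    """Compare @mentions in the prompt with the declared ref_names."""
    declared = {image["ref_name"] for image in images}
    used = referenced_names(prompt)
    return {
        "undeclared": sorted(used - declared),  # mentioned but not uploaded
        "unused": sorted(declared - used),      # uploaded but never mentioned
    }


if __name__ == "__main__":
    example_images = [
        {"ref_name": "hero"},
        {"ref_name": "city_bg"},
    ]
    prompt = "@hero is running through @city_bg at night."
    print(check_refs(prompt, example_images))
```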

Pricing

| Resolution | Without Audio | With Audio |
| --- | --- | --- |
| 360p | $0.030/s | $0.040/s |
| 540p | $0.040/s | $0.050/s |
| 720p | $0.050/s | $0.065/s |
| 1080p | $0.095/s | $0.120/s |

Billing Rules

  • Billing is calculated per second of video generated
  • Duration range: 1–15 seconds
  • Examples: 10 s at 720p without audio costs 10 × $0.050 = $0.50; 10 s at 1080p without audio costs 10 × $0.095 = $0.95
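
The per-second billing above can be sketched as a small Python cost estimator. The rates are copied from the pricing table; the function name estimate_cost is hypothetical, and the result is only an estimate, since the source does not describe rounding or any additional charges.

```python
from decimal import Decimal

# (rate without audio, rate with audio) in USD per second, from the pricing table.
RATES = {
    "360p": (Decimal("0.030"), Decimal("0.040")),
    "540p": (Decimal("0.040"), Decimal("0.050")),
    "720p": (Decimal("0.050"), Decimal("0.065")),
    "1080p": (Decimal("0.095"), Decimal("0.120")),
}


def estimate_cost(resolution, duration, audio=False):
    """Estimate the price in USD of one clip: per-second rate times duration."""
    if resolution not in RATES:
        raise ValueError(f"unknown resolution: {resolution!r}")
    if not 1 <= duration <= 15:
        raise ValueError("duration must be between 1 and 15 seconds")
    without_audio, with_audio = RATES[resolution]
    rate = with_audio if audio else without_audio
    return rate * duration


if __name__ == "__main__":
    print(estimate_cost("720p", 10))              # 0.500, the $0.50 example
    print(estimate_cost("1080p", 10))             # 0.950, the $0.95 example
    print(estimate_cost("720p", 5, audio=True))   # 0.325
```

Decimal is used instead of float so that the per-second rates multiply without binary rounding error.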

Best Use Cases

  • Character-driven storytelling — Place consistent characters from reference images into entirely new scenes.
  • Brand & product videos — Generate new scenes featuring consistent brand characters or products from reference imagery.
  • Social media content — Produce short-form video clips with consistent visual identity from reference photos.
  • Creative concepting — Rapidly prototype multi-character or multi-element scenes for pitching and storyboarding.
  • Style-consistent series — Maintain a unified visual style across multiple video generations using the same reference set.

Pro Tips

  • Assign clear, memorable ref_names (e.g. hero, bg, product) and use them naturally in your prompt.
  • Tag environment or scene images as background and characters or objects as subject for the most accurate compositing.
  • Use clear, well-lit reference images with distinct subjects for the best identity preservation.
  • Use 360p to rapidly test your scene composition before committing to a higher-resolution render.
  • Enable audio for scenes with strong ambient environments like outdoor settings, crowds, or action sequences.

Notes

  • Both prompt and images are required fields; all other parameters are optional.
  • Each image entry must include image_url, type, and ref_name.
  • Ensure image URLs are publicly accessible.
  • Please follow PixVerse's content usage policies when crafting prompts.
