Browse ModelsPixversePixverse Pixverse C1 Reference To Video

Pixverse Pixverse C1 Reference To Video

Pixverse Pixverse C1 Reference To Video

Playground

Try it on WavespeedAI!

PixVerse C1 Reference-to-Video generates videos from reference images with subject and background consistency. Use @ref_name in prompts to reference uploaded images. Supports 360p to 1080p resolutions, 1-15 second duration, multiple aspect ratios, and optional audio generation. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Features

PixVerse C1 Reference-to-Video

PixVerse C1 Reference-to-Video generates cinematic video guided by reference images. Upload up to 7 reference images as characters, objects, or backgrounds — then describe the scene in your prompt using @ref_name to refer to each reference — and the model produces a cohesive, identity-consistent video that brings your references to life.


Why Choose This?

  • Multi-reference image support Upload 1 to 7 reference images — characters, objects, or backgrounds — and combine them into a single generated scene.

  • Subject and background control Tag each reference as subject (character or object) or background (scene or environment) for more precise compositing.

  • @ref_name prompt referencing Reference specific images directly in your prompt using @ref_name for precise control over which element appears where.

  • Character-consistent output The model preserves the visual identity of referenced subjects throughout the generated clip.

  • Optional native audio generation Enable generate_audio_switch to produce synchronized ambient sound alongside the video.

  • Four resolution tiers Generate from 360p up to 1080p to match your quality and delivery requirements.


Parameters

ParameterRequiredDescription
promptYesText description of the scene. Use @ref_name to reference specific images.
imagesYesList of 1–7 reference images. Each entry requires image_url, type, and ref_name.
aspect_ratioNoOutput aspect ratio. Options: 16:9 (default), 4:3, 1:1, 3:4, 9:16, 2:3, 3:2, 21:9.
resolutionNoOutput resolution: 360p, 540p, 720p (default), or 1080p.
durationNoClip length in seconds. Range: 1–15. Default: 5.
generate_audio_switchNoWhether to generate native audio for the video. Default: off.

Image Entry Fields

Each image in the images list requires:

FieldDescription
image_urlURL of the reference image.
typeReference type: subject (character or object) or background (scene or environment).
ref_nameA short name for this reference. Use @ref_name in your prompt to refer to this image.

How to Use

  1. Write your prompt — describe the scene and use @ref_name to reference specific images (e.g. “@hero is running through @city_bg at night.”).
  2. Add reference images — provide 1 to 7 images, each with an image_url, type, and ref_name.
  3. Select aspect ratio — choose the format that fits your target platform.
  4. Select resolution — 360p for fastest/lowest cost, 720p for standard, 1080p for highest quality.
  5. Set duration — choose between 1 and 15 seconds.
  6. Enable audio (optional) — check generate_audio_switch to generate synchronized native audio.
  7. Submit — generate, preview, and download your video.

Pricing

ResolutionWithout AudioWith Audio
360p$0.030/s$0.040/s
540p$0.040/s$0.050/s
720p$0.050/s$0.065/s
1080p$0.095/s$0.120/s

Billing Rules

  • Billing is calculated per second of video generated
  • Duration range: 1–15 seconds
  • Examples: 10s at 720p (no audio) = $0.50 — 10s at 1080p (no audio) = $0.95

Best Use Cases

  • Character-driven storytelling — Place consistent characters from reference images into entirely new scenes.
  • Brand & product videos — Generate new scenes featuring consistent brand characters or products from reference imagery.
  • Social media content — Produce short-form video clips with consistent visual identity from reference photos.
  • Creative concepting — Rapidly prototype multi-character or multi-element scenes for pitching and storyboarding.
  • Style-consistent series — Maintain a unified visual style across multiple video generations using the same reference set.

Pro Tips

  • Assign clear, memorable ref_names (e.g. hero, bg, product) and use them naturally in your prompt.
  • Tag environment or scene images as background and characters or objects as subject for the most accurate compositing.
  • Use clear, well-lit reference images with distinct subjects for the best identity preservation.
  • Use 360p to rapidly test your scene composition before committing to a higher-resolution render.
  • Enable audio for scenes with strong ambient environments like outdoor settings, crowds, or action sequences.

Notes

  • Both prompt and images are required fields; all other parameters are optional.
  • Each image entry must include image_url, type, and ref_name.
  • Ensure image URLs are publicly accessible.
  • Please follow PixVerse’s content usage policies when crafting prompts.

Authentication

For authentication details, please refer to the Authentication Guide.

API Endpoints

Submit Task & Query Result


# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/pixverse/pixverse-c1/reference-to-video" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
    "images": [
        {
            "image_url": "",
            "ref_name": "",
            "type": "subject"
        }
    ],
    "aspect_ratio": "16:9",
    "resolution": "720p",
    "duration": 5,
    "generate_audio_switch": false
}'

# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"

Parameters

Task Submission Parameters

Request Parameters

ParameterTypeRequiredDefaultRangeDescription
promptstringYes-Text description of the desired video.
imagesarrayYes[]1 ~ 7 itemsThis is a controlnet that controls the maximum size of the generated model.
aspect_ratiostringNo16:916:9, 4:3, 1:1, 3:4, 9:16, 2:3, 3:2, 21:9The aspect ratio of the generated video
resolutionstringNo720p360p, 540p, 720p, 1080pThe resolution of the generated video
durationintegerNo51 ~ 15The duration of the generated video in seconds. v6 supports values from 1 to 15 seconds
generate_audio_switchbooleanNofalse-Enable audio generation for the video.
seedintegerNo--1 ~ 2147483647Random seed for generation.

Response Parameters

ParameterTypeDescription
codeintegerHTTP status code (e.g., 200 for success)
messagestringStatus message (e.g., “success”)
data.idstringUnique identifier for the prediction, Task Id
data.modelstringModel ID used for the prediction
data.outputsarrayArray of URLs to the generated content (empty when status is not completed)
data.urlsobjectObject containing related API endpoints
data.urls.getstringURL to retrieve the prediction result
data.has_nsfw_contentsarrayArray of boolean values indicating NSFW detection for each output
data.statusstringStatus of the task: created, processing, completed, or failed
data.created_atstringISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.errorstringError message (empty if no error occurred)
data.timingsobjectObject containing timing details
data.timings.inferenceintegerInference time in milliseconds

Result Request Parameters

ParameterTypeRequiredDefaultDescription
idstringYes-Task ID

Result Response Parameters

ParameterTypeDescription
codeintegerHTTP status code (e.g., 200 for success)
messagestringStatus message (e.g., “success”)
dataobjectThe prediction data object containing all details
data.idstringUnique identifier for the prediction, the ID of the prediction to get
data.modelstringModel ID used for the prediction
data.outputsstringArray of URLs to the generated content (empty when status is not completed).
data.urlsobjectObject containing related API endpoints
data.urls.getstringURL to retrieve the prediction result
data.statusstringStatus of the task: created, processing, completed, or failed
data.created_atstringISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.errorstringError message (empty if no error occurred)
data.timingsobjectObject containing timing details
data.timings.inferenceintegerInference time in milliseconds
© 2025 WaveSpeedAI. All rights reserved.