PixVerse C1 Reference-to-Video generates videos from reference images with subject and background consistency. Use @ref_name in prompts to reference uploaded images. Supports 360p to 1080p resolutions, 1-15 second duration, multiple aspect ratios, and optional audio generation. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.
Idle
$0.05per run·~20 / $1
PixVerse C1 Reference-to-Video generates cinematic video guided by reference images. Upload up to 7 reference images as characters, objects, or backgrounds — then describe the scene in your prompt using @ref_name to refer to each reference — and the model produces a cohesive, identity-consistent video that brings your references to life.
Multi-reference image support Upload 1 to 7 reference images — characters, objects, or backgrounds — and combine them into a single generated scene.
Subject and background control Tag each reference as subject (character or object) or background (scene or environment) for more precise compositing.
@ref_name prompt referencing Reference specific images directly in your prompt using @ref_name for precise control over which element appears where.
Character-consistent output The model preserves the visual identity of referenced subjects throughout the generated clip.
Optional native audio generation Enable generate_audio_switch to produce synchronized ambient sound alongside the video.
Four resolution tiers Generate from 360p up to 1080p to match your quality and delivery requirements.
| Parameter | Required | Description |
|---|---|---|
| prompt | Yes | Text description of the scene. Use @ref_name to reference specific images. |
| images | Yes | List of 1–7 reference images. Each entry requires image_url, type, and ref_name. |
| aspect_ratio | No | Output aspect ratio. Options: 16:9 (default), 4:3, 1:1, 3:4, 9:16, 2:3, 3:2, 21:9. |
| resolution | No | Output resolution: 360p, 540p, 720p (default), or 1080p. |
| duration | No | Clip length in seconds. Range: 1–15. Default: 5. |
| generate_audio_switch | No | Whether to generate native audio for the video. Default: off. |
Each image in the images list requires:
| Field | Description |
|---|---|
| image_url | URL of the reference image. |
| type | Reference type: subject (character or object) or background (scene or environment). |
| ref_name | A short name for this reference. Use @ref_name in your prompt to refer to this image. |
| Resolution | Without Audio | With Audio |
|---|---|---|
| 360p | $0.030/s | $0.040/s |
| 540p | $0.040/s | $0.050/s |
| 720p | $0.050/s | $0.065/s |
| 1080p | $0.095/s | $0.120/s |