Vidu Reference to Video Q1

Transform your reference images into dynamic videos with consistent subjects and environments. Our advanced Multi-Entity Consistency technology ensures characters, objects, and scenes maintain their visual identity throughout the video.

Features

Reference to Video Q1

Features

Multi-Entity Consistency

Maintains visual identity of characters and objects
Preserves style and appearance across frames
Supports multiple reference subjects

Flexible Input

Accepts 1-3 reference images
Supports various aspect ratios (16:9, 9:16, 1:1)
Adjustable movement amplitude

Requirements

Images

Number: 1-3 reference images
Format: PNG, JPEG, JPG, WebP
Size: Minimum 128x128 pixels
Aspect ratio: Less than 1:4 or 4:1
File size: Maximum 50MB per image

Prompt

Maximum length: 1500 characters
Should describe desired motion and scene

Best Practices

Use clear, high-quality reference images
Provide detailed prompts for better results
Consider scene composition in reference images
Test different movement amplitudes for optimal output

Effect Boundaries

Best results with clear, well-lit reference images
Movement amplitude affects transition smoothness
Complex scenes may require multiple reference images

Authentication

For authentication details, please refer to the Authentication Guide.

API Endpoints

Submit Task & Query Result


# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/vidu/reference-to-video-q1" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
    "images": [
        "https://d1q70pf5vjeyhc.cloudfront.net/media/images/1752044067562333610_05bBXmGY.PNG"
    ],
    "prompt": "A character takes two natural steps in front of the camera and then strikes some poses",
    "aspect_ratio": "16:9",
    "movement_amplitude": "auto",
    "seed": 0
}'

# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"

Parameters

Task Submission Parameters

Request Parameters

Parameter	Type	Required	Default	Range	Description
images	array	Yes	https://d1q70pf5vjeyhc.cloudfront.net/media/images/1752044067562333610_05bBXmGY.PNG	-	Reference images for video generation. Requirements: 1. Accept 1-3 images; 2. Images can be URLs or Base64 encoded; 3. Supported formats: PNG, JPEG, JPG, WebP; 4. Minimum size: 128x128 pixels; 5. Aspect ratio: less than 1:4 or 4:1; 6. Maximum size: 50MB per image.
prompt	string	Yes		-	Text prompt: A textual description for video generation, with a maximum length of 1500 characters
aspect_ratio	string	No	16:9	-	The aspect ratio of the output video. Defaults to 16:9, accepted: 16:9 9:16 1:1.
movement_amplitude	string	No	auto	-	The movement amplitude of objects in the frame. Defaults to auto, accepted value: auto, small, medium, large.
seed	integer	No	-	-1 ~ 2147483647	The seed to use for generating the video. Random seed: Defaults to a random seed number; Manually set values will override the default random seed.

Response Parameters

Parameter	Type	Description
code	integer	HTTP status code (e.g., 200 for success)
message	string	Status message (e.g., “success”)
data.id	string	Unique identifier for the prediction, Task Id
data.model	string	Model ID used for the prediction
data.outputs	array	Array of URLs to the generated content (empty when status is not `completed`)
data.urls	object	Object containing related API endpoints
data.urls.get	string	URL to retrieve the prediction result
data.has_nsfw_contents	array	Array of boolean values indicating NSFW detection for each output
data.status	string	Status of the task: `created`, `processing`, `completed`, or `failed`
data.created_at	string	ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.error	string	Error message (empty if no error occurred)
data.timings	object	Object containing timing details
data.timings.inference	integer	Inference time in milliseconds

Result Query Parameters

Result Request Parameters

Parameter	Type	Required	Default	Description
id	string	Yes	-	Task ID

Result Response Parameters

Parameter	Type	Description
code	integer	HTTP status code (e.g., 200 for success)
message	string	Status message (e.g., “success”)
data	object	The prediction data object containing all details
data.id	string	Unique identifier for the prediction
data.model	string	Model ID used for the prediction
data.outputs	array	Array of URLs to the generated content (empty when status is not `completed`)
data.urls	object	Object containing related API endpoints
data.urls.get	string	URL to retrieve the prediction result
data.has_nsfw_contents	array	Array of boolean values indicating NSFW detection for each output
data.status	string	Status of the task: `created`, `processing`, `completed`, or `failed`
data.created_at	string	ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.error	string	Error message (empty if no error occurred)
data.timings	object	Object containing timing details
data.timings.inference	integer	Inference time in milliseconds

Vidu Reference to Video 2.0 Vidu Start End to Video 2.0