Pixverse Pixverse V5.5 Image To Video

Playground

PixVerse V5.5 Image-to-Video turns a single image into cinematic clips with smooth motion, clean detail, and strong subject fidelity—ideal for logo stingers, character motion, and social posts. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

Features

PixVerse v5.5 — Image-to-Video (I2V)

PixVerse v5.5 Image-to-Video animates a single image into a short cinematic clip. You provide a still frame plus a prompt; the model adds motion, camera moves, lighting changes and FX while keeping the original character, composition and style intact.

✨ Highlights

Image-aware animation – Uses your input image as the first frame, preserving identity, pose and layout.
Flexible formats – Resolutions from 360p–1080p and aspect ratios 16:9, 4:3, 1:1, 3:4, 9:16 for feeds, stories and banners.
Multiple durations – Generate 5s, 8s or 10s clips for hooks, shorts or slightly longer moments.
Prompt reasoning (thinking_type) – Optional system optimisation that can refine complex prompts before generation.

🧩 Parameters

prompt* (string) Up to 2048 characters describing motion, camera, lighting and style. Example: “Dynamic anime close-up, wind blowing cloak and hair, camera slowly circling, sparks and glowing embers in the background.”
image* (URL or upload) The source frame to animate. Front-facing, well-lit images work best.
resolution One of 360p, 540p, 720p, 1080p.
duration 5, 8 or 10 seconds.(10 seconds is not available for 1080p)
thinking_type
- “enabled” – Turn on system-level reasoning to structure and optimise your prompt.
- “disabled” – Use your prompt exactly as written.
- “auto” (default) – Let the system decide whether to enable prompt optimizer automatically.
negative_prompt (optional) Words you don’t want in the video, e.g. watermark, logo, text, distortion.
seed (integer) Fix a seed for reproducible runs, or change it to get new variations from the same setup.

💰 Pricing

Resolution	5s clip (total)	8s clip (total)	10s clip (total)*
360p	$0.85	$1.30	$1.39
540p	$0.85	$1.30	$1.39
720p	$1.00	$1.60	$1.72
1080p	$1.60	$2.80	-

🚀 How to Use

Upload your image Add a clean, high-quality frame under image – ideally with clear subject and minimal motion blur.
Write the prompt Focus on how things move, camera path and overall mood, not on redesigning the character.
- Good: “Camera slowly pushes in, cloak flutters in the wind, sparks drift across frame, cinematic lighting.”
- Risky: “Change clothes and hairstyle completely while the character runs and transforms into a dragon.”
Choose resolution & ratio
- 16:9 for YouTube / landscape.
- 9:16 for TikTok / Reels / Stories.
- 1:1 or 4:3 for feed posts.
Set duration and resolution
- Select resolution from 360p,540p,720p,1080p
- Select duration from 5s, 8s or 10s (Not available for 1080p)
(Optional) Adjust thinking_type, negative_prompt and seed
- Use enabled or auto for complex, multi-sentence prompts. The model will optimize it for you to generate better video output.
- Add a short negative prompt to avoid artefacts.
- Lock the seed while you tweak small details.
Run and iterate Generate the clip, review motion and framing, then refine your prompt or duration as needed.

💡 Best Practices

Keep the image and prompt aligned – don’t describe a totally different scene or character.
Use medium or close shots for character-focused animations; wide shots can feel sparse.
For platforms with heavy compression, prefer 720p / 1080p to keep details clean.
Avoid overloading the prompt with too many actions; 1–3 clear motions usually work best.

Authentication

For authentication details, please refer to the Authentication Guide.

API Endpoints

Submit Task & Query Result


# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/pixverse/pixverse-v5.5/image-to-video" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
    "resolution": "540p",
    "duration": 5,
    "generate_audio_switch": false,
    "generate_multi_clip_switch": false,
    "thinking_type": "auto"
}'

# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"

Parameters

Task Submission Parameters

Request Parameters

Parameter	Type	Required	Default	Range	Description
prompt	string	Yes		-	The positive prompt for the generation.
image	string	No		-	First frame of the video; Supported image formats include.jpg/.jpeg/.png; The image file size cannot exceed 10MB, and the image resolution should not be less than 300*300px, and the aspect ratio of the image should be between 1:2.5 ~ 2.5:1.
resolution	string	No	540p	360p, 540p, 720p, 1080p	The resolution of the generated media.
duration	integer	No	5	5, 8, 10	The duration of the generated media.
generate_audio_switch	boolean	No	false	-	Enable audio generation for the video.
generate_multi_clip_switch	boolean	No	false	-	Enable multi-clip generation with dynamic camera changes.
thinking_type	string	No	auto	enabled, disabled, auto	Prompt reasoning enhancement.
negative_prompt	string	No		-	The negative prompt for the generation.
seed	integer	No	-	-1 ~ 2147483647	The random seed to use for the generation.

Response Parameters

Parameter	Type	Description
code	integer	HTTP status code (e.g., 200 for success)
message	string	Status message (e.g., “success”)
data.id	string	Unique identifier for the prediction, Task Id
data.model	string	Model ID used for the prediction
data.outputs	array	Array of URLs to the generated content (empty when status is not `completed`)
data.urls	object	Object containing related API endpoints
data.urls.get	string	URL to retrieve the prediction result
data.has_nsfw_contents	array	Array of boolean values indicating NSFW detection for each output
data.status	string	Status of the task: `created`, `processing`, `completed`, or `failed`
data.created_at	string	ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.error	string	Error message (empty if no error occurred)
data.timings	object	Object containing timing details
data.timings.inference	integer	Inference time in milliseconds

Result Request Parameters

Parameter	Type	Required	Default	Description
id	string	Yes	-	Task ID

Result Response Parameters

Parameter	Type	Description
code	integer	HTTP status code (e.g., 200 for success)
message	string	Status message (e.g., “success”)
data	object	The prediction data object containing all details
data.id	string	Unique identifier for the prediction, the ID of the prediction to get
data.model	string	Model ID used for the prediction
data.outputs	string	Array of URLs to the generated content (empty when status is not completed).
data.urls	object	Object containing related API endpoints
data.urls.get	string	URL to retrieve the prediction result
data.status	string	Status of the task: `created`, `processing`, `completed`, or `failed`
data.created_at	string	ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.error	string	Error message (empty if no error occurred)
data.timings	object	Object containing timing details
data.timings.inference	integer	Inference time in milliseconds

Pixverse Pixverse V5.5 Effects Pixverse Pixverse V5.5 Text To Video