Alibaba Wan 2.5 Image To Video

Playground

WAN 2.5 converts text or images into videos (480p/720p/1080p) with synced audio, faster and more affordable than Google Veo3. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Features

WAN 2.5 Image-to-Video Model

WAN 2.5 is an advanced image-to-video model on Cloud’s DashScope. It generates high-quality videos from images and supports output resolutions of 480p, 720p, and 1080p.

What makes it stand out?

More affordable: Wan 2.5 is more streamlined and cost-effective - reducing creator expenses and offering more options.
One-pass A/V sync: Wan 2.5 creates a fully synchronized video (audio/voiceover + lip-sync) from a single, well-structured prompt - no separate recording or manual alignment required.
Multilingual friendly: Wan 2.5 reliably processes like Chinese prompts for A/V-synced videos.
Longer duration & more video size options: Wan 2.5 delivers up to 10 seconds and 6 aspect/size options, enabling more storytelling room and publishing flexibility.
Custom Voice: Add your own audio or let the model generate one for you. Plug-and-play, easy to swap!

Designed For

Marketing teams: Fast, polished demos/tutorials—low cost, consistent style.
Global enterprises: Multilingual, lip-synced videos with subtitles for efficient localization.
Storytellers & YouTubers: Immersive narratives while maintaining cadence and quality—driving growth.
Corporate training teams: HD videos over docs—clearer key points, better communication.
Custom Voice: Add your own audio or let the model generate one for you. Plug-and-play, easy to swap!

Pricing

Resolution	Price per second
480p	$0.05
720p	$0.10
1080p	$0.15

How to Use

Write your prompt.
Upload an audio file (optional) for voice/music.
Choose the video size (resolution/aspect).
Select the video duration (e.g., 5s / 10s).
Submit and wait for processing.
Preview and download the result.

Note

Audio limits

Formats: wav, mp3
Length: 3–30 seconds
File size: ≤ 15 MB

Over-limit handling

If the audio exceeds the target duration (5s or 10s), the model keeps only the first 5s/10s; the rest is discarded.
If the audio is shorter than the video duration, the extra video part is silent.

Image Upload

If you did not upload the image locally, please ensure that the image URL is accessible! A successfully accessible image will display a preview in the interface.

Authentication

For authentication details, please refer to the Authentication Guide.

API Endpoints

Submit Task & Query Result


# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/alibaba/wan-2.5/image-to-video" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
    "resolution": "720p",
    "duration": 5,
    "enable_prompt_expansion": false,
    "seed": -1
}'

# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"

Parameters

Task Submission Parameters

Request Parameters

Parameter	Type	Required	Default	Range	Description
prompt	string	Yes		-	The positive prompt for the generation.
image	string	Yes		-	The image for generating the output.
audio	string	No	-	-	Audio URL to guide generation (optional).
negative_prompt	string	No		-	The negative prompt for the generation.
resolution	string	No	720p	480p, 720p, 1080p	The resolution of the generated media.
duration	integer	No	5	3, 4, 5, 6, 7, 8, 9, 10	The duration of the generated media in seconds.
enable_prompt_expansion	boolean	No	false	-	If set to true, the prompt optimizer will be enabled.
seed	integer	No	-1	-1 ~ 2147483647	The random seed to use for the generation. -1 means a random seed will be used.

Response Parameters

Parameter	Type	Description
code	integer	HTTP status code (e.g., 200 for success)
message	string	Status message (e.g., “success”)
data.id	string	Unique identifier for the prediction, Task Id
data.model	string	Model ID used for the prediction
data.outputs	array	Array of URLs to the generated content (empty when status is not `completed`)
data.urls	object	Object containing related API endpoints
data.urls.get	string	URL to retrieve the prediction result
data.status	string	Status of the task: `created`, `processing`, `completed`, or `failed`
data.created_at	string	ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.error	string	Error message (empty if no error occurred)
data.timings	object	Object containing timing details
data.timings.inference	integer	Inference time in milliseconds

Result Request Parameters

Parameter	Type	Required	Default	Description
id	string	Yes	-	Task ID

Result Response Parameters

Parameter	Type	Description
code	integer	HTTP status code (e.g., 200 for success)
message	string	Status message (e.g., “success”)
data	object	The prediction data object containing all details
data.id	string	Unique identifier for the prediction, the ID of the prediction to get
data.model	string	Model ID used for the prediction
data.outputs	object	Array of URLs to the generated content (empty when status is not completed).
data.urls	object	Object containing related API endpoints
data.urls.get	string	URL to retrieve the prediction result
data.status	string	Status of the task: `created`, `processing`, `completed`, or `failed`
data.created_at	string	ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.error	string	Error message (empty if no error occurred)
data.timings	object	Object containing timing details
data.timings.inference	integer	Inference time in milliseconds

Alibaba Wan 2.5 Image Edit Alibaba Wan 2.5 Image To Video Fast