Alibaba Wan 2.5 Text To Video

Playground

Alibaba WAN 2.5 makes 480p-1080p text/image-to-video with synced audio and is faster, more affordable than Google Veo3. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Features

Alibaba WAN 2.5 Text-to-Video Model

Alibaba WAN 2.5 is an advanced text-to-video model provided by Alibaba Cloud’s DashScope platform. This model generates high-quality 480p/720p/1080p videos from text prompts.

What makes it stand out?

More affordable: Wan 2.5 is more streamlined and cost-effective - reducing creator expenses and offering more options.
One-pass A/V sync: Wan 2.5 creates a fully synchronized video (audio/voiceover + lip-sync) from a single, well-structured prompt - no separate recording or manual alignment required.
Multilingual friendly: Wan 2.5 reliably processes like Chinese prompts for A/V-synced videos.
Longer duration & more video size options: Wan 2.5 delivers up to 10 seconds and 6 aspect/size options, enabling more storytelling room and publishing flexibility.
Custom Voice: Add your own audio or let the model generate one for you. Plug-and-play, easy to swap!

Designed For

Marketing teams: Fast, polished demos/tutorials—low cost, consistent style.
Global enterprises: Multilingual, lip-synced videos with subtitles for efficient localization.
Storytellers & YouTubers: Immersive narratives while maintaining cadence and quality—driving growth.
Corporate training teams: HD videos over docs—clearer key points, better communication.

Pricing

Resolution	Price per second
480p	$0.05
720p	$0.10
1080p	$0.15

How to Use

Write your prompt.
Upload an audio file (optional) for voice/music.
Choose the video size (resolution/aspect).
Select the video duration (e.g., 5s / 10s).
Submit and wait for processing.
Preview and download the result.

Note

Audio limits

Formats: wav, mp3
Length: 3–30 seconds
File size: ≤ 15 MB

Over-limit handling

If the audio exceeds the target duration (5s or 10s), the model keeps only the first 5s/10s; the rest is discarded.
If the audio is shorter than the video duration, the extra video part is silent.

Authentication

For authentication details, please refer to the Authentication Guide.

API Endpoints

Submit Task & Query Result


# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/alibaba/wan-2.5/text-to-video" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
    "size": "1280*720",
    "duration": 5,
    "enable_prompt_expansion": false,
    "seed": -1
}'

# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"

Parameters

Task Submission Parameters

Request Parameters

Parameter	Type	Required	Default	Range	Description
prompt	string	Yes		-	The positive prompt for the generation.
negative_prompt	string	No		-	The negative prompt for the generation.
audio	string	No	-	-	Audio URL to guide generation (optional). Audio: ≥3s WAV/MP3, ≤15 MB
size	string	No	1280*720	832480, 480832, 1280720, 7201280, 19201080, 10801920	The size of the generated media in pixels (width*height).
duration	integer	No	5	5, 10	The duration of the generated media in seconds.
enable_prompt_expansion	boolean	No	false	-	If set to true, the prompt optimizer will be enabled.
seed	integer	No	-1	-1 ~ 2147483647	The random seed to use for the generation. -1 means a random seed will be used.

Response Parameters

Parameter	Type	Description
code	integer	HTTP status code (e.g., 200 for success)
message	string	Status message (e.g., “success”)
data.id	string	Unique identifier for the prediction, Task Id
data.model	string	Model ID used for the prediction
data.outputs	array	Array of URLs to the generated content (empty when status is not `completed`)
data.urls	object	Object containing related API endpoints
data.urls.get	string	URL to retrieve the prediction result
data.has_nsfw_contents	array	Array of boolean values indicating NSFW detection for each output
data.status	string	Status of the task: `created`, `processing`, `completed`, or `failed`
data.created_at	string	ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.error	string	Error message (empty if no error occurred)
data.timings	object	Object containing timing details
data.timings.inference	integer	Inference time in milliseconds

Result Request Parameters

Parameter	Type	Required	Default	Description
id	string	Yes	-	Task ID

Result Response Parameters

Parameter	Type	Description
code	integer	HTTP status code (e.g., 200 for success)
message	string	Status message (e.g., “success”)
data	object	The prediction data object containing all details
data.id	string	Unique identifier for the prediction, the ID of the prediction to get
data.model	string	Model ID used for the prediction
data.outputs	object	Array of URLs to the generated content (empty when status is not completed).
data.urls	object	Object containing related API endpoints
data.urls.get	string	URL to retrieve the prediction result
data.status	string	Status of the task: `created`, `processing`, `completed`, or `failed`
data.created_at	string	ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.error	string	Error message (empty if no error occurred)
data.timings	object	Object containing timing details
data.timings.inference	integer	Inference time in milliseconds

Alibaba Wan 2.5 Text To Image Alibaba Wan 2.5 Video Extend