Alibaba Wan 2.5 Text To Video

Alibaba WAN 2.5 generates 480p to 1080p videos with synchronized audio from text or image prompts, and is positioned as faster and more affordable than Google Veo3. It is available through a ready-to-use REST inference API with no cold starts and affordable pricing.

Features

Alibaba WAN 2.5 Text-to-Video Model

Alibaba WAN 2.5 is an advanced text-to-video model provided by Alibaba Cloud’s DashScope platform. This model generates high-quality 480p/720p/1080p videos from text prompts.

What makes it stand out?

  • More affordable: Wan 2.5 is more streamlined and cost-effective - reducing creator expenses and offering more options.
  • One-pass A/V sync: Wan 2.5 creates a fully synchronized video (audio/voiceover + lip-sync) from a single, well-structured prompt - no separate recording or manual alignment required.
  • Multilingual friendly: Wan 2.5 reliably processes prompts in languages such as Chinese for A/V-synced videos.
  • Longer duration & more video size options: Wan 2.5 delivers up to 10 seconds and 6 aspect/size options, enabling more storytelling room and publishing flexibility.
  • Custom Voice: Add your own audio or let the model generate one for you. Plug-and-play, easy to swap!

Designed For

  • Marketing teams: Fast, polished demos/tutorials—low cost, consistent style.
  • Global enterprises: Multilingual, lip-synced videos with subtitles for efficient localization.
  • Storytellers & YouTubers: Immersive narratives while maintaining cadence and quality—driving growth.
  • Corporate training teams: HD videos over docs—clearer key points, better communication.

Pricing

| Resolution | Price per second |
| --- | --- |
| 480p | $0.05 |
| 720p | $0.10 |
| 1080p | $0.15 |
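
As a rough illustration of the arithmetic behind the pricing table, the sketch below multiplies the per-second price by the clip duration. It assumes billing is exactly price × seconds, with no minimum charge or rounding rules; the function name `estimate_cost` is hypothetical and not part of the API.

```python
from decimal import Decimal

# Per-second prices in USD, copied from the pricing table.
PRICE_PER_SECOND = {
    "480p": Decimal("0.05"),
    "720p": Decimal("0.10"),
    "1080p": Decimal("0.15"),
}


def estimate_cost(resolution: str, duration_seconds: int) -> Decimal:
    """Estimate the price of one video, assuming price = rate * seconds."""
    if resolution not in PRICE_PER_SECOND:
        raise ValueError(f"unknown resolution: {resolution!r}")
    if duration_seconds <= 0:
        raise ValueError("duration_seconds must be positive")
    return PRICE_PER_SECOND[resolution] * duration_seconds
```

Under that assumption, a 10-second 1080p clip would come to $1.50 and a 5-second 480p clip to $0.25.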

How to Use

  1. Write your prompt.
  2. Upload an audio file (optional) for voice/music.
  3. Choose the video size (resolution/aspect).
  4. Select the video duration (e.g., 5s / 10s).
  5. Submit and wait for processing.
  6. Preview and download the result.

Note

Audio limits

  • Formats: wav, mp3
  • Length: 3–30 seconds
  • File size: ≤ 15 MB

Over-limit handling

  • If the audio exceeds the target duration (5s or 10s), the model keeps only the first 5s/10s; the rest is discarded.
  • If the audio is shorter than the video duration, the extra video part is silent.
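
The audio limits and over-limit rules above can be checked locally before uploading. The following is a minimal sketch, not the service's own validation: the helper names `check_audio` and `fit_audio` are hypothetical, and "15 MB" is read here as 15 × 1024 × 1024 bytes, which is an assumption.

```python
from __future__ import annotations

from dataclasses import dataclass

ALLOWED_FORMATS = {"wav", "mp3"}
MIN_AUDIO_SECONDS = 3.0
MAX_AUDIO_SECONDS = 30.0
MAX_AUDIO_BYTES = 15 * 1024 * 1024  # assumption: "15 MB" treated as 15 MiB


@dataclass
class AudioFit:
    used_seconds: float       # audio that ends up in the video
    discarded_seconds: float  # audio cut off past the video duration
    silent_seconds: float     # video time left without audio


def check_audio(fmt: str, length_seconds: float, size_bytes: int) -> list[str]:
    """Return a list of limit violations (empty if the file looks acceptable)."""
    problems = []
    if fmt.lower() not in ALLOWED_FORMATS:
        problems.append(f"format {fmt!r} is not wav or mp3")
    if not MIN_AUDIO_SECONDS <= length_seconds <= MAX_AUDIO_SECONDS:
        problems.append("length must be between 3 and 30 seconds")
    if size_bytes > MAX_AUDIO_BYTES:
        problems.append("file is larger than 15 MB")
    return problems


def fit_audio(audio_seconds: float, video_seconds: float) -> AudioFit:
    """Apply the documented rule: truncate long audio, leave the tail silent for short audio."""
    used = min(audio_seconds, video_seconds)
    discarded = max(0.0, audio_seconds - video_seconds)
    silent = max(0.0, video_seconds - audio_seconds)
    return AudioFit(used, discarded, silent)
```

For example, a 12-second track on a 5-second video keeps the first 5 seconds and drops 7, while a 4-second track on a 10-second video leaves 6 seconds silent.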

Authentication

For authentication details, please refer to the Authentication Guide.

API Endpoints

Submit Task & Query Result


# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/alibaba/wan-2.5/text-to-video" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
    "prompt": "<your text prompt>",
    "size": "1280*720",
    "duration": 5,
    "enable_prompt_expansion": false,
    "seed": -1
}'

# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"
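
For readers calling the API from Python rather than curl, the two requests could be sketched with the standard library's `urllib.request` as shown below. The endpoint URLs and headers are taken from the curl commands; the helper names (`build_submit_request`, `build_result_request`, `send`) are hypothetical, and the sketch does not handle HTTP errors or retries.

```python
import json
import urllib.request

API_BASE = "https://api.wavespeed.ai/api/v3"


def build_submit_request(api_key: str, payload: dict) -> urllib.request.Request:
    """Build the POST request that submits a text-to-video task."""
    return urllib.request.Request(
        f"{API_BASE}/alibaba/wan-2.5/text-to-video",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def build_result_request(api_key: str, request_id: str) -> urllib.request.Request:
    """Build the GET request that fetches the result for a task ID."""
    return urllib.request.Request(
        f"{API_BASE}/predictions/{request_id}/result",
        headers={"Authorization": f"Bearer {api_key}"},
        method="GET",
    )


def send(request: urllib.request.Request) -> dict:
    """Send a prepared request and decode the JSON body (performs network I/O)."""
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)
```

A typical flow would call `send(build_submit_request(key, payload))`, read `data.id` from the response, and then call `send(build_result_request(key, task_id))` until the task finishes. Building the request separately from sending it keeps the URL and header logic easy to inspect without touching the network.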

Parameters

Task Submission Parameters

Request Parameters

| Parameter | Type | Required | Default | Range | Description |
| --- | --- | --- | --- | --- | --- |
| prompt | string | Yes | - | - | The positive prompt for the generation. |
| negative_prompt | string | No | - | - | The negative prompt for the generation. |
| audio | string | No | - | - | Audio URL to guide generation (optional). Audio: ≥3s WAV/MP3, ≤15 MB |
| size | string | No | 1280*720 | 832*480, 480*832, 1280*720, 720*1280, 1920*1080, 1080*1920 | The size of the generated media in pixels (width*height). |
| duration | integer | No | 5 | 5, 10 | The duration of the generated media in seconds. |
| enable_prompt_expansion | boolean | No | false | - | If set to true, the prompt optimizer will be enabled. |
| seed | integer | No | -1 | -1 ~ 2147483647 | The random seed to use for the generation. -1 means a random seed will be used. |

Response Parameters

| Parameter | Type | Description |
| --- | --- | --- |
| code | integer | HTTP status code (e.g., 200 for success) |
| message | string | Status message (e.g., “success”) |
| data.id | string | Unique identifier for the prediction (the task ID) |
| data.model | string | Model ID used for the prediction |
| data.outputs | array | Array of URLs to the generated content (empty when status is not completed) |
| data.urls | object | Object containing related API endpoints |
| data.urls.get | string | URL to retrieve the prediction result |
| data.has_nsfw_contents | array | Array of boolean values indicating NSFW detection for each output |
| data.status | string | Status of the task: created, processing, completed, or failed |
| data.created_at | string | ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”) |
| data.error | string | Error message (empty if no error occurred) |
| data.timings | object | Object containing timing details |
| data.timings.inference | integer | Inference time in milliseconds |

Result Request Parameters

| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| id | string | Yes | - | Task ID |

Result Response Parameters

| Parameter | Type | Description |
| --- | --- | --- |
| code | integer | HTTP status code (e.g., 200 for success) |
| message | string | Status message (e.g., “success”) |
| data | object | The prediction data object containing all details |
| data.id | string | Unique identifier for the prediction (the ID of the prediction being retrieved) |
| data.model | string | Model ID used for the prediction |
| data.outputs | array | Array of URLs to the generated content (empty when status is not completed) |
| data.urls | object | Object containing related API endpoints |
| data.urls.get | string | URL to retrieve the prediction result |
| data.status | string | Status of the task: created, processing, completed, or failed |
| data.created_at | string | ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”) |
| data.error | string | Error message (empty if no error occurred) |
| data.timings | object | Object containing timing details |
| data.timings.inference | integer | Inference time in milliseconds |
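
Because `data.outputs` stays empty until `data.status` becomes completed, a client normally polls the result endpoint. The loop below is a minimal sketch of that pattern: `wait_for_outputs` is a hypothetical helper, `fetch_result` stands for any callable that returns the decoded result response, and the 2-second interval and attempt limit are arbitrary assumptions rather than documented values.

```python
import time
from typing import Callable


def wait_for_outputs(
    fetch_result: Callable[[], dict],
    *,
    interval_seconds: float = 2.0,   # assumption: not a documented polling rate
    max_attempts: int = 150,
    sleep: Callable[[float], None] = time.sleep,
) -> list:
    """Poll until the task completes and return the list of output URLs."""
    for _ in range(max_attempts):
        data = fetch_result()["data"]
        status = data["status"]
        if status == "completed":
            return data["outputs"]
        if status == "failed":
            # data.error carries the failure message when one is available.
            raise RuntimeError(data.get("error") or "task failed")
        # "created" and "processing" mean the task is still running.
        sleep(interval_seconds)
    raise TimeoutError(f"task not finished after {max_attempts} attempts")
```

Passing the fetch and sleep functions in as arguments keeps the loop independent of any particular HTTP client and makes it straightforward to exercise with canned responses.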
© 2025 WaveSpeedAI. All rights reserved.