Alibaba Wan 2.5 Video Extend
Playground
Try it on WavespeedAI!Extend your videos with Alibaba WAN 2.5 video extender model with audio. Perfect for creating longer videos from shorter clips with audio track.
Features
Alibaba WAN 2.5 — Video Extend Model
Alibaba WAN 2.5 is an advanced video extension model available on Alibaba Cloud DashScope, designed to extend or complete video clips while maintaining perfect motion continuity, style consistency, and A/V synchronization. It supports custom durations from 3 to 10 seconds and resolutions up to 1080p.
✨ Why it’s special
- Seamless extension: Extends videos naturally — no abrupt cuts or looping artifacts. The motion flow and scene lighting stay coherent frame to frame.
- Flexible duration control: Choose any target length from 3 s – 10 s (3, 4, 5, 6, 7, 8, 9, 10 s all supported).
- One-pass A/V sync: Automatically aligns audio and mouth movements, ensuring natural speech and rhythm without manual editing.
- Multilingual friendly: Supports prompts in Chinese and English for synchronized video generation with accurate lip-sync.
- Custom audio option: You can upload your own audio (music or voice) or let the model generate one for you — plug-and-play.
- Lightweight and affordable: Built for creators and teams who need fast, consistent, and cost-effective video outputs.
🧩 Designed For
- Content creators & storytellers: Extend scenes smoothly to fit a beat, dialogue, or camera motion.
- Marketing teams: Produce multiple ad variations from a single core clip — faster, cheaper, consistent.
- Corporate training teams: Expand short clips into complete learning segments without reshooting.
- Global enterprises: Generate localized, multilingual video extensions for different audiences.
⚙️ How to Use
- Upload your base video clip.
- (Optional) Upload an audio file (
.wav
/.mp3
, ≤ 15 MB, 3–30 s). - Enter your prompt or description (supports Chinese / English).
- Select the output duration (3 s – 10 s).
- Choose resolution (480p / 720p / 1080p).
- Submit the job — wait a few moments, preview, and download your extended video.
💡 Audio Handling
- If audio > chosen video length: Only the first segment (3 s, 5 s, 10 s etc.) is used.
- If audio < chosen video length: Remaining video plays silently.
- If no audio: WAN 2.5 can auto-generate one or leave it silent.
💰 Pricing
Resolution | Price per second | Example (3 s) | Example (5 s) |
---|---|---|---|
480p | $0.05 | $0.15 | $0.25 |
720p | $0.10 | $0.30 | $0.50 |
1080p | $0.15 | $0.45 | $0.75 |
🧠 Pro Tips
- Start with 5 s for testing — balance between cost and quality.
- Upload high-quality input videos to maximize motion consistency.
- Keep your prompt concise but descriptive: include mood, motion, and scene intent.
- For multilingual videos, specify the desired spoken language in your prompt.
📎 Note
- If you didn’t upload your image or video locally, please ensure that the media URL is publicly accessible. A successfully accessible file will display a preview in the interface.
Authentication
For authentication details, please refer to the Authentication Guide.
API Endpoints
Submit Task & Query Result
# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/alibaba/wan-2.5/video-extend" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
"resolution": "720p",
"duration": 5,
"enable_prompt_expansion": false,
"seed": -1
}'
# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"
Parameters
Task Submission Parameters
Request Parameters
Parameter | Type | Required | Default | Range | Description |
---|---|---|---|---|---|
video | string | Yes | - | The video for generating the output. | |
audio | string | No | - | - | Audio URL to guide generation (optional). |
prompt | string | Yes | - | The positive prompt for the generation. | |
negative_prompt | string | No | - | The negative prompt for the generation. | |
resolution | string | No | 720p | 480p, 720p, 1080p | The resolution of the generated media. |
duration | integer | No | 5 | 3, 4, 5, 6, 7, 8, 9, 10 | The duration of the generated media in seconds. |
enable_prompt_expansion | boolean | No | false | - | If set to true, the prompt optimizer will be enabled. |
seed | integer | No | -1 | -1 ~ 2147483647 | The random seed to use for the generation. -1 means a random seed will be used. |
Response Parameters
Parameter | Type | Description |
---|---|---|
code | integer | HTTP status code (e.g., 200 for success) |
message | string | Status message (e.g., “success”) |
data.id | string | Unique identifier for the prediction, Task Id |
data.model | string | Model ID used for the prediction |
data.outputs | array | Array of URLs to the generated content (empty when status is not completed ) |
data.urls | object | Object containing related API endpoints |
data.urls.get | string | URL to retrieve the prediction result |
data.has_nsfw_contents | array | Array of boolean values indicating NSFW detection for each output |
data.status | string | Status of the task: created , processing , completed , or failed |
data.created_at | string | ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”) |
data.error | string | Error message (empty if no error occurred) |
data.timings | object | Object containing timing details |
data.timings.inference | integer | Inference time in milliseconds |