Alibaba Wan 2.7 Text To Video
Playground
Try it on WavespeedAI!Alibaba WAN 2.7 Text-to-Video turns plain prompts into coherent, cinematic clips with crisp detail, stable motion, and strong instruction-following—great for ads, explainers, and social posts. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.
Features
Wan 2.7 Text-to-Video
Wan 2.7 is Alibaba’s advanced text-to-video model, generating high-quality cinematic video from natural language prompts. With audio input support, negative prompt control, flexible resolution and aspect ratio options, and an optional prompt expansion mode, it delivers strong results for a wide range of creative and production workflows.
Why Choose This?
-
High-quality text-to-video generation Produces detailed, visually coherent video with accurate motion, lighting, and scene composition from text descriptions.
-
Audio input support Upload an audio track to guide the rhythm, mood, and pacing of the generated video for synchronized results.
-
Negative prompt support Specify what you don’t want in the video for more precise control over the output.
-
Prompt expansion Enable enable_prompt_expansion to let the model automatically enrich and optimize your prompt before generation.
-
Resolution options Generate at 720p or 1080p to match your delivery requirements.
-
Flexible aspect ratios Supports multiple orientations for social, cinematic, and broadcast formats.
-
Reproducible results Use the seed parameter to lock in a specific output for exact reproduction.
Parameters
| Parameter | Required | Description |
|---|---|---|
| prompt | Yes | Text description of the scene, motion, camera style, and atmosphere. |
| negative_prompt | No | Elements to exclude from the generated video. |
| audio | No | Optional audio track to synchronize with the generated video. |
| resolution | No | Output resolution: 720p (default) or 1080p. |
| aspect_ratio | No | Output aspect ratio. Default: 16:9. |
| duration | No | Clip length in seconds. Default: 5. |
| enable_prompt_expansion | No | Enable automatic prompt optimization before generation. Default: off. |
| seed | No | Random seed for reproducible results. Use -1 for a random seed. |
How to Use
- Write your prompt — describe the scene, characters, camera movement, lighting, and atmosphere. Use the Prompt Enhancer for better results.
- Add negative prompt (optional) — specify elements you want to exclude from the output.
- Upload audio (optional) — provide an audio file or URL to synchronize the video to a specific track.
- Select resolution — 720p for standard output, 1080p for higher-quality results.
- Select aspect ratio — choose the format that fits your target platform.
- Set duration — choose your desired clip length in seconds.
- Enable prompt expansion (optional) — let the model automatically enrich your prompt before generation.
- Set seed (optional) — fix the seed to reproduce a specific result in future runs.
- Submit — generate, preview, and download your video.
Pricing
| Duration | 720p | 1080p |
|---|---|---|
| 5s | $0.50 | $0.75 |
| 10s | $1.00 | $1.50 |
| 15s | $1.50 | $2.25 |
Billing Rules
- 720p: $0.10 per second
- 1080p: $0.15 per second (1.5× base rate)
Best Use Cases
- Cinematic Storytelling — Render atmospheric, narrative-driven scenes from detailed text descriptions.
- Social Media Content — Generate platform-optimized video clips across multiple aspect ratios.
- Marketing & Advertising — Produce high-quality promotional video content without a film crew.
- Music & Audio-Visual — Synchronize generated video to a music track or voiceover for cohesive results.
- Concept Visualization — Bring creative ideas and moods to life quickly for pitching and review.
Pro Tips
- The more specific your prompt, the better — include camera angle, lighting style, color palette, and subject behavior.
- Use negative_prompt to avoid common artifacts like blurry faces or unwanted motion.
- Enable prompt expansion for shorter or less detailed prompts to get richer output automatically.
- Providing an audio track improves rhythm and pacing alignment in the generated video.
- Fix the seed once you find a result you like to iterate consistently across resolution and duration changes.
Notes
- Only prompt is required; all other parameters are optional.
- Ensure audio URLs are publicly accessible if using a link rather than a direct upload.
- Please ensure your content complies with Alibaba’s usage policies.
Authentication
For authentication details, please refer to the Authentication Guide.
API Endpoints
Submit Task & Query Result
# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/alibaba/wan-2.7/text-to-video" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
"resolution": "720p",
"aspect_ratio": "16:9",
"duration": 5,
"enable_prompt_expansion": false,
"seed": -1
}'
# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"
Parameters
Task Submission Parameters
Request Parameters
| Parameter | Type | Required | Default | Range | Description |
|---|---|---|---|---|---|
| prompt | string | Yes | - | The positive prompt for the generation. | |
| negative_prompt | string | No | - | The negative prompt for the generation. | |
| audio | string | No | - | - | Audio URL to guide generation (optional). |
| resolution | string | No | 720p | 720p, 1080p | The resolution of the generated video. |
| aspect_ratio | string | No | 16:9 | 16:9, 9:16, 1:1, 4:3, 3:4 | The aspect ratio of the generated video. |
| duration | integer | No | 5 | 2 ~ 15 | The duration of the generated media in seconds (2-15s). |
| enable_prompt_expansion | boolean | No | false | - | If set to true, the prompt optimizer will be enabled. |
| seed | integer | No | -1 | -1 ~ 2147483647 | The random seed to use for the generation. -1 means a random seed will be used. |
Response Parameters
| Parameter | Type | Description |
|---|---|---|
| code | integer | HTTP status code (e.g., 200 for success) |
| message | string | Status message (e.g., “success”) |
| data.id | string | Unique identifier for the prediction, Task Id |
| data.model | string | Model ID used for the prediction |
| data.outputs | array | Array of URLs to the generated content (empty when status is not completed) |
| data.urls | object | Object containing related API endpoints |
| data.urls.get | string | URL to retrieve the prediction result |
| data.has_nsfw_contents | array | Array of boolean values indicating NSFW detection for each output |
| data.status | string | Status of the task: created, processing, completed, or failed |
| data.created_at | string | ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”) |
| data.error | string | Error message (empty if no error occurred) |
| data.timings | object | Object containing timing details |
| data.timings.inference | integer | Inference time in milliseconds |
Result Request Parameters
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
| id | string | Yes | - | Task ID |
Result Response Parameters
| Parameter | Type | Description |
|---|---|---|
| code | integer | HTTP status code (e.g., 200 for success) |
| message | string | Status message (e.g., “success”) |
| data | object | The prediction data object containing all details |
| data.id | string | Unique identifier for the prediction, the ID of the prediction to get |
| data.model | string | Model ID used for the prediction |
| data.outputs | object | Array of URLs to the generated content (empty when status is not completed). |
| data.urls | object | Object containing related API endpoints |
| data.urls.get | string | URL to retrieve the prediction result |
| data.status | string | Status of the task: created, processing, completed, or failed |
| data.created_at | string | ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”) |
| data.error | string | Error message (empty if no error occurred) |
| data.timings | object | Object containing timing details |
| data.timings.inference | integer | Inference time in milliseconds |