WAN 2.1 T2V 720P offers text-to-video 720p generation from prompts, enabling unlimited AI video creation for social and marketing. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.
Idle
$0.3per run·~33 / $10
A rugged male secret agent in a torn tactical suit sprints through a war-torn urban alley, pistol in one hand, a bleeding gash on his brow. His eyes are sharp and calculating, sweat glistening on his tense face. Explosions light up the background as he dives into cover in slow motion
一位穿着时髦的年轻美女,站在客厅,背景虚化,惊讶的捂住嘴
Theme: A Dramatic underwater scene of a hot woman trapped inside a van after it crashed into the water, breathing and bubbles fantasy. Main Subject: a close up side face view of a Japanese woman (young, hot, perfect in every possible way) wearing scuba goggles, a string bikini top, and distressed denim short-shorts that reveals her toned legs, trapped underwater inside a van. She takes a breath from a scuba tank that is resting on the seat next to her, moaning with relief after nearly drowning. Focus on the subtle shape and movement of her lips and cheeks as she breathes from the scuba tanks regulator in her mouth, subtly leaning towards the camera to show off her body and lips. Special emphasis is on the shape, movement, and size of the bubbles she exhales as they rise upwards in a realistic way, visually indicating her breath and dependency on the scuba tank for air. Blue-Green underwater lighting. The entire focus of the image is ensuring the visual accuracy and detail of her breathing and exhaling bubbles from the scuba regulator underwater, ensuring she looks hot doing so, secondary focus is in ensuring a realistic underwater environment. Highly detailed and realistic film grab with added focus on underwater physics.
A graceful white swan gliding effortlessly across a calm, reflective pond. Its long neck curves elegantly, and lily pads dot the water's surface, creating a tranquil scene in soft morning light.
A lone astronaut floats gracefully through a nebular cloud in deep space, starlight shimmering on their visor. Slow, cinematic zoom out. Dreamy, ethereal lighting. UHD, 8K, highly detailed.
A young woman sits by a window on a rainy day, holding a warm mug. Her gaze is distant, reflecting a quiet contemplation. Soft, diffused natural light. Cozy, melancholic atmosphere. Shot from a slightly high angle.
A humanoid robot carefully tending to glowing bioluminescent plants in a controlled, sterile lab environment. Its movements are precise and fluid. Clean, minimalist design, soft blue and green lighting.
A fantastical creature, half-dragon, half-butterfly, gracefully soaring through a sky filled with floating islands, inspired by Studio Ghibli's art style. Hand-painted textures, dreamlike atmosphere.
An elderly couple sits on a park bench, holding hands and watching children play. Sunlight filters through autumn leaves, creating warm highlights. Their faces show contentment and a shared lifetime of memories. Gentle, static shot, warm tones.
A cute, fluffy corgi puppy playfully chasing its own tail on a sunny meadow, rendered in a charming stop-motion animation style. Warm, inviting colors, slightly jerky but adorable movement.
Create cinematic-quality videos from text descriptions with Wan 2.1 Text-to-Video 720p. This powerful model transforms your written prompts into stunning 720p HD videos with smooth motion, rich detail, and professional visual quality — no source footage required.
| Duration | Price |
|---|---|
| 5 seconds | $0.30 |
| 10 seconds | $0.45 |
| Parameter | Required | Description |
|---|---|---|
| prompt | Yes | Text description of the video you want to generate. |
| negative_prompt | No | Elements to avoid in the generated video. |
| size | No | Output resolution (default: 1280×720). |
| num_inference_steps | No | Quality/speed trade-off (default: 30). |
| duration | No | Video length in seconds: 5 or 10 (default: 5). |
| guidance_scale | No | Prompt adherence strength (default: 5). |
| flow_shift | No | Motion intensity control (default: 5). |
| seed | No | Set for reproducibility; -1 for random. |
Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/wavespeed-ai/wan-2.1/t2v-720p with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status flips to completed, then read the output URL from data.outputs[0]. Examples for Wan 2.1 T2v 720p below.
# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/wavespeed-ai/wan-2.1/t2v-720p" \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $WAVESPEED_API_KEY" \
-d '{
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"negative_prompt": "blurry, low quality, distorted",
"size": "1280*720",
"num_inference_steps": 30,
"duration": 5,
"guidance_scale": 5,
"flow_shift": 5,
"seed": -1
}'
# Response includes a prediction id. Poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
-H "Authorization: Bearer $WAVESPEED_API_KEY"
# When status is "completed", read the output from data.outputs[0].// npm install wavespeed
const WaveSpeed = require('wavespeed');
const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env
const result = await client.run("wavespeed-ai/wan-2.1/t2v-720p", {
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"negative_prompt": "blurry, low quality, distorted",
"size": "1280*720",
"num_inference_steps": 30,
"duration": 5,
"guidance_scale": 5,
"flow_shift": 5,
"seed": -1
});
console.log(result.outputs[0]); // → URL of the generated output# pip install wavespeed
import wavespeed
output = wavespeed.run(
"wavespeed-ai/wan-2.1/t2v-720p",
{
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"negative_prompt": "blurry, low quality, distorted",
"size": "1280*720",
"num_inference_steps": 30,
"duration": 5,
"guidance_scale": 5,
"flow_shift": 5,
"seed": -1
}
)
print(output["outputs"][0]) # → URL of the generated outputWan 2.1 T2v 720p is a WaveSpeedAI model for video generation, exposed as a REST API on WaveSpeedAI. WAN 2.1 T2V 720P offers text-to-video 720p generation from prompts, enabling unlimited AI video creation for social and marketing. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing. You can call it programmatically or try it from the playground above.
POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/wavespeed-ai/wan-2.1-t2v-720p.
Wan 2.1 T2v 720p starts at $0.30 per run. That figure is the base price — the final charge scales with the parameters you set in the form (output size, length, count, references, or whatever knobs this model exposes), so a higher-quality or larger output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.
Key inputs: `prompt`, `duration`, `size`, `seed`, `guidance_scale`, `num_inference_steps`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/wavespeed-ai/wan-2.1-t2v-720p.
Average end-to-end generation time on WaveSpeedAI is around 85 seconds per request — measured across recent runs. Queue time scales with global demand; live status is visible in the prediction record.
Commercial usage rights depend on the model's license, set by its provider (WaveSpeedAI). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.