WAN 2.1 Text-to-Video generates high-quality 720P videos from text prompts with an ultra-fast pipeline for unlimited AI videos. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.
Idle
$0.225per run·~44 / $10
Close-up tracking shot, a lone man wrapped in heavy winter gear, fur-lined hood, goggles and a frost-covered coat, trudges through deep snow as a blinding blizzard howls around him. The snow whips from side to side, partially obscuring the landscape. His figure advances against the wind. Harsh, isolated, cinematic
photo-realistic young chimpanzee with reddish-brown fur, wearing a mint-green turtleneck and blue denim overalls. The setting is outdoors, with a softly blurred lake and greenery in the background. The camera remains stationary and focused on the chimp's upper body in a close-up shot. The chimp stays completely still except for its eyes, which move as follows: Start with the chimpanzee looking towards the camera. Its eyes shift quickly to looking straight ahead. Pause for a moment as it holds that glance. The eyes then shift back. This continues for a few times. Facial expression remains neutral throughout, with a slightly awkward or concerned look in the eyes
这是一张充满活力的照片,捕捉到了日本姬路城堡的壮丽景色。图像的特点是前景中的樱花枝,装饰着精致的白色花朵,有些盛开,有些含苞待放。樱花清晰可见,花瓣精致,细节精致。树枝很细,略微弯曲,质地自然,稍粗糙。在背景中,标志性的姬路城堡,也被称为Shirasagi城堡,被突出显示。这座城堡是联合国教科文组织世界遗产,以其令人惊叹的建筑而闻名。该建筑是一种充满活力的暖橙色,边缘和屋顶上可见错综复杂的木制品和金色装饰。阳光投下柔和的阴影,增强了樱花和城堡的质感和深度。整体构图和谐,花朵的柔和粉红色与城堡的浓郁橙色形成了美丽的对比。镜头缓缓推进,从樱花树慢慢转向城堡,展现出樱花与城堡之间细腻的层次感。光线逐渐变化,营造出日出或日落时分的温暖氛围,使整个画面更加生动。
A female warrior in silver armor is walking through a dense enchanted forest, her cape flowing with the wind, glowing fireflies around her, mysterious light rays piercing through trees, fantasy cinematic look, back view, dramatic camera tilt.
A high school girl in uniform is riding a bicycle under cherry blossoms, petals floating in the wind, soft lighting with warm tones, peaceful atmosphere, back view, shallow depth of field.
A young man in a colorful streetwear outfit is skateboarding down a graffiti-covered alleyway, his movements fast and smooth, golden hour sunlight reflecting off metal walls, urban vibe, dynamic handheld camera motion with occasional slow-downs.
A chibi-style girl with oversized eyes and bubble pigtails is jumping happily on a pastel-colored floating island, surrounded by bouncing jelly creatures, toy-like materials and lighting, cute playful mood, wide-angle lens.
A futuristic soldier in a sleek exosuit is sprinting across a neon-lit battlefield, energy pulses glowing on his armor, explosions in the distance, gritty sci-fi tone, third-person tracking shot with depth-of-field blur.
A humanoid android with a transparent skull and glowing neural circuits is standing in a sterile lab chamber, connected to floating data streams and robotic arms, her eyes tracking the camera slowly, sleek chrome surfaces, high-tech sterile ambiance, smooth panning shot.
A girl with bangs and a vintage camera is walking along an old railway under cloudy skies, faded autumn leaves falling around her, soft film grain and warm tones, nostalgic atmosphere, shallow depth-of-field, handheld camera movements.
A demonic warrior with flaming horns and cracked molten skin is stomping through a burning battlefield, ashes flying through the air, heavy metal soundtrack vibe, slow motion sparks, intense close-up on glowing eyes, dark epic fantasy tone.
A pink-haired elf girl in armor is standing on a floating island surrounded by magic circles, sky filled with multiple moons, glowing particles drifting by, fantasy anime tone, dramatic lighting from below, camera tilt as she draws her sword.
Wan 2.1 Text-to-Video 720p Ultra Fast is a lightning-fast text-to-video generation model optimized for speed and efficiency. Generate HD 720p videos from text descriptions in seconds — perfect for rapid iteration, previews, and high-volume video creation.
| Parameter | Required | Description |
|---|---|---|
| prompt | Yes | Text description of the video you want to generate. |
| negative_prompt | No | Elements to avoid in the output. |
| size | No | Output resolution: 1280×720 or 720×1280 (default: 1280×720). |
| num_inference_steps | No | Quality/speed trade-off (default: 30). |
| duration | No | Video length: 5 or 10 seconds (default: 5). |
| guidance_scale | No | Prompt adherence strength (default: 5). |
| flow_shift | No | Motion flow control (default: 5). |
| seed | No | Set for reproducibility; -1 for random. |
| Duration | Price |
|---|---|
| 5 seconds | $0.225 |
| 10 seconds | $0.3375 |
Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/wavespeed-ai/wan-2.1/t2v-720p-ultra-fast with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status flips to completed, then read the output URL from data.outputs[0]. Examples for Wan 2.1 T2v 720p Ultra Fast below.
# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/wavespeed-ai/wan-2.1/t2v-720p-ultra-fast" \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $WAVESPEED_API_KEY" \
-d '{
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"negative_prompt": "blurry, low quality, distorted",
"size": "1280*720",
"num_inference_steps": 30,
"duration": 5,
"guidance_scale": 5,
"flow_shift": 5,
"seed": -1
}'
# Response includes a prediction id. Poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
-H "Authorization: Bearer $WAVESPEED_API_KEY"
# When status is "completed", read the output from data.outputs[0].// npm install wavespeed
const WaveSpeed = require('wavespeed');
const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env
const result = await client.run("wavespeed-ai/wan-2.1/t2v-720p-ultra-fast", {
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"negative_prompt": "blurry, low quality, distorted",
"size": "1280*720",
"num_inference_steps": 30,
"duration": 5,
"guidance_scale": 5,
"flow_shift": 5,
"seed": -1
});
console.log(result.outputs[0]); // → URL of the generated output# pip install wavespeed
import wavespeed
output = wavespeed.run(
"wavespeed-ai/wan-2.1/t2v-720p-ultra-fast",
{
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"negative_prompt": "blurry, low quality, distorted",
"size": "1280*720",
"num_inference_steps": 30,
"duration": 5,
"guidance_scale": 5,
"flow_shift": 5,
"seed": -1
}
)
print(output["outputs"][0]) # → URL of the generated outputWan 2.1 T2v 720p Ultra Fast is a WaveSpeedAI model for video generation, exposed as a REST API on WaveSpeedAI. WAN 2.1 Text-to-Video generates high-quality 720P videos from text prompts with an ultra-fast pipeline for unlimited AI videos. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing. You can call it programmatically or try it from the playground above.
POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/wavespeed-ai/wan-2.1-t2v-720p-ultra-fast.
Wan 2.1 T2v 720p Ultra Fast starts at $0.23 per run. That figure is the base price — the final charge scales with the parameters you set in the form (output size, length, count, references, or whatever knobs this model exposes), so a higher-quality or larger output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.
Key inputs: `prompt`, `duration`, `size`, `seed`, `guidance_scale`, `num_inference_steps`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/wavespeed-ai/wan-2.1-t2v-720p-ultra-fast.
Average end-to-end generation time on WaveSpeedAI is around 67 seconds per request — measured across recent runs. Queue time scales with global demand; live status is visible in the prediction record.
Commercial usage rights depend on the model's license, set by its provider (WaveSpeedAI). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.