PixVerse V5 Text-to-Video generates smooth, natural 5s videos from text prompts in seconds, with 720p output available ($0.20 per 5s). Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.
就绪
$0.15每次运行·~66 / $10
Realism,a female inventor with auburn hair in an intricate updo and goggles on her head, her eyes full of intellect. She wears a leather corset and a multi-layered skirt, standing in her workshop. The room is filled with brass gears, complex clockwork devices, and glowing vacuum tubes. Warm light from gas lamps illuminates the scene. Steampunk style, highly detailed, retro-futurism, masterpiece.
A young man sits by a tall window in a library, sunlight streaming across the desk. He flips through a book slowly, pen in hand taking notes. The camera starts from a close-up of his hand writing, then pulls back to reveal rows of books behind him. Atmosphere of concentration, ambition, and quiet determination.
A teenage boy stands on a rooftop at dusk, city lights flickering below. He takes a deep breath and looks out at the skyline, the wind moving his jacket slightly. The camera circles slowly around him, shifting from a wide city view to a medium shot of his thoughtful expression. The mood is cinematic, inspiring, and filled with curiosity for the future.
A young man rides a bicycle along a quiet tree-lined road at sunrise, golden rays shining through the leaves. The camera tracks from the front as he pedals, capturing his relaxed smile and the gentle movement of the trees. The mood is fresh, hopeful, and full of vitality.
A young man stands at the top of a hill, wind brushing against his clothes. He opens his arms wide and takes a deep breath, looking at the horizon where the clouds are painted with sunset colors. The camera circles around him in a wide sweeping shot. Mood: inspiring, uplifting, full of possibility.
A young man in a tailored suit walks calmly down a glossy runway under bright white spotlights. The camera captures him from a low angle, emphasizing power and style. The atmosphere is modern, minimalistic, and high-fashion.
A young man sits on a vintage leather chair in a photography studio, shot entirely in black and white. He looks straight into the lens with intensity. The camera slowly pushes forward, creating a timeless editorial mood.
A model stands barefoot on the shoreline, wearing flowing designer fabric that moves with the wind. The golden sunset reflects on the water as the camera captures wide cinematic shots, then shifts to close-ups of the fabric details. Atmosphere: dreamy, luxurious, and fashion-forward.
A young woman in an elegant evening gown stands on a rocky cliff at night, the starry sky stretching above. The camera pans slowly across her profile, highlighting the gown shimmering under moonlight.
A young man stands barefoot on the beach during sunset, the golden sky reflecting on the waves. He looks into the distance with a calm expression. The scene feels like a warm memory captured on film.
PixVerse V5 Text-to-Video is PixVerse's latest text-to-video generation model with support for resolutions up to Full HD 1080p. Generate stunning, detailed videos from text descriptions with flexible aspect ratios and multiple resolution tiers.
| Parameter | Required | Description |
|---|---|---|
| prompt | Yes | Text description of the video you want to generate. |
| resolution | No | Output resolution: 360p, 540p, 720p, 1080p (default: 1080p). |
| aspect_ratio | No | Output aspect ratio: 16:9, 4:3, 1:1, 3:4, 9:16 (default: 9:16). |
| duration | No | Video length: 5 or 8 seconds (default: 5). |
| negative_prompt | No | Elements to avoid in the output. |
| seed | No | Set for reproducibility; leave empty for random. |
| Resolution | Best For |
|---|---|
| 360p | Quick previews, drafts, low-bandwidth use |
| 540p | Social media, balanced quality and speed |
| 720p | HD output, general professional use |
| 1080p | Full HD, premium quality, final delivery |
| Aspect Ratio | Best For |
|---|---|
| 16:9 | YouTube, widescreen displays |
| 4:3 | Classic format, presentations |
| 1:1 | Instagram posts, social media squares |
| 3:4 | Portrait photos, Pinterest |
| 9:16 | TikTok, Instagram Stories, Reels |
| Resolution | 5 seconds | 8 seconds |
|---|---|---|
| 360p | $0.15 | $0.30 |
| 540p | $0.15 | $0.30 |
| 720p | $0.20 | $0.40 |
| 1080p | $0.40 | $0.80 |
Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/pixverse/pixverse-v5-t2v with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status flips to completed, then read the output URL from data.outputs[0]. Examples for Pixverse v5 T2v below.
# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/pixverse/pixverse-v5-t2v" \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $WAVESPEED_API_KEY" \
-d '{
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"resolution": "540p",
"aspect_ratio": "16:9",
"duration": 5,
"negative_prompt": "blurry, low quality, distorted",
"seed": 0
}'
# Response includes a prediction id. Poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
-H "Authorization: Bearer $WAVESPEED_API_KEY"
# When status is "completed", read the output from data.outputs[0].// npm install wavespeed
const WaveSpeed = require('wavespeed');
const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env
const result = await client.run("pixverse/pixverse-v5-t2v", {
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"resolution": "540p",
"aspect_ratio": "16:9",
"duration": 5,
"negative_prompt": "blurry, low quality, distorted",
"seed": 0
});
console.log(result.outputs[0]); // → URL of the generated output# pip install wavespeed
import wavespeed
output = wavespeed.run(
"pixverse/pixverse-v5-t2v",
{
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"resolution": "540p",
"aspect_ratio": "16:9",
"duration": 5,
"negative_prompt": "blurry, low quality, distorted",
"seed": 0
}
)
print(output["outputs"][0]) # → URL of the generated outputPixverse v5 T2v is a Pixverse model for video generation, exposed as a REST API on WaveSpeedAI. PixVerse V5 Text-to-Video generates smooth, natural 5s videos from text prompts in seconds, with 720p output available ($0.20 per 5s). Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing. You can call it programmatically or try it from the playground above.
POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/pixverse/pixverse-pixverse-v5-t2v.
Pixverse v5 T2v starts at $0.15 per run. That figure is the base price — the final charge scales with the parameters you set in the form (output size, length, count, references, or whatever knobs this model exposes), so a higher-quality or larger output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.
Key inputs: `prompt`, `aspect_ratio`, `resolution`, `duration`, `seed`, `negative_prompt`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/pixverse/pixverse-pixverse-v5-t2v.
Average end-to-end generation time on WaveSpeedAI is around 64 seconds per request — measured across recent runs. Queue time scales with global demand; live status is visible in the prediction record.
Commercial usage rights depend on the model's license, set by its provider (Pixverse). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.