Seedance 1.5 Pro (Text-to-Video) generates cinematic, live-action–leaning clips from text with strong prompt adherence, expressive motion, and stable aesthetics. It supports 4–12s duration control (including Smart Duration), multiple aspect ratios (including adaptive), and reproducible generation via seeds—ideal for ads and short-drama workflows.
Idle
$0.26per run·~38 / $10
Shot 1 (establishing): Wide aerial of a quiet coastal cliff at sunrise, low fog rolling over the ocean, golden light breaking through thin clouds. A lone runner appears as a small silhouette on the winding path. Camera: smooth drone-like glide forward, slow and steady, cinematic pacing. Shot 2 (character): Medium tracking shot at ground level beside the runner, shoes crunching gravel, breath visible in the cool air, wind tugging at a lightweight jacket. Camera: gimbal-stable side-tracking, shallow depth of field, keep the runner’s face and jacket details consistent. Shot 3 (emotion): Close-up on the runner’s face—focused eyes, subtle micro-expressions, a quick swallow, determination building. Camera: gentle push-in, soft background bokeh, natural handheld micro-shake kept minimal. Shot 4 (end beat): The runner reaches the cliff overlook and slows to a stop; fog parts to reveal a vast sunlit ocean. The runner exhales and smiles slightly. Camera: slow tilt up from the runner to the horizon, hold for a calm finish. Style: photoreal live-action, natural sunrise lighting, filmic color grading, realistic wind and fabric motion, crisp facial detail. Continuity: same runner, same outfit, consistent sunrise direction and color temperature across shots; avoid warping, duplicate limbs, flicker, jump cuts, text overlays, logos.
Create a cinematic live-action style scene set in a quiet café during late afternoon. A man and a woman sit across from each other at a small wooden table, engaged in a tense conversation. The man speaks first, leaning forward slightly, his voice calm but serious. The woman responds with subtle hesitation, maintaining eye contact before glancing away briefly. Use natural lip-synced dialogue for both speakers, with soft ambient café sounds in the background. Camera starts with a medium two-shot, slowly pushes in toward a close-up as the conversation becomes more emotional. Lighting is warm and natural, with sunlight filtering through the window. Maintain realistic facial expressions, smooth body movement, and clear audiovisual synchronization throughout the 8–10 second scene.
A large glowing circular ring floating above a barren foggy landscape. The light is soft, warm and steady, slightly pulsating. The ring feels ancient, neutral, not technological. Minimalist cinematic apocalypse, volumetric fog, ultra realistic lighting. --ar 51:91 --motion high --video 1
The race cars roar past on a track set before the pyramids, yellow sand billowing up beneath their tires. The camera hugs the ground as a car blasts through a corner, dust scattering into the air. Multiple cars weave through the winding circuit, overtaking one another, the composition capturing the tension between the pyramids’ silhouettes and extreme speed. [Aerial drone shot] A bird’s-eye view of the entire desert track reveals the flow of cars, their trajectories intertwining with the glow of the setting sun to create a dynamic visual spectacle.
closeup image of a woman with long curly hair reading a novel in a field of flowers. sky is deep blue. the breeze is gently moving the flowers and her hair.
Seedance 1.5 Pro (T2V) is Seed’s production-oriented text-to-video model built for cinematic realism, strong prompt adherence, and high expressive motion. It is designed for ad creatives and short-drama workflows where aesthetic stability, emotion-rich acting, and controllable duration matter.
High-quality prompt alignment Accurately follows detailed scene descriptions, shot instructions, and emotional tone, producing coherent clips with stable subject and scene intent.
Expressive motion and emotion Natural motion amplitude and strong rhythm; fine-grained facial expressions and emotional delivery, suitable for drama-like performances and high-impact ads.
Strong aesthetics with live-action realism A “natural, live-action” leaning look with stable lighting, composition, and color harmony for premium-looking outputs.
Rich effects via prompting The base model supports a wide variety of visual effects achievable directly through prompting, enabling template-like creative effects without extra tooling.
Flexible duration control (4–12 seconds)
Specify any integer duration from 4s to 12s. Supports Smart Duration (-1) where the model selects an appropriate length within the same range.
Aspect ratio control
Choose either adaptive (model selects the best ratio from the prompt) or explicitly set one of:
16:9, 9:16, 4:3, 3:4, 1:1, 21:9.
prompt (required)
Text description of the scene, style, subject actions, camera motion, and overall mood.
duration
Integer seconds in [4, 12]. Use -1 for Smart Duration (model decides within [4, 12]).
resolution
One of 480p, 720p.
fps
Fixed at 24.
ratio
Use adaptive (model selects a suitable ratio) or set one of: 16:9, 9:16, 4:3, 3:4, 1:1, 21:9.
fixed_camera
Boolean. If true, the camera remains fixed; if false, camera motion is driven by the prompt.
seed
Integer. Controls randomness; the same seed tends to produce more similar outputs. Set -1 to cancel random seed.
camera_fixed = true and describe motion using subject movement rather than camera movement.duration = -1, write prompts with a clear narrative “end beat” (a natural stopping point) so the model has a reason to conclude.Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/bytedance/seedance-v1.5-pro/text-to-video with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status flips to completed, then read the output URL from data.outputs[0]. Examples for Seedance v1.5 Pro Text To Video below.
# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/bytedance/seedance-v1.5-pro/text-to-video" \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $WAVESPEED_API_KEY" \
-d '{
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"aspect_ratio": "16:9",
"duration": 5,
"resolution": "720p",
"generate_audio": true,
"camera_fixed": false,
"seed": -1
}'
# Response includes a prediction id. Poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
-H "Authorization: Bearer $WAVESPEED_API_KEY"
# When status is "completed", read the output from data.outputs[0].// npm install wavespeed
const WaveSpeed = require('wavespeed');
const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env
const result = await client.run("bytedance/seedance-v1.5-pro/text-to-video", {
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"aspect_ratio": "16:9",
"duration": 5,
"resolution": "720p",
"generate_audio": true,
"camera_fixed": false,
"seed": -1
});
console.log(result.outputs[0]); // → URL of the generated output# pip install wavespeed
import wavespeed
output = wavespeed.run(
"bytedance/seedance-v1.5-pro/text-to-video",
{
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"aspect_ratio": "16:9",
"duration": 5,
"resolution": "720p",
"generate_audio": true,
"camera_fixed": false,
"seed": -1
}
)
print(output["outputs"][0]) # → URL of the generated outputSeedance v1.5 Pro Text To Video is a ByteDance model for video generation, exposed as a REST API on WaveSpeedAI. Seedance 1.5 Pro (Text-to-Video) generates cinematic, live-action–leaning clips from text with strong prompt adherence, expressive motion, and stable aesthetics. It supports 4–12s duration control (including Smart Duration), multiple aspect ratios (including adaptive), and reproducible generation via seeds—ideal for ads and short-drama workflows. You can call it programmatically or try it from the playground above.
POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/bytedance/bytedance-seedance-v1.5-pro-text-to-video.
Seedance v1.5 Pro Text To Video starts at $0.26 per run. That figure is the base price — the final charge scales with the parameters you set in the form (output size, length, count, references, or whatever knobs this model exposes), so a higher-quality or larger output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.
Key inputs: `prompt`, `aspect_ratio`, `resolution`, `duration`, `seed`, `camera_fixed`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/bytedance/bytedance-seedance-v1.5-pro-text-to-video.
Average end-to-end generation time on WaveSpeedAI is around 119 seconds per request — measured across recent runs. Queue time scales with global demand; live status is visible in the prediction record.
Commercial usage rights depend on the model's license, set by its provider (ByteDance). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.