Seedance 1.5 Pro Image-to-Video generates cinematic, live-action–leaning clips from a text prompt plus a first-frame image, preserving the image’s subject and composition while adding expressive motion and stable aesthetics. It supports 4–12s duration control (including Smart Duration), adaptive aspect ratio that follows the input image, and reproducible outputs via seeds—ideal for ad creatives and short-drama shots that need a strong visual anchor.
Idle
$0.26per run·~38 / $10
A wide cinematic anime shot in a sunlit, ruined futuristic plaza. A blue-haired girl in a loose white T-shirt and denim shorts stands centered in the foreground, facing camera, while a towering white-and-orange mech looms protectively behind her. Subtle motion: a light breeze gently flutters her hair and shirt; tiny dust motes drift across the frame; faint heat haze shimmers above the cracked concrete. The mech slowly “comes alive” with restrained, realistic movement—soft servo whirs, a small shift of weight, fingers flex slightly, chest vents pulse with a dim glow, and a brief puff of steam from a joint. Camera behavior: slow, steady dolly-in toward the girl with a slight upward tilt, keeping the original composition and scale; gentle parallax on foreground cracks and distant ruined structures; minimal handheld micro-shake for realism. Look/style: high-quality cel-shaded anime illustration, clean linework, soft watercolor textures, bright midday sky with fluffy clouds, cinematic lighting, natural motion blur, no cuts. Avoid identity drift, warping, extra limbs, text, or sudden camera jumps.
The character remains visually consistent with the image, maintaining facial structure, clothing, and lighting. The scene begins with the character standing still, then slowly turning their head toward the camera, followed by a subtle shift in posture and a natural blink. Add gentle camera movement with a slow handheld-style push-in. Include soft environmental sounds such as distant city noise or wind. Preserve live-action realism, smooth motion transitions, and stable temporal consistency across the 6–8 second video.
functional winter comforter. light yellow puffy winter comforter floating against a blue sky, the style should be minimalist, sleek, and high-end, suitable for advertising or branding materials.
Generate a cinematic transition shot based on the provided first and final frames. First frame: A drone-eye overhead shot of the entire city skyline, with the camera slowly pushing in. Maintain consistent scene color tones throughout the transition. Design a smooth, narrative animation sequence: the camera continues its slow push-in over the panoramic city view, then seamlessly zooms in further to a speeding train. Incorporate soft ambient city sounds in the background. The 8–12 second clip must ensure natural and fluid motion, clean and crisp visuals, and precise audiovisual synchronization.
Generate a cinematic transition between the provided first and last frames. The first frame shows the character standing at the edge of a rooftop at sunset, while the final frame shows the character seated, looking out over the city as night falls. Maintain consistent character appearance, clothing, and environment throughout the scene. Animate a smooth narrative progression: the character walks forward, pauses, then slowly sits down. Use a controlled camera pan combined with a gradual change in lighting from warm sunset tones to cool nighttime hues. Include subtle ambient city sounds and ensure natural motion, clean visuals, and precise audiovisual alignment across the 8–12 second clip.”
Seedance V1.5 Pro (Image-to-Video) turns a single reference image into a short video clip, using your prompt to guide motion, camera behavior, and overall style. It’s a practical choice when you want to “bring a keyframe to life” while keeping the original composition as an anchor.
This wrapper is best for short, shot-based generations: portraits with subtle motion, product shots, scenic pans, and cinematic beats where prompt-controlled camera and action matter.
Image-conditioned motion generation Animates a still image into a coherent clip, aiming to preserve the subject and scene layout while introducing natural movement.
Prompt-controlled action and camera Handles prompts that describe what moves and how the camera behaves (e.g., “slow dolly-in,” “handheld feel,” “locked-off tripod shot”).
Flexible output framing Supports common aspect ratios for landscape, vertical, and square content, so you can target feeds, stories, and banners.
Quality / cost tuning via resolution and duration Lets you trade off speed, cost, and visual detail by adjusting resolution and clip length.
Keep prompts shot-focused and concrete. A reliable structure:
Example
“Close-up portrait in soft window light. The subject slowly turns toward camera and smiles. Subtle handheld feel, shallow depth of field, cinematic color grade, film grain.”
Tips
camera_fixed: true and describe a locked-off camera in the prompt.image.last_image, choose an end frame with similar framing and lighting to reduce jumpy transitions.duration Start short for iteration. Increase length only after motion and framing look right.
resolution
Use 480p for fast previews, 720p for a balance of detail and cost.
aspect_ratio Match your target platform:
16:9 for landscape and cinematic shots
9:16 for vertical shorts / stories
1:1 for square feeds
4:3, 3:4, 21:9 for specific layouts
generate_audio Turn it on to speci to generate audio or not.
seed Set a fixed seed to compare prompt changes more fairly. -1 means a random seed will be used.
After you finish configuring the parameters, click Run, preview the result, and iterate if needed.
Pricing is parameter-related.
What this means in practice:
duration.resolution: "720p" costs ~2.17× the non-720p branch (1 / 0.461538).generate_audio: true costs 2× generate_audio: false.| Resolution | Duration | generate_audio | Cost per run |
|---|---|---|---|
| 480p | 5s | false | $0.06 |
| 480p | 5s | true | $0.12 |
| 720p | 5s | false | $0.13 |
| 720p | 5s | true | $0.26 |
| 480p | 10s | true | $0.24 |
| 720p | 10s | true | $0.52 |
aspect_ratio to avoid awkward crops or stretched motion.camera_fixed).480p + shorter duration, then scale up once the motion and framing are correct.Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/bytedance/seedance-v1.5-pro/image-to-video with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status flips to completed, then read the output URL from data.outputs[0]. Examples for Seedance v1.5 Pro Image To Video below.
# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/bytedance/seedance-v1.5-pro/image-to-video" \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $WAVESPEED_API_KEY" \
-d '{
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"image": "https://example.com/your-input.jpg",
"aspect_ratio": "21:9",
"duration": 5,
"resolution": "720p",
"generate_audio": true,
"camera_fixed": false,
"seed": -1
}'
# Response includes a prediction id. Poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
-H "Authorization: Bearer $WAVESPEED_API_KEY"
# When status is "completed", read the output from data.outputs[0].// npm install wavespeed
const WaveSpeed = require('wavespeed');
const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env
const result = await client.run("bytedance/seedance-v1.5-pro/image-to-video", {
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"image": "https://example.com/your-input.jpg",
"aspect_ratio": "21:9",
"duration": 5,
"resolution": "720p",
"generate_audio": true,
"camera_fixed": false,
"seed": -1
});
console.log(result.outputs[0]); // → URL of the generated output# pip install wavespeed
import wavespeed
output = wavespeed.run(
"bytedance/seedance-v1.5-pro/image-to-video",
{
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"image": "https://example.com/your-input.jpg",
"aspect_ratio": "21:9",
"duration": 5,
"resolution": "720p",
"generate_audio": true,
"camera_fixed": false,
"seed": -1
}
)
print(output["outputs"][0]) # → URL of the generated outputSeedance v1.5 Pro Image To Video is a ByteDance model for video generation from images, exposed as a REST API on WaveSpeedAI. Seedance 1.5 Pro Image-to-Video generates cinematic, live-action–leaning clips from a text prompt plus a first-frame image, preserving the image’s subject and composition while adding expressive motion and stable aesthetics. It supports 4–12s duration control (including Smart Duration), adaptive aspect ratio that follows the input image, and reproducible outputs via seeds—ideal for ad creatives and short-drama shots that need a strong visual anchor. You can call it programmatically or try it from the playground above.
POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/bytedance/bytedance-seedance-v1.5-pro-image-to-video.
Seedance v1.5 Pro Image To Video starts at $0.26 per run. That figure is the base price — the final charge scales with the parameters you set in the form (output size, length, count, references, or whatever knobs this model exposes), so a higher-quality or larger output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.
Key inputs: `prompt`, `image`, `aspect_ratio`, `resolution`, `duration`, `seed`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/bytedance/bytedance-seedance-v1.5-pro-image-to-video.
Average end-to-end generation time on WaveSpeedAI is around 216 seconds per request — measured across recent runs. Queue time scales with global demand; live status is visible in the prediction record.
Commercial usage rights depend on the model's license, set by its provider (ByteDance). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.