Nano Banana 2 & Pro Sale — 15% OFF | Apr 1–15 Only
ホーム/探索/Seedance 2.0 Models /bytedance/seedance-2.0/text-to-video
text-to-video

text-to-video

Seedance 2.0 Text-to-Video

bytedance/seedance-2.0/text-to-video

Seedance 2.0 (Text-to-Video) generates Hollywood-grade cinematic videos from text prompts with native audio-visual synchronization, director-level camera and lighting control, and exceptional motion stability. Built on ByteDance Seed's unified multimodal architecture, it leads on instruction adherence, motion quality, and visual aesthetics.

Input

Idle

このリクエストには1回あたりで$0.9の費用がかかります。

$10でおよそ11回実行できます。

もうひとつお知らせ:

サンプルすべて表示

README

Seedance 2.0 Text-to-Video

Seedance 2.0 is ByteDance Seed's latest video generation model, built on a unified multimodal architecture that accepts text, image, audio, and video inputs. The Text-to-Video mode generates production-grade cinematic videos from text prompts alone — with native audio, director-level control, and exceptional motion stability.

Key Features

  • Unified multimodal architecture A single model that handles text, image, audio, and video inputs for comprehensive creative flexibility.

  • Native audio-visual synchronization Generates video with synchronized audio in a single pass — no separate audio generation needed.

  • Director-level control Granular control over camera movement, lighting, shadows, and character performance through natural language prompts.

  • Production-grade cinematic quality Hollywood-grade visual fidelity with dramatic lighting, professional color grading, and smooth natural motion.

  • Exceptional motion stability Industry-leading motion coherence with stable subjects, consistent physics, and fluid transitions.

  • Strong instruction adherence Accurately follows detailed scene descriptions, shot compositions, and creative direction.

Parameters

ParameterRequiredDescription
promptYesDetailed description of the cinematic scene
aspect_ratioNoOutput format: 16:9 (default), 9:16, 4:3, 3:4, 1:1, 21:9
durationNoVideo length: 5 (default), 10, or 15 seconds
resolutionNoOutput resolution: 480p, 720p (default), or 1080p
reference_imagesNoReference image URLs to guide style, characters, or composition
reference_videosNoReference video URLs (total length must not exceed 15 seconds)

How to Use

  1. Write your prompt — describe the scene with cinematic detail: lighting, mood, camera movement, action, and style.
  2. Select aspect ratio — 16:9 for widescreen, 9:16 for vertical, 4:3 or 3:4 for classic formats.
  3. Set duration — choose 5, 10, or 15 seconds.
  4. Optionally add references — provide reference images or videos for style guidance.
  5. Run — submit and download your cinematic video with synchronized audio.

Pricing

ResolutionDurationWithout Reference VideosWith Reference Videos
480p5 s$0.90$1.80
480p10 s$1.80$3.60
480p15 s$2.70$5.40
720p5 s$1.80$3.60
720p10 s$3.60$7.20
720p15 s$5.40$10.80
1080p5 s$2.70$5.40
1080p10 s$5.40$10.80
1080p15 s$8.10$16.20

Billing Rules

  • Base rate (480p): $0.90 per 5 seconds (without reference videos), $1.80 with reference videos
  • 720p: 2x the 480p price
  • 1080p: 3x the 480p price
  • Duration options: 5, 10, or 15 seconds

Best Use Cases

  • Film & Production — Generate cinematic footage for professional video projects.
  • Commercials & Ads — Create high-end promotional content with Hollywood aesthetics.
  • Music Videos — Produce visually stunning sequences with native audio sync.
  • Social Media Premium — Stand out with film-quality short-form content.
  • Concept Visualization — Pitch film and TV concepts with production-quality previews.

Pro Tips

  • Write prompts like a film director — include lighting (e.g., "dramatic rim lighting"), camera angles, and mood.
  • Use 16:9 for cinematic widescreen; 9:16 for premium vertical content.
  • Include specific visual details for best results (e.g., "golden hour sunlight casting long shadows").
  • Describe character expressions and actions for more engaging scenes.
  • Start with 5s duration to iterate on the look, then extend to 10s or 15s.

Notes

  • Native audio generation is included — videos come with synchronized sound.
  • Duration options: 5, 10, or 15 seconds.
  • Built on the same architecture as Seedance 2.0 Image-to-Video.

Related Models