Seedance 2.0 Text-to-Video

bytedance/seedance-2.0/text-to-video

Seedance 2.0 (Text-to-Video) generates Hollywood-grade cinematic videos from text prompts with native audio-visual synchronization, director-level camera and lighting control, and exceptional motion stability. Built on ByteDance Seed's unified multimodal architecture, it leads on instruction adherence, motion quality, and visual aesthetics.

Your request costs $0.90 per run.

For $10, you can run this model approximately 11 times.

README

Seedance 2.0 Text-to-Video

Seedance 2.0 is ByteDance Seed's latest video generation model, built on a unified multimodal architecture that accepts text, image, audio, and video inputs. The Text-to-Video mode generates production-grade cinematic videos from text prompts alone — with native audio, director-level control, and exceptional motion stability.

Key Features

  • Unified multimodal architecture: A single model that handles text, image, audio, and video inputs for comprehensive creative flexibility.

  • Native audio-visual synchronization: Generates video with synchronized audio in a single pass, with no separate audio generation needed.

  • Director-level control: Granular control over camera movement, lighting, shadows, and character performance through natural language prompts.

  • Production-grade cinematic quality: Hollywood-grade visual fidelity with dramatic lighting, professional color grading, and smooth natural motion.

  • Exceptional motion stability: Industry-leading motion coherence with stable subjects, consistent physics, and fluid transitions.

  • Strong instruction adherence: Accurately follows detailed scene descriptions, shot compositions, and creative direction.

Parameters

| Parameter        | Required | Description                                                       |
|------------------|----------|-------------------------------------------------------------------|
| prompt           | Yes      | Detailed description of the cinematic scene                       |
| aspect_ratio     | No       | Output format: 16:9 (default), 9:16, 4:3, 3:4, 1:1, 21:9          |
| duration         | No       | Video length: 5 (default), 10, or 15 seconds                      |
| resolution       | No       | Output resolution: 480p, 720p (default), or 1080p                 |
| reference_images | No       | Reference image URLs to guide style, characters, or composition   |
| reference_videos | No       | Reference video URLs (total length must not exceed 15 seconds)    |
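
The parameter rules above can be checked before a request is sent. The following is a minimal sketch, not the platform's client: `build_payload` is a hypothetical helper, and the assumption that reference videos are passed as `(url, seconds)` pairs exists only so the 15-second total can be checked locally. The parameter names, allowed values, and defaults are taken from the table.

```python
from typing import Any, Dict, Optional, Sequence, Tuple

# Allowed values as listed in the parameter table.
ASPECT_RATIOS = {"16:9", "9:16", "4:3", "3:4", "1:1", "21:9"}
DURATIONS = {5, 10, 15}
RESOLUTIONS = {"480p", "720p", "1080p"}
MAX_REFERENCE_VIDEO_SECONDS = 15


def build_payload(
    prompt: str,
    aspect_ratio: str = "16:9",
    duration: int = 5,
    resolution: str = "720p",
    reference_images: Optional[Sequence[str]] = None,
    # Assumption: the caller knows each clip's length and passes (url, seconds).
    reference_videos: Optional[Sequence[Tuple[str, float]]] = None,
) -> Dict[str, Any]:
    """Validate inputs and return a request body (hypothetical helper)."""
    if not prompt or not prompt.strip():
        raise ValueError("prompt is required")
    if aspect_ratio not in ASPECT_RATIOS:
        raise ValueError(f"unsupported aspect_ratio: {aspect_ratio}")
    if duration not in DURATIONS:
        raise ValueError(f"unsupported duration: {duration}")
    if resolution not in RESOLUTIONS:
        raise ValueError(f"unsupported resolution: {resolution}")

    payload: Dict[str, Any] = {
        "prompt": prompt.strip(),
        "aspect_ratio": aspect_ratio,
        "duration": duration,
        "resolution": resolution,
    }
    if reference_images:
        payload["reference_images"] = list(reference_images)
    if reference_videos:
        total = sum(seconds for _, seconds in reference_videos)
        if total > MAX_REFERENCE_VIDEO_SECONDS:
            raise ValueError("reference videos exceed 15 seconds in total")
        # Only the URLs go into the payload; lengths were for local checking.
        payload["reference_videos"] = [url for url, _ in reference_videos]
    return payload
```

Calling `build_payload("A lighthouse at dusk")` returns a body with only the prompt and the three documented defaults; how that body is submitted depends on the hosting platform and is not covered here.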

How to Use

  1. Write your prompt — describe the scene with cinematic detail: lighting, mood, camera movement, action, and style.
  2. Select aspect ratio — 16:9 for widescreen, 9:16 for vertical, 4:3 or 3:4 for classic formats.
  3. Set duration — choose 5, 10, or 15 seconds.
  4. Optionally add references — provide reference images or videos for style guidance.
  5. Run — submit and download your cinematic video with synchronized audio.

Pricing

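| Resolution | Duration | Without Reference Videos | With Reference Videos |
|------------|----------|--------------------------|-----------------------|
| 480p       | 5 s      | $0.90                    | $1.80                 |
| 480p       | 10 s     | $1.80                    | $3.60                 |
| 480p       | 15 s     | $2.70                    | $5.40                 |
| 720p       | 5 s      | $1.80                    | $3.60                 |
| 720p       | 10 s     | $3.60                    | $7.20                 |
| 720p       | 15 s     | $5.40                    | $10.80                |
| 1080p      | 5 s      | $2.70                    | $5.40                 |
| 1080p      | 10 s     | $5.40                    | $10.80                |
| 1080p      | 15 s     | $8.10                    | $16.20                |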

Billing Rules

  • Base rate (480p): $0.90 per 5 seconds without reference videos, or $1.80 per 5 seconds with reference videos; the price scales linearly with duration
  • 720p: 2x the 480p price
  • 1080p: 3x the 480p price
  • Duration options: 5, 10, or 15 seconds
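
The billing rules reduce to a small formula, sketched below as an illustration rather than the platform's billing code. The function names are hypothetical. Amounts are kept in integer cents to avoid floating-point rounding, and the doubling for reference videos is read off the pricing table.

```python
BASE_CENTS_PER_5S = 90  # 480p, 5 seconds, no reference videos
RESOLUTION_MULTIPLIER = {"480p": 1, "720p": 2, "1080p": 3}
DURATIONS = (5, 10, 15)


def estimate_cost_cents(resolution: str, duration: int,
                        with_reference_videos: bool = False) -> int:
    """Return the price of one run in cents (hypothetical helper)."""
    if resolution not in RESOLUTION_MULTIPLIER:
        raise ValueError(f"unsupported resolution: {resolution}")
    if duration not in DURATIONS:
        raise ValueError(f"unsupported duration: {duration}")
    # Price scales with the number of 5-second blocks and the resolution tier.
    cents = BASE_CENTS_PER_5S * (duration // 5) * RESOLUTION_MULTIPLIER[resolution]
    # Runs with reference videos cost twice as much in every table row.
    return cents * 2 if with_reference_videos else cents


def runs_for_budget(budget_cents: int, cost_cents: int) -> int:
    """Return how many whole runs fit into a budget."""
    return budget_cents // cost_cents
```

For example, `estimate_cost_cents("1080p", 15, True)` gives 1620 cents, matching the $16.20 table entry, and `runs_for_budget(1000, 90)` gives 11, matching the "approximately 11 runs for $10" figure quoted for the cheapest configuration.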

Best Use Cases

  • Film & Production — Generate cinematic footage for professional video projects.
  • Commercials & Ads — Create high-end promotional content with Hollywood aesthetics.
  • Music Videos — Produce visually stunning sequences with native audio sync.
  • Social Media Premium — Stand out with film-quality short-form content.
  • Concept Visualization — Pitch film and TV concepts with production-quality previews.

Pro Tips

  • Write prompts like a film director — include lighting (e.g., "dramatic rim lighting"), camera angles, and mood.
  • Use 16:9 for cinematic widescreen; 9:16 for premium vertical content.
  • Include specific visual details for best results (e.g., "golden hour sunlight casting long shadows").
  • Describe character expressions and actions for more engaging scenes.
  • Start with 5s duration to iterate on the look, then extend to 10s or 15s.
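
One way to apply these tips consistently is to assemble prompts from named parts. The sketch below is only an illustration: `compose_prompt` and its labelled-sentence layout are assumptions, and nothing in the model documentation says this exact wording performs better than free-form text.

```python
from typing import Sequence


def compose_prompt(subject: str, action: str = "", lighting: str = "",
                   camera: str = "", mood: str = "",
                   details: Sequence[str] = ()) -> str:
    """Join director-style components into one prompt string (hypothetical)."""
    # Lead with who or what is on screen and what they are doing.
    sentences = [f"{subject} {action}".strip()]
    if lighting:
        sentences.append(f"Lighting: {lighting}")
    if camera:
        sentences.append(f"Camera: {camera}")
    if mood:
        sentences.append(f"Mood: {mood}")
    if details:
        sentences.append("Details: " + ", ".join(details))
    return ". ".join(sentences) + "."


prompt = compose_prompt(
    "A lone violinist",
    action="plays on a rooftop",
    lighting="dramatic rim lighting",
    camera="slow dolly-in",
    mood="melancholic",
    details=["golden hour sunlight casting long shadows"],
)
```

Keeping the components separate makes it easy to change one element, such as the camera move, between 5-second test runs while holding the rest of the scene fixed.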

Notes

  • Native audio generation is included — videos come with synchronized sound.
  • Duration options: 5, 10, or 15 seconds.
  • Built on the same architecture as Seedance 2.0 Image-to-Video.
