Seedance 2.0 Models

Seedance 2.0 Models unify text-, image-, video-, and audio-driven generation with native audio sync, multi-shot storyboarding, and cinematic 2K quality

All Models

14 models
image-to-video

bytedance/seedance-2.0/image-to-video

Seedance 2.0 (Image-to-Video) generates Hollywood-grade cinematic videos from reference images and text prompts with native audio-visual synchronization, director-level camera and lighting control, and exceptional motion stability. Built on Seed's unified multimodal architecture, it preserves the input image's subject and composition while adding expressive, physically accurate motion.

video-to-video

bytedance/seedance-2.0/video-edit

Seedance 2.0 (Video-Edit) edits an input video from a natural-language prompt. The reference video drives subject identity, composition, and motion while the model rewrites lighting, style, weather, environment, or specific elements as instructed. Built on ByteDance Seed's unified multimodal architecture for cinematic, motion-stable output. Ready-to-use REST API, best performance, no cold starts, affordable pricing.

video-to-video

bytedance/seedance-2.0/video-edit-turbo

Seedance 2.0 (Video-Edit Turbo) is the turbo tier for editing an input video from a natural-language prompt: faster, more affordable high-resolution output that preserves subject identity, composition, and motion.

video-to-video

bytedance/seedance-2.0-fast/video-edit-turbo

Seedance 2.0 Fast (Video-Edit Turbo) is the fastest, cheapest turbo tier for editing an input video from a natural-language prompt, with high-resolution output optimized for cost and speed.

video-to-video

bytedance/seedance-2.0-fast/video-edit

Seedance 2.0 Fast (Video-Edit) edits an input video from a natural-language prompt at a faster, cheaper tier. Built on ByteDance Seed's unified multimodal architecture, it preserves subject identity, composition, and motion while rewriting lighting, style, weather, environment, or specific elements as instructed.

video-extend

bytedance/seedance-2.0/video-extend

Seedance 2.0 (Video-Extend) extends an input video with a new cinematic continuation generated from its last frame and a natural-language prompt.

video-extend

bytedance/seedance-2.0-fast/video-extend

Seedance 2.0 Fast (Video-Extend) extends an input video with a new cinematic continuation generated from its last frame and a natural-language prompt, at the faster, cheaper Seedance 2.0 Fast tier.

text-to-video

bytedance/seedance-2.0/text-to-video

Seedance 2.0 (Text-to-Video) generates Hollywood-grade cinematic videos from text prompts with native audio-visual synchronization, director-level camera and lighting control, and exceptional motion stability. Built on Seed's unified multimodal architecture, it leads on instruction adherence, motion quality, and visual aesthetics.

image-to-video

bytedance/seedance-2.0-fast/image-to-video

Seedance 2.0 Fast (Image-to-Video) generates cinematic videos from reference images and text prompts with native audio-visual synchronization, director-level control, and exceptional motion stability — optimized for faster generation at lower cost. Built on Seed's unified multimodal architecture.

text-to-video

bytedance/seedance-2.0-fast/text-to-video

Seedance 2.0 Fast (Text-to-Video) generates cinematic videos from text prompts with native audio-visual synchronization, director-level camera and lighting control, and exceptional motion stability — optimized for faster generation at lower cost. Built on Seed's unified multimodal architecture.

image-to-video

bytedance/seedance-2.0/image-to-video-turbo

Seedance 2.0 (Image-to-Video Turbo) generates cinematic 720p/1080p videos from reference images, delivering high-resolution output at near-480p speed with native audio-visual synchronization, director-level control, and exceptional motion stability.

text-to-video

bytedance/seedance-2.0/text-to-video-turbo

Seedance 2.0 (Text-to-Video Turbo) generates cinematic 720p/1080p videos from text prompts, delivering high-resolution output at near-480p speed with native audio-visual synchronization, director-level control, and exceptional motion stability.

image-to-video

bytedance/seedance-2.0-fast/image-to-video-turbo

Seedance 2.0 Fast (Image-to-Video Turbo) generates cinematic 720p/1080p videos from reference images using speed-optimized inference. It is the fastest and most affordable Seedance image-to-video option, with native audio-visual synchronization and director-level control.

text-to-video

bytedance/seedance-2.0-fast/text-to-video-turbo

Seedance 2.0 Fast (Text-to-Video Turbo) generates cinematic 720p/1080p videos from text prompts using speed-optimized inference. It is the fastest and most affordable Seedance text-to-video option, with native audio-visual synchronization and director-level control.
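
The identifiers in the catalog above follow a regular pattern: a family segment (seedance-2.0 or seedance-2.0-fast), a task name, and an optional -turbo suffix. The Python sketch below composes an identifier from those parts and checks it against the listed models, since not every combination exists (no turbo variant of video-extend is listed, for example). The function name `model_id` and its parameters are illustrative assumptions, not part of any published SDK.

```python
# Model identifiers copied verbatim from the catalog above.
SEEDANCE_MODELS = frozenset({
    "bytedance/seedance-2.0/image-to-video",
    "bytedance/seedance-2.0/video-edit",
    "bytedance/seedance-2.0/video-edit-turbo",
    "bytedance/seedance-2.0-fast/video-edit-turbo",
    "bytedance/seedance-2.0-fast/video-edit",
    "bytedance/seedance-2.0/video-extend",
    "bytedance/seedance-2.0-fast/video-extend",
    "bytedance/seedance-2.0/text-to-video",
    "bytedance/seedance-2.0-fast/image-to-video",
    "bytedance/seedance-2.0-fast/text-to-video",
    "bytedance/seedance-2.0/image-to-video-turbo",
    "bytedance/seedance-2.0/text-to-video-turbo",
    "bytedance/seedance-2.0-fast/image-to-video-turbo",
    "bytedance/seedance-2.0-fast/text-to-video-turbo",
})


def model_id(task: str, fast: bool = False, turbo: bool = False) -> str:
    """Compose an identifier (hypothetical helper) and confirm it is listed."""
    family = "seedance-2.0-fast" if fast else "seedance-2.0"
    suffix = f"{task}-turbo" if turbo else task
    candidate = f"bytedance/{family}/{suffix}"
    if candidate not in SEEDANCE_MODELS:
        raise ValueError(f"not in the catalog: {candidate}")
    return candidate
```

For example, `model_id("video-edit", fast=True, turbo=True)` yields the fast turbo video-edit identifier, while `model_id("video-extend", turbo=True)` raises `ValueError` because no such model is listed.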

Seedance 2.0 Models

ByteDance's Seedance 2.0 is a production-ready suite of AI video-generation endpoints featuring native audio-video co-generation, multi-shot storyboarding, and cinematic 2K quality. Its two core workflows, text-to-video generation and image-to-video animation, are each available in standard and fast tiers for flexible quality-speed trade-offs. The catalog above also lists video-edit and video-extend endpoints.

Seedance 2.0 Series — Text-to-Video & Image-to-Video API

Seedance 2.0 offers four focused endpoints for generating videos from text prompts or animating still images—ideal for cinematic content production, social media automation, and repeatable video workflows.

  1. Seedance 2.0 Text-to-Video — Generate high-quality cinematic videos from text prompts with native audio sync, realistic physics, and multi-shot scene transitions.
  2. Seedance 2.0 Image-to-Video — Animate any still image into a fluid video clip with consistent character preservation, natural motion, and synchronized audio output.
  3. Seedance 2.0 Fast Text-to-Video — Fast tier text-to-video generation optimized for speed and rapid iteration without sacrificing core motion quality.
  4. Seedance 2.0 Fast Image-to-Video — Fast tier image animation for high-throughput pipelines requiring quick turnaround on visual content.
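
As a rough illustration of choosing among the four endpoints above, the Python sketch below maps a workflow and tier to the matching model identifier and assembles a request description. The identifiers come from the catalog; the body field names `prompt` and `image` are assumptions made for illustration, since this page does not document the request schema, and no network call is made.

```python
from typing import Optional

# (workflow, tier) -> model identifier, taken from the catalog on this page.
ENDPOINTS = {
    ("text-to-video", "standard"): "bytedance/seedance-2.0/text-to-video",
    ("image-to-video", "standard"): "bytedance/seedance-2.0/image-to-video",
    ("text-to-video", "fast"): "bytedance/seedance-2.0-fast/text-to-video",
    ("image-to-video", "fast"): "bytedance/seedance-2.0-fast/image-to-video",
}


def build_request(workflow: str, tier: str, prompt: str,
                  image_url: Optional[str] = None) -> dict:
    """Pick an endpoint and assemble a request description (hypothetical schema)."""
    model = ENDPOINTS[(workflow, tier)]  # KeyError for an unknown combination
    if workflow == "image-to-video" and image_url is None:
        raise ValueError("image-to-video needs a reference image")
    if workflow == "text-to-video" and image_url is not None:
        raise ValueError("text-to-video takes only a text prompt")
    body = {"prompt": prompt}      # assumed field name
    if image_url is not None:
        body["image"] = image_url  # assumed field name
    return {"model": model, "body": body}
```

One possible pattern is to iterate on a prompt with `tier="fast"` and then rerun the final version with `tier="standard"`. The actual request format should be taken from the provider's API reference.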

Key Features

  1. Native Audio-Video Co-Generation — Video and audio are generated simultaneously in a single pass, delivering lip-synced dialogue, contextual sound effects, and adaptive music without post-production stitching.
  2. Multi-Shot Storyboarding — Generate up to 15-second clips composed of multiple natural shots with seamless cuts and transitions, producing edited-sequence output from a single prompt.
  3. Character Consistency — Facial features, clothing, and visual style are preserved frame-to-frame and across multiple generated clips using reference-based identity locking.
  4. Standard & Fast Tiers — Choose cinematic-quality standard or speed-optimized fast endpoints based on your latency and throughput requirements.
  5. Cinematic Camera Control — Director-level camera controls including push-in, pan, orbit, and tracking shots via natural language prompt keywords.
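
Because multi-shot storyboarding and camera control are both driven by the prompt, a small helper can keep a storyboard within the stated 15-second limit and restrict camera moves to the ones named above. The Python sketch below is one way to do that. The "Shot N: ..." wording is an assumption for illustration; this page does not define a storyboard syntax, and the model may respond equally well to other phrasings.

```python
from dataclasses import dataclass
from typing import List

MAX_CLIP_SECONDS = 15  # upper bound stated under Multi-Shot Storyboarding
CAMERA_MOVES = ("push-in", "pan", "orbit", "tracking shot")  # moves named above


@dataclass
class Shot:
    description: str
    camera: str
    seconds: float


def storyboard_prompt(shots: List[Shot]) -> str:
    """Join shots into one prompt string (hypothetical wording)."""
    total = sum(shot.seconds for shot in shots)
    if total > MAX_CLIP_SECONDS:
        raise ValueError(f"storyboard runs {total:g}s, limit is {MAX_CLIP_SECONDS}s")
    for shot in shots:
        if shot.camera not in CAMERA_MOVES:
            raise ValueError(f"unknown camera move: {shot.camera}")
    return " ".join(
        f"Shot {i}: {shot.description}, {shot.camera}, about {shot.seconds:g}s."
        for i, shot in enumerate(shots, start=1)
    )
```

For instance, a two-shot storyboard with a 6-second orbit followed by a 5-second tracking shot passes validation, while three 6-second shots would be rejected for exceeding 15 seconds.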