

openai/sora-2/text-to-video

OpenAI's Sora 2 is a new state-of-the-art video and audio generation model. Building on the foundation of Sora, it introduces capabilities that have been difficult for prior video models to achieve, such as more accurate physics, sharper realism, synchronized audio, enhanced steerability, and an expanded stylistic range.


Your request will cost $0.40 per run.

For $10 you can run this model approximately 25 times.



OpenAI Sora 2 — Text-to-Video

Sora 2 is a state-of-the-art video and audio generator. Built on the original Sora foundation, it advances prior video models with more accurate physics, sharper realism, synchronized audio, stronger steerability, and a wider stylistic range.

Why it looks great

  • Physics-aware motion: learns contact, inertia, and momentum so objects move and collide believably.
  • Temporal consistency: stable identities, minimal flicker/ghosting, and clean frame-to-frame transitions.
  • Synchronized audio: lip-sync alignment, beat-aware cuts, and ambience that matches on-screen action.
  • High-frequency detail: preserves fine textures (skin, fabric, foliage) without plastic over-sharpening.
  • Complex scene reasoning: handles multiple subjects, occlusions, depth, and long camera moves coherently.
  • Cinematic camera literacy: natural pans, push-ins, and handheld vibes without warping or jelly-artifacts.
  • Wide stylistic range: from photoreal and documentary to anime, 3D, and illustrative aesthetics.
  • Strong steerability: responds predictably to prompt edits and control settings (duration, fps, motion strength).

How to Use

  1. Prompt: describe scene, style, camera, and audio cues.
  2. Duration: select 4s, 8s, or 12s.
  3. Submit: start generation; preview and download when ready.
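The first two steps can be sketched as a small helper that assembles a prompt from the four cue types and checks the duration before anything is submitted. This is only an illustration: the function name `build_request` and the `prompt` and `duration` field names are hypothetical and are not taken from any documented Sora 2 API; only the allowed durations (4s, 8s, 12s) come from this page.

```python
# Durations listed on this page; anything else is rejected before submission.
ALLOWED_DURATIONS = (4, 8, 12)


def build_request(scene, duration_s, style="", camera="", audio=""):
    """Combine prompt cues into one prompt string and validate the duration.

    The returned dict uses hypothetical field names ("prompt", "duration");
    adapt them to whatever interface you actually submit through.
    """
    if duration_s not in ALLOWED_DURATIONS:
        raise ValueError(f"duration must be one of {ALLOWED_DURATIONS}, got {duration_s}")
    if not scene.strip():
        raise ValueError("scene description must not be empty")

    labeled = [("", scene), ("Style: ", style), ("Camera: ", camera), ("Audio: ", audio)]
    # Keep only the cues that were provided, and normalize trailing periods.
    sentences = [label + text.strip().rstrip(".") for label, text in labeled if text.strip()]
    return {"prompt": ". ".join(sentences) + ".", "duration": duration_s}


request = build_request(
    "A red kite over a windy beach",
    8,
    style="documentary",
    camera="slow handheld push-in",
    audio="wind and distant waves",
)
print(request["prompt"])
```

Labeling each cue keeps later prompt edits localized: changing only the camera or audio cue leaves the rest of the prompt untouched.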

Pricing

Duration    Total ($)
4s          0.40
8s          0.80
12s         1.20

Billing Rules: Pricing scales linearly with duration (flat $0.10/s). Durations are fixed at 4s, 8s, or 12s.
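As a quick check of the billing rule, the following minimal sketch computes the cost per run and the number of runs a budget covers. The rate and the allowed durations come from this page; the function names are illustrative, and `Decimal` is used only to avoid floating-point rounding on currency values.

```python
from decimal import Decimal

RATE_PER_SECOND = Decimal("0.10")   # flat rate stated in the billing rules
ALLOWED_DURATIONS = (4, 8, 12)      # fixed durations, in seconds


def run_cost(duration_s):
    """Cost in dollars of one run at the given duration."""
    if duration_s not in ALLOWED_DURATIONS:
        raise ValueError(f"duration must be one of {ALLOWED_DURATIONS}, got {duration_s}")
    return RATE_PER_SECOND * duration_s


def runs_for_budget(budget, duration_s):
    """Number of whole runs that fit in the budget (partial runs are dropped)."""
    return int(Decimal(str(budget)) // run_cost(duration_s))


for d in ALLOWED_DURATIONS:
    print(f"{d}s: ${run_cost(d)} per run, {runs_for_budget(10, d)} runs for $10")
```

At 4s this reproduces the figures quoted earlier: $0.40 per run and 25 runs for $10. Longer clips cost proportionally more, so the same $10 covers fewer runs.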

Note

Please follow OpenAI's usage rules. For details, see the reference: What images are permitted and prohibited in Sora-2