Home/Explore/OpenAI Models/openai/sora-2/text-to-video
text-to-video

text-to-video

OpenAI Sora 2 | Text-To-Video With Synchronized Audio, Realistic Physics, Enhanced Steerability | WaveSpeedAI

openai/sora-2/text-to-video

OpenAI Sora 2 is a state-of-the-art text-to-video model with realistic visuals, accurate physics, synchronized audio, and strong steerability. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Idle

Your request will cost $0.4 per run.

For $10 you can run this model approximately 25 times.

One more thing::

ExamplesView all

README

OpenAI Sora 2 — Text-to-Video

Sora 2 is a state-of-the-art video+audio generator. It advances prior video models with more accurate physics, sharper realism, synchronized audio, stronger steerability, and a wider stylistic range—built on the original Sora foundation.

Why it looks great

  • Physics-aware motion: learns contact, inertia, and momentum so objects move and collide believably.
  • Temporal consistency: stable identities, minimal flicker/ghosting, and clean frame-to-frame transitions.
  • Synchronized audio: lip-sync alignment, beat-aware cuts, and ambience that matches on-screen action.
  • High-frequency detail: preserves fine textures (skin, fabric, foliage) without plastic over-sharpening.
  • Complex scene reasoning: handles multiple subjects, occlusions, depth, and long camera moves coherently.
  • Cinematic camera literacy: natural pans, push-ins, and handheld vibes without warping or jelly-artifacts.
  • Wide stylistic range: from photoreal and documentary to anime, 3D, and illustrative aesthetics.
  • Strong steerability: responds predictably to prompt edits and control settings (duration, fps, motion strength).

How to Use

  1. Prompt: describe scene, style, camera, and audio cues.
  2. Duration: select 4s, 8s, or 12s.
  3. Submit: start generation; preview and download when ready.

Pricing

DurationTotal ($)
4s0.40
8s0.80
12s1.20

Billing Rules: Pricing scales linearly with duration (flat $0.10/s). Durations are fixed at 4s, 8s, or 12s.

Note

Please follow the user rules from OpenAI, you can find details in the reference: What images are permitted and prohibited in Sora-2