GPT Image 2 is LIVE Now. Try in Image Generator→
Home/Explore/OpenAI Models/openai/sora-2-pro/text-to-video

OpenAI Sora 2

openai /

OpenAI Sora 2 Pro is a state-of-the-art text-to-video model with realistic physics, synchronized audio, and strong steerability. Supports multiple resolutions up to 1080p and durations up to 20 seconds. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

text-to-video
Input

Idle

Your request will cost $1.2 per run.

One more thing:

ExamplesView all

README

OpenAI Sora 2 Pro Text-to-Video

Sora 2 Pro is OpenAI's premium video and audio generator. It advances prior video models with more accurate physics, sharper realism, synchronized audio, stronger steerability, and a wider stylistic range — built on the original Sora foundation.

Now with character consistency — create reusable character IDs and feature them across multiple videos with the same identity.

Why It Looks Great

  • Physics-aware motion: learns contact, inertia, and momentum so objects move and collide believably.
  • Temporal consistency: stable identities, minimal flicker/ghosting, and clean frame-to-frame transitions.
  • Synchronized audio: lip-sync alignment, beat-aware cuts, and ambience that matches on-screen action.
  • High-frequency detail: preserves fine textures (skin, fabric, foliage) without plastic over-sharpening.
  • Complex scene reasoning: handles multiple subjects, occlusions, depth, and long camera moves coherently.
  • Cinematic camera literacy: natural pans, push-ins, and handheld vibes without warping or jelly-artifacts.
  • Wide stylistic range: from photoreal and documentary to anime, 3D, and illustrative aesthetics.
  • Strong steerability: responds predictably to prompt edits and control settings (duration, fps, motion strength).

Parameters

ParameterRequiredDescription
promptYesDescribe scene, style, camera, and audio cues
sizeNoOutput resolution (see options below)
durationNoVideo length: 4, 8, 12, 16, or 20 seconds
charactersNoList of character IDs for consistent identity

Size Options

  • 720×1280 / 1280×720 (720p)
  • 1024×1792 / 1792×1024 (1024p)
  • 1080×1920 / 1920×1080 (1080p)

How to Use

  1. Write your prompt — describe scene, style, camera movement, and audio cues.
  2. Select size — choose resolution and orientation.
  3. Set duration — select 4, 8, 12, 16, or 20 seconds.
  4. Add characters (optional) — paste character IDs from Sora 2 Characters.
  5. Submit — generate, preview, and download when ready.

Pricing

Duration720p1024p1080p
4 s$1.20$2.00$2.80
8 s$2.40$4.00$5.60
12 s$3.60$6.00$8.40
16 s$4.80$8.00$11.20
20 s$6.00$10.00$14.00

Billing Rules

  • 720p rate: $0.30 per second
  • 1024p rate: $0.50 per second
  • 1080p rate: $0.70 per second
  • Duration options: 4, 8, 12, 16, or 20 seconds

Notes

Related Models