Sora 2 Text to Video | Powerful Text-to-Video API

Главная/Обзор/OpenAI/Sora 2/Text To Video

openai /

OpenAI Sora 2 is a state-of-the-art text-to-video model with realistic visuals, accurate physics, synchronized audio, and strong steerability. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

text-to-video

Ввод

Enable Safety Checker

Ожидание

$0.4за запуск·~25 / $10

ПримерыСмотреть всё

Peter and Joe are playing on the grass in a park.

In a 90s documentary-style interview, an old Swedish man sits in a study and says, "I still remember when I was young."

Format & Look Duration 4s; 180° shutter; digital capture emulating 65 mm photochemical contrast; fine grain; subtle halation on speculars; no gate weave. Lenses & Filtration 32 mm / 50 mm spherical primes; Black Pro-Mist 1/4; slight CPL rotation to manage glass reflections on train windows. Grade / Palette Highlights: clean morning sunlight with amber lift. Mids: balanced neutrals with slight teal cast in shadows. Blacks: soft, neutral with mild lift for haze retention. Lighting & Atmosphere Natural sunlight from camera left, low angle (07:30 AM). Bounce: 4×4 ultrabounce silver from trackside. Negative fill from opposite wall. Practical: sodium platform lights on dim fade. Atmos: gentle mist; train exhaust drift through light beam. Location & Framing Urban commuter platform, dawn. Foreground: yellow safety line, coffee cup on bench. Midground: waiting passengers silhouetted in haze. Background: arriving train braking to a stop. Avoid signage or corporate branding. Wardrobe / Props / Extras Main subject: mid-30s traveler, navy coat, backpack slung on one shoulder, holding phone loosely at side. Extras: commuters in muted tones; one cyclist pushing bike. Props: paper coffee cup, rolling luggage, LED departure board (generic destinations). Sound Diegetic only: faint rail screech, train brakes hiss, distant announcement muffled (-20 LUFS), low ambient hum. Footsteps and paper rustle; no score or added foley. Optimized Shot List (2 shots / 4 s total) 0.00–2.40 — “Arrival Drift” (32 mm, shoulder-mounted slow dolly left) Camera slides past platform signage edge; shallow focus reveals traveler mid-frame looking down tracks. Morning light blooms across lens; train headlights flare softly through mist. Purpose: establish setting and tone, hint anticipation. 2.40–4.00 — “Turn and Pause” (50 mm, slow arc in) Cut to tighter over-shoulder arc as train halts; traveler turns slightly toward camera, catching sunlight rim across cheek and phone screen refle

Convenience store entrance after rain; street reflections; meteors streak above. Characters: Night clerk (blue vest) + lone traveler. Action: Clerk hands over hot cocoa; both glance up to watch a meteor; traveler bows in thanks. Camera: Warm interior push-out → meteor reflected in puddle → shoulders-together upshot → rack focus back to cup steam. Look & Lighting: Anime-real blend; clean mirror-wet pavement with cool/warm contrast. Physics & Motion: Stable handoff; believable steam and drips. Audio: Distant city ambience + light electronic pad; soft “Thanks—the road feels closer now.”

Morning above cloud sea; toast-shaped balloons drift. Characters: two travel vloggers in basket. Action: orbiting drone shot; a seagull swoops; balloon makes a tiny “bounce.” Camera: orbit + gentle dolly; autofocus through clouds. Look: bright/clean sky blue; toasted surface texture. Motion: volumetric clouds and consistent lighting. Audio: burner whoosh + soft whistle; line: “Morning from the sky!”

Modern office afternoon, sunlight on desk plants. Character: quiet “ninja” intern; hoodie with a smiley sticker mask. Action: tiptoes to refill coffee; folds tiny paper shuriken reading “Keep going!” for each desk; “shh” to camera. Camera: over-shoulder follow → paper close-up → co-workers’ reactions. Look: realistic with light comedy tone; controlled reflections. Motion: stable interactions, real paper bending. Audio: light percussion + paper rustle; whispered “Shh…”.

README

Sora 2 Text-to-Video

Notice — Service Stability

The Sora 2 family is currently unstable. Generations may fall back to alternative models without notice and the service can be temporarily unavailable. OpenAI is also expected to discontinue this model in the future.

If you need an equally capable, stable alternative, we recommend Seedance 2: bytedance/seedance-2.0/text-to-video.

Sora 2 Text-to-Video

Sora 2 Text-to-Video is OpenAI's text-to-video model purpose-built for scenes featuring multiple distinct characters simultaneously. Describe the scene in natural language, reference your pre-defined character IDs, and the model renders a cohesive, temporally consistent video where every character looks and moves exactly as intended — no manual compositing required.

Why Choose This?

True multi-character consistency Reference two or more character IDs in a single generation. Each character retains its unique appearance, proportions, and style throughout every frame.
Natural-language scene control Describe interactions, environments, and actions in plain text. The model understands spatial relationships and character dynamics to produce believable compositions.
Flexible aspect ratio support Choose between portrait (720×1280) and landscape (1280×720) orientations to match your target platform.
Scalable duration Generate clips from 4 seconds up to 20 seconds in fixed steps, giving you full control over pacing and output cost.
Production-ready output Delivers smooth, artifact-free motion suitable for marketing content, storytelling, game cinematics, and social media video.

Parameters

Parameter	Required	Description
prompt	Yes	Text description of the scene, characters, actions, and environment.
size	No	Output resolution: 720×1280 (portrait) or 1280×720 (landscape).
duration	No	Clip length in seconds. Options: 4, 8, 12, 16, 20.
characters	No	List of character IDs to include. Add one or more char_... identifiers.

How to Use

Write your prompt — describe what the characters are doing and where the scene takes place.
Select size — portrait (720×1280) for mobile/social, landscape (1280×720) for widescreen.
Set duration — choose 4, 8, 12, 16, or 20 seconds based on your scene length.
Add character IDs — click Add Item under the characters section to include each character by their unique identifier.
Submit — generate, preview, and download your video.

Pricing

Duration	Cost per Generation
4s	$0.40
8s	$0.80
12s	$1.20
16s	$1.60
20s	$2.00

Billing Rules

Rate: $0.10 per second
Duration options: 4, 8, 12, 16, or 20 seconds
Billing is based on the selected duration, not actual playback length

Best Use Cases

Brand & Marketing Videos — Feature multiple characters or spokespeople in a single scene without manual compositing.
Social Media Content — Produce portrait-format multi-character clips optimized for Reels, TikTok, and Shorts.
Game & IP Storytelling — Render in-world scenes with established characters maintaining consistent visual identity.
Educational & Explainer Content — Animate two or more characters interacting to illustrate concepts or narratives.
Advertising & Campaigns — Generate diverse cast scenarios rapidly for A/B testing creative variations.

Pro Tips

Be specific about character positions and actions in your prompt for better spatial composition.
Use portrait mode (720×1280) for mobile-first platforms and landscape (1280×720) for cinematic or desktop use.
Start with a 4-second generation to validate composition and character rendering before committing to a longer duration.
Ensure all referenced character IDs are valid and accessible in your account before submitting.

Notes

Character IDs must be created and saved in advance — this model references existing character profiles and does not create new definitions.
Only prompt is a required field; size, duration, and characters are optional.
Complex multi-character scenes benefit from concise, clearly structured prompts.

Related Models

Примечание:Этот сайт использует модели ИИ, предоставляемые третьими лицами.

ПримерыСмотреть всё

Похожие модели

README

Sora 2 Text-to-Video

Sora 2 Text-to-Video

Why Choose This?

Parameters

How to Use

Pricing

Billing Rules

Best Use Cases

Pro Tips

Notes

Related Models

Sora 2 Text To Video API — Quick start

Sora 2 Text To Video API — Frequently asked questions