WaveSpeedAI APIOpenaiOpenai Sora 2 Text To Video Pro

Openai Sora 2 Text To Video Pro

Openai Sora 2 Text To Video Pro

Playground

Try it on WavespeedAI!

OpenAI’s Sora 2 is new state of the art video and audio generation model. Building on the foundation of Sora, this new model introduces capabilities that have been difficult for prior video models to achieve– such as more accurate physics, sharper realism, synchronized audio, enhanced steerability, and an expanded stylistic range.

Features

OpenAI Sora 2 — Text-to-Video-Pro

Sora 2 is a state-of-the-art video+audio generator. It advances prior video models with more accurate physics, sharper realism, synchronized audio, stronger steerability, and a wider stylistic range—built on the original Sora foundation.


Why it looks great

  • Physics-aware motion: learns contact, inertia, and momentum so objects move and collide believably.
  • Temporal consistency: stable identities, minimal flicker/ghosting, and clean frame-to-frame transitions.
  • Synchronized audio: lip-sync alignment, beat-aware cuts, and ambience that matches on-screen action.
  • High-frequency detail: preserves fine textures (skin, fabric, foliage) without plastic over-sharpening.
  • Complex scene reasoning: handles multiple subjects, occlusions, depth, and long camera moves coherently.
  • Cinematic camera literacy: natural pans, push-ins, and handheld vibes without warping or jelly-artifacts.
  • Wide stylistic range: from photoreal and documentary to anime, 3D, and illustrative aesthetics.
  • Strong steerability: responds predictably to prompt edits and control settings (duration, fps, motion strength).

How to Use

  1. Prompt: describe scene, style, camera, and audio cues.
  2. Duration: select 4s, 8s, or 12s.
  3. Submit: start generation; preview and download when ready.

Pricing

DurationResolutionTotal ($)
4s720×1280 / 1280×7201.20
8s720×1280 / 1280×7202.40
12s720×1280 / 1280×7203.60
4s1024×1792 / 1792×10242.00
8s1024×1792 / 1792×10244.00
12s1024×1792 / 1792×10246.00

Note

Please follow the user rules from OpenAI, you can find details in the reference: What images are permitted and prohibited in Sora-2

Authentication

For authentication details, please refer to the Authentication Guide.

API Endpoints

Submit Task & Query Result


# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/openai/sora-2/text-to-video-pro" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
    "size": "720*1280",
    "duration": 4
}'

# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"

Parameters

Task Submission Parameters

Request Parameters

ParameterTypeRequiredDefaultRangeDescription
promptstringYes-The positive prompt for the generation.
sizestringNo720*1280720*1280, 1280*720, 1024*1792, 1792*1024The size of the generated media in pixels (width*height).
durationintegerNo44, 8, 12The duration of the generated video in seconds.

Response Parameters

ParameterTypeDescription
codeintegerHTTP status code (e.g., 200 for success)
messagestringStatus message (e.g., “success”)
data.idstringUnique identifier for the prediction, Task Id
data.modelstringModel ID used for the prediction
data.outputsarrayArray of URLs to the generated content (empty when status is not completed)
data.urlsobjectObject containing related API endpoints
data.urls.getstringURL to retrieve the prediction result
data.has_nsfw_contentsarrayArray of boolean values indicating NSFW detection for each output
data.statusstringStatus of the task: created, processing, completed, or failed
data.created_atstringISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.errorstringError message (empty if no error occurred)
data.timingsobjectObject containing timing details
data.timings.inferenceintegerInference time in milliseconds

Result Request Parameters

© 2025 WaveSpeedAI. All rights reserved.