Vidu Q3 与 Q3 Pro 模型 5 折 · 仅限 WaveSpeedAI | 5月20日 – 6月2日
首页/探索/OpenAI/Sora 2/Image To Video Pro

Sora 2 Image to Video Pro

openai /

OpenAI Sora 2 Image-to-Video Pro creates physics-aware, realistic videos with synchronized audio and greater steerability. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-video
输入

拖放文件或点击上传

preview

就绪

$1.2每次运行

下一步:

示例查看全部

Action: She opened her hands Ambient Sound: The soft crackling of the dying fire in the oven; a high-pitched, happy little ding sound from the timer; the warm, persistent sizzle of butter melting on a nearby stovetop. Character Dialogue: (Voice is high-pitched, bubbly, and enthusiastic) "Welcome to my bakery!"

Action: The tortoise slowly raises its head, and its crystal shell catches the sunlight, momentarily casting a rainbow of light across the forest. It then closes its eyes as a tiny puff of magical mist rises from its back. Ambient Sound: The soft, constant drip-drip-drip of water filtering down through the cavern rocks; the low, deep rumble that comes from the tortoise's chest (a protective resonance); gentle wind chimes sound whenever the mist appears. Character Dialogue: (Voice is slow, ancient, and deep like moving earth) "Be still, little one. The forest remembers. All things are safe beneath the roots of the world."

Action: The character slowly unrolls the scroll, sighs softly, and uses a single finger to gently trace the fading characters on the parchment. He then looks up with a serene expression. Ambient Sound: The soft, rustling sound of silk as the scroll moves; the gentle, intermittent plink of cherry blossoms falling onto the stone ground; the very distant, calming trickle of a stream somewhere down the mountain. Character Dialogue: (Voice is calm, deep, and slightly resonant with age) "Patience is the truest form of power. All knowledge, like these blooms, returns to the earth in time. Observe and learn."

Action: He stops, lowers his gaze to the ground, and lets out a slow breath of cold air that briefly obscures his face before gripping the sword hilt tightly. Ambient Sound: The low, mournful howl of the wind sweeping through the pines; the crisp, soft crunch of boots on frozen gravel; the sharp, clear shing sound as the steel blade is drawn.

A nostalgic, rhythmic mood, with a slow, continuous circular orbit shot around the blurred record, emphasizing its steady rotation.

An aggressive, rapid motion, forward, with the tires spinning instantly into a high-speed blur, and the camera pulling back quickly (fast dolly out) as if accelerating away.

Action: The cube slowly spins faster, and the glowing runes pulse brightly for a moment, illuminating a dusty floor before returning to its steady, slow rotation. Ambient Sound: A deep, sustained electronic hum (the core power source); a very subtle, rhythmic tick-tock sound like an old clock deep within the mechanism; the faint echo of dripping water somewhere off-screen. Character Dialogue: (Voice is calm, synthesized, and androgynous) "Initiating sequence... Primary function: observation. Access denied to unauthorized entities. Remain dormant."

Action: The drone’s fins adjust slightly to maintain position, and its single robotic "eye" (lens) zooms in on a piece of strange, unknown wreckage in the gloom. A small puff of exhaust bubbles rises to the top of the frame. Ambient Sound: A constant, low-frequency sonar ping sound (slow and steady); muffled, bubbling noises from the drone's movement; the heavy, crushing silence of the deep ocean that dominates the background.

A delicate, ephemeral motion, with the dew droplets slowly beginning to slide down the petal, and a micro-level, gentle push-in (dolly in).

A wild, free mood, with a gentle, continuous horizontal pan (pan left or right) across the blurred grass, simulating the wind's uninterrupted flow.

a short prompt for mood, motion style, or camera behavior: a moody, quiet atmosphere, with a slow, subtle forward tracking shot (dolly in) towards the largest reflection, capturing the steaming manholes.

相关模型

README

Sora 2 Image-to-Video Pro

Notice — Service Stability

The Sora 2 family is currently unstable. Generations may fall back to alternative models without notice and the service can be temporarily unavailable. OpenAI is also expected to discontinue this model in the future.

If you need an equally capable, stable alternative, we recommend Seedance 2: bytedance/seedance-2.0/image-to-video.

OpenAI Sora 2 Image-to-Video Pro

Sora 2 Image-to-Video Pro is OpenAI's premium image animation model. Upload an image and describe the motion — AI transforms your still photo into a cinematic video with physics-aware movement, synchronized audio, and professional-grade quality.

Why Choose This?

  • Premium quality Higher fidelity output with enhanced detail preservation and motion coherence.

  • Physics-aware motion Learns contact, inertia, and momentum so objects move and collide believably.

  • Synchronized audio Generates matching audio — ambient sounds, dialogue, and sound effects.

  • Temporal consistency Stable identities, minimal flicker/ghosting, and clean frame-to-frame transitions.

  • Resolution options Output in 720p or 1080p for high-definition results.

  • Extended duration Generate videos up to 20 seconds long.

Parameters

ParameterRequiredDescription
imageYesSource image to animate
promptYesDescribe the motion, action, and audio cues
resolutionNoOutput resolution: 720p or 1080p
durationNoVideo length: 4, 8, 12, 16, or 20 seconds

How to Use

  1. Upload your image — the still photo you want to animate.
  2. Write your prompt — describe the action, motion, camera movement, and audio.
  3. Select resolution — 720p or 1080p.
  4. Set duration — choose 4, 8, 12, 16, or 20 seconds.
  5. Submit — generate, preview, and download your video.

Pricing

Duration720p1080p
4 s$1.20$2.00
8 s$2.40$4.00
12 s$3.60$6.00
16 s$4.80$8.00
20 s$6.00$10.00

Billing Rules

  • 720p rate: $0.30 per second
  • 1080p rate: $0.50 per second
  • Duration options: 4, 8, 12, 16, or 20 seconds

Best Use Cases

  • Premium photo animation — Bring still photos to life with cinema-quality motion.
  • Commercial production — High-resolution output for professional marketing.
  • Art animation — Transform illustrations into broadcast-quality videos.
  • Product showcases — Animate product images for premium presentations.
  • Storytelling — Build cinematic narratives from key visual moments.

Pro Tips

  • Be specific about motion in your prompt for better results.
  • Include audio cues in your prompt for synchronized sound.
  • Higher resolution source images produce better output.
  • Use 1080p for final production, 720p for faster iteration.
  • Start with shorter durations to test your prompt.

Notes

Related Models

无障碍:本网站使用的 AI 模型由第三方提供。

Sora 2 Image To Video Pro API — Quick start

Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/openai/sora-2/image-to-video-pro with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status flips to completed, then read the output URL from data.outputs[0]. Examples for Sora 2 Image To Video Pro below.

HTTP example
# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/openai/sora-2/image-to-video-pro" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY" \
  -d '{
    "prompt": "A cinematic shot of a city at sunset, soft golden light",
    "image": "https://example.com/your-input.jpg",
    "resolution": "720p",
    "duration": 4
}'

# Response includes a prediction id. Poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY"

# When status is "completed", read the output from data.outputs[0].
Node.js example
// npm install wavespeed
const WaveSpeed = require('wavespeed');

const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env

const result = await client.run("openai/sora-2/image-to-video-pro", {
        "prompt": "A cinematic shot of a city at sunset, soft golden light",
        "image": "https://example.com/your-input.jpg",
        "resolution": "720p",
        "duration": 4
});

console.log(result.outputs[0]); // → URL of the generated output
Python example
# pip install wavespeed
import wavespeed

output = wavespeed.run(
    "openai/sora-2/image-to-video-pro",
    {
    "prompt": "A cinematic shot of a city at sunset, soft golden light",
    "image": "https://example.com/your-input.jpg",
    "resolution": "720p",
    "duration": 4
}
)

print(output["outputs"][0])  # → URL of the generated output

Sora 2 Image To Video Pro API — Frequently asked questions

What is the Sora 2 Image To Video Pro API?

Sora 2 Image To Video Pro is a OpenAI model for video generation from images, exposed as a REST API on WaveSpeedAI. OpenAI Sora 2 Image-to-Video Pro creates physics-aware, realistic videos with synchronized audio and greater steerability. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing. You can call it programmatically or try it from the playground above.

How do I call the Sora 2 Image To Video Pro API?

POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/openai/openai-sora-2-image-to-video-pro.

How much does Sora 2 Image To Video Pro cost per run?

Sora 2 Image To Video Pro starts at $1.20 per run. That figure is the base price — the final charge scales with the parameters you set in the form (output size, length, count, references, or whatever knobs this model exposes), so a higher-quality or larger output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.

What inputs does Sora 2 Image To Video Pro accept?

Key inputs: `prompt`, `image`, `resolution`, `duration`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/openai/openai-sora-2-image-to-video-pro.

How long does Sora 2 Image To Video Pro take to generate?

Average end-to-end generation time on WaveSpeedAI is around 271 seconds per request — measured across recent runs. Queue time scales with global demand; live status is visible in the prediction record.

Can I use Sora 2 Image To Video Pro outputs commercially?

Commercial usage rights depend on the model's license, set by its provider (OpenAI). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.