Video 02 | Fast Image-to-Video API


Hailuo 02 is an AI video generation model fine-tuned for ultra-clear 1080P output and for handling complex physics-driven scenes. Ready-to-use REST inference API: best performance, no cold starts, affordable pricing.

image-to-video


$0.25 per run · ~40 runs per $10

Examples

Circus Scene. The camera follows a clown riding a unicycle while juggling balls. The camera pulls back, tracks left, and tilts left.

Circus Scene. The character in the frame juggles throwing knives with fast and fluid motion.

Circus Scene. A lion jumps through a flaming hoop. The camera pulls back and follows, then tracks the lion running through the circus ring.

Circus Scene. A bear in a vest does three consecutive somersaults on a mat; the movements are smooth and fast.

Circus Scene. The character waves a torch quickly, the camera pushes in for a close-up of their face.

Circus Scene. Acrobatic performance: a performer swings rapidly on an aerial hoop, executing high-difficulty moves as the camera follows.

A black cat leaps across rooftops under a full moon. [Side-scrolling style], reminiscent of a 2D platformer game. The rooftops have different textures—tiles, bricks, wooden planks. The background slowly scrolls to reveal more of the stylized nighttime cityscape.

Inside a massive library with impossibly tall shelves, books fly off on their own, fluttering like birds. A young girl with round glasses floats gently upward, reaching for a glowing book at the top. Warm golden light pours through stained-glass windows.

A sad clown in vintage makeup stands alone under a dim spotlight in an empty circus tent, fog swirling around his shoes, camera slowly zooming in on his teary eyes. Cinematic lighting, melancholic tone.

A fashion editorial scene set in a barren desert. A tall woman in a futuristic metallic dress walks down a cracked salt flat. The sky is bright white, almost overexposed, and the dress reflects the environment like liquid metal. [Slow panning shot] captures her silhouette against the endless horizon.

A female gymnast performs a flawless floor routine under dramatic spotlight, slow-motion close-ups highlight her elegant spins, flips, and landings, cinematic lighting and smooth camera transitions


Related models

video-01: image-to-video
speech-2.5-hd-preview: text-to-audio
voice-design: text-to-audio
voice-clone: audio-to-audio
speech-02-turbo: text-to-audio
speech-02-hd: text-to-audio

README

MiniMax Video-02 — minimax/video-02

MiniMax Video-02 generates short video clips from a text prompt, with optional image guidance. Describe the subject, action, scene, and camera movement, and the model produces a coherent, motion-rich clip suitable for story beats, ads, and creative prototyping.

Key capabilities

Text-to-video generation with strong motion and scene coherence
Optional image input to anchor composition and style
Camera-direction friendly prompting (follow, pull back, track, tilt, orbit)
Prompt expansion option to automatically enhance prompts and enable the safety checker

Use cases

Cinematic shot generation (camera moves, blocking, atmosphere)
Storyboarding and pre-visualization for short scenes
Marketing creatives and social clips with clear action cues
Image-guided variants (keep a reference look while changing motion/camera)

Pricing

Resolution	Price per video
720p	$0.0625
1080p	$0.11
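
As a rough illustration of the table above, the sketch below estimates batch cost and how many videos fit in a budget. The prices are copied from the table; the helper names `estimate_cost` and `videos_within_budget` are hypothetical, and the actual charge is whatever the platform records on the prediction.

```python
from decimal import Decimal

# Per-video prices copied from the pricing table (USD).
PRICE_PER_VIDEO = {
    "720p": Decimal("0.0625"),
    "1080p": Decimal("0.11"),
}


def estimate_cost(resolution: str, count: int = 1) -> Decimal:
    """Return the estimated USD cost of `count` videos at `resolution`."""
    if count < 0:
        raise ValueError("count must be non-negative")
    try:
        unit_price = PRICE_PER_VIDEO[resolution]
    except KeyError:
        raise ValueError(f"no listed price for resolution {resolution!r}") from None
    return unit_price * count


def videos_within_budget(resolution: str, budget_usd: str) -> int:
    """Return how many whole videos fit in `budget_usd` at the listed price."""
    return int(Decimal(budget_usd) // PRICE_PER_VIDEO[resolution])


print(estimate_cost("720p", 16))            # 16 * 0.0625
print(videos_within_budget("1080p", "10"))  # floor(10 / 0.11)
```

Decimal arithmetic is used so that small per-video prices do not pick up binary floating-point rounding error when multiplied.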

Parameters

prompt (required): What happens in the video (subject, action, scene, camera, style)
image (optional): Reference image to guide composition/style
resolution: Output resolution (e.g., 720p, 1080p)
duration: Video length (seconds)
enable_prompt_expansion: Enhances the prompt automatically and enables the safety checker
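
A minimal sketch of assembling these parameters into a request body. The helper `build_input` is hypothetical, not part of any SDK. It enforces only what the list above states (prompt is required) and leaves out unset optional fields, on the assumption that the service then applies its own defaults.

```python
from typing import Any, Dict, Optional


def build_input(
    prompt: str,
    image: Optional[str] = None,
    resolution: Optional[str] = None,
    duration: Optional[int] = None,
    enable_prompt_expansion: Optional[bool] = None,
) -> Dict[str, Any]:
    """Assemble the JSON input, omitting optional fields that were not set."""
    if not prompt or not prompt.strip():
        raise ValueError("prompt is required and must not be empty")
    payload: Dict[str, Any] = {"prompt": prompt.strip()}
    optional = {
        "image": image,
        "resolution": resolution,
        "duration": duration,
        "enable_prompt_expansion": enable_prompt_expansion,
    }
    # Send only the fields the caller set; explicit False is kept, None is dropped.
    payload.update({k: v for k, v in optional.items() if v is not None})
    return payload


print(build_input("A lion jumps through a flaming hoop", duration=6))
```

Allowed values for resolution and duration are not validated here because the list above does not enumerate them; the API reference is the place to check them.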

Prompting tips

Lead with action verbs (runs, spins, juggles, turns, laughs), then add camera language (pull back, track left, tilt up).
Keep one “main event” per clip; add atmosphere as a second layer (dust, fog, rim light, crowd).
If using an image, state what must stay consistent (character identity, outfit, composition) and what should change (motion, camera path, mood).
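
One way to apply the tips above consistently is a small prompt template. The sketch below is purely illustrative: `compose_prompt` and its sentence layout are assumptions, not a format the model requires.

```python
from typing import Sequence


def compose_prompt(
    subject: str,
    action: str,
    camera: Sequence[str] = (),
    atmosphere: Sequence[str] = (),
) -> str:
    """Build a prompt: main event first, then camera moves, then atmosphere."""
    parts = [f"{subject.strip()} {action.strip()}."]
    if camera:
        parts.append("The camera " + ", then ".join(camera) + ".")
    if atmosphere:
        parts.append("Atmosphere: " + ", ".join(atmosphere) + ".")
    return " ".join(parts)


print(compose_prompt(
    "A clown",
    "juggles balls while riding a unicycle",
    camera=["pulls back", "tracks left"],
    atmosphere=["fog", "rim light"],
))
```

Keeping the action in its own leading sentence makes it easy to check that each clip has a single main event before any camera or atmosphere detail is layered on.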


Video 02 API — Quick start

Get a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/minimax/video-02 with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status is completed, then read the output URL from data.outputs[0]. The examples below show this flow for Video 02.

HTTP example

# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/minimax/video-02" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY" \
  -d '{
    "prompt": "A cinematic shot of a city at sunset, soft golden light",
    "image": "https://example.com/your-input.jpg",
    "resolution": "768p",
    "duration": 6,
    "enable_prompt_expansion": false
}'

# Response includes a prediction id. Poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY"

# When status is "completed", read the output from data.outputs[0].
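
The polling step above can be sketched in Python with only the standard library. This is a sketch under stated assumptions: the result endpoint and the "completed" status come from the example above, while the response layout (`status` and `outputs` nested under `data`), the "failed" status name, and the `error` field are guesses to check against the API reference. The helper names are hypothetical.

```python
import json
import time
import urllib.request
from typing import Any, Callable, Dict

RESULT_URL = "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result"


def fetch_result(request_id: str, api_key: str) -> Dict[str, Any]:
    """GET the prediction result once and return the decoded JSON body."""
    req = urllib.request.Request(
        RESULT_URL.format(request_id=request_id),
        headers={"Authorization": f"Bearer {api_key}"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


def wait_for_output(
    fetch: Callable[[], Dict[str, Any]],
    interval_s: float = 5.0,
    timeout_s: float = 600.0,
) -> str:
    """Poll `fetch` until status is "completed"; return data.outputs[0]."""
    deadline = time.monotonic() + timeout_s
    while True:
        data = fetch()["data"]  # assumed: fields are nested under "data"
        if data["status"] == "completed":
            return data["outputs"][0]
        if data["status"] == "failed":  # assumed failure status name
            raise RuntimeError(f"prediction failed: {data.get('error')}")
        if time.monotonic() >= deadline:
            raise TimeoutError("prediction did not complete in time")
        time.sleep(interval_s)
```

A call would look like `wait_for_output(lambda: fetch_result(request_id, api_key))`. Passing the fetch step in as a callable keeps the loop testable without network access.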

Node.js example

// npm install wavespeed
const WaveSpeed = require('wavespeed');

const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env

// CommonJS modules have no top-level await, so run the call inside an async function.
async function main() {
  const result = await client.run("minimax/video-02", {
    "prompt": "A cinematic shot of a city at sunset, soft golden light",
    "image": "https://example.com/your-input.jpg",
    "resolution": "768p",
    "duration": 6,
    "enable_prompt_expansion": false
  });

  console.log(result.outputs[0]); // → URL of the generated output
}

main().catch((err) => {
  console.error(err);
  process.exit(1);
});

Python example

# pip install wavespeed
import wavespeed

output = wavespeed.run(
    "minimax/video-02",
    {
        "prompt": "A cinematic shot of a city at sunset, soft golden light",
        "image": "https://example.com/your-input.jpg",
        "resolution": "768p",
        "duration": 6,
        "enable_prompt_expansion": False,
    },
)

print(output["outputs"][0])  # → URL of the generated output

Video 02 API — Frequently asked questions

What is the Video 02 API?

Video 02 (Hailuo 02) is a MiniMax model for video generation from images, exposed as a REST API on WaveSpeedAI. It is tuned for 1080P output and complex physics-driven scenes. You can call it programmatically or try it from the playground above.

How do I call the Video 02 API?

POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/minimax/minimax-video-02.
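
For callers who prefer raw HTTP over an SDK, the submission step described above might look like the following standard-library sketch. The endpoint URL and headers are the ones shown in the quick start; the helper names are hypothetical, and the location of the prediction id in the response (`data.id`) is an assumption to confirm against the API reference.

```python
import json
import urllib.request
from typing import Any, Dict

SUBMIT_URL = "https://api.wavespeed.ai/api/v3/minimax/video-02"


def build_submit_request(api_key: str, payload: Dict[str, Any]) -> urllib.request.Request:
    """Prepare (but do not send) the POST that submits a prediction."""
    return urllib.request.Request(
        SUBMIT_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def submit(api_key: str, payload: Dict[str, Any]) -> str:
    """Send the request and return the prediction id."""
    with urllib.request.urlopen(build_submit_request(api_key, payload)) as resp:
        body = json.load(resp)
    return body["data"]["id"]  # assumed location of the prediction id
```

Separating request construction from sending makes it possible to inspect the URL, headers, and body before spending a paid run.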

How much does Video 02 cost per run?

Video 02 starts at $0.25 per run. That figure is the base price; the final charge scales with the parameters you set in the form, such as output resolution and duration, so a higher-quality or longer output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.

What inputs does Video 02 accept?

Key inputs: `prompt`, `image`, `resolution`, `duration`, `enable_prompt_expansion`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/minimax/minimax-video-02.

How long does Video 02 take to generate?

Average end-to-end generation time on WaveSpeedAI is around 105 seconds per request, measured across recent runs. Queue time scales with global demand; live status is visible in the prediction record.

Can I use Video 02 outputs commercially?

Commercial usage rights depend on the model's license, set by its provider (MiniMax). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.