MiniMax Hailuo 2.3 Pro is an image-to-video model for ultra-clear 1080p output and physics-aware scenes with responsive rendering. It is served through a ready-to-use REST inference API with no cold starts and affordable pricing.
$0.49 per run · ~20 runs per $10
A passionate male and female dance duo performing a fiery, high-energy Latin dance routine (like Salsa or Cha-Cha). They execute fast, intricate footwork and sensual, rhythmic hip movements with visible chemistry. The man leads the woman through a series of rapid spins and sharp turns, culminating in a dramatic dip where her hair fans out, all under a warm, intense spotlight on a wooden dance floor. Cinematic, passionate, rhythmic, energetic, dynamic lighting, professional dancers, fiery chemistry.
Camera: A smooth, slow 360-degree orbit shot around the floating earbud (Part 1). The camera slowly dollies in, focusing on the brushed metallic texture. As it gets extremely close, the earbud moves forward, filling the frame. We match-cut to an intimate close-up of a person's ear as their hand brings the same earbud into frame and inserts it. The camera then pulls back slightly to show their satisfied profile. Effect: (Part 1) The earbud rotates weightlessly; light streaks orbit it. (Part 2) As the camera pushes through, the background cross-fades from the abstract purple/blue gradient to a soft-focus, stylish apartment. The person smiles gently and closes their eyes as they put the earbud in, signifying immersion. Sounds/Voices: The minimalist electronic track swells during the transition. As the earbud is inserted, a soft, satisfying "chime" (like a 'connected' sound) plays, perfectly synced with a subtle bass drop. The music then continues, now feeling warmer and more ambient. Mood: Begins as "Premium, futuristic, sophisticated," then transitions to "Personal, immersive, comfortable, and satisfying." Lighting: (Part 1) Polished, cool studio lighting. (Part 2) Transitions to warm, soft, indoor lifestyle lighting (e.g., soft morning light from a window) that casts a gentle glow on the person's face. The earbud itself still catches a perfect, clean highlight.
Motion: The ballerina begins in a balanced pose, then smoothly transitions into a slow turn, extending one leg outward with perfect control. Her dress flows naturally with the motion. The camera performs a slow circular dolly, following her movement from side to front, maintaining full visibility of her body. Camera: Stable and smooth motion with perfect temporal coherence. Focus automatically adjusts to keep her body in sharp detail as the camera moves. No jitter, flicker, or limb distortion — the dancer remains fully visible at all times. Lighting: Spotlight dynamically adjusts as she turns, creating shifting highlights on her face and body. The floor reflections and subtle stage fog enhance realism. Sound: Soft piano melody with ambient stage reverb and delicate footstep sounds. Style: Cinematic realism, graceful body motion, balanced anatomy, and smooth camera synchronization — fully showcasing Hailuo 2.3’s superior handling of human structure and motion coherence.
A soldier, his face streaked with mud and sweat, crouches low in a narrow, muddy trench. His gaze is intensely focused down the line. Suddenly, a massive explosion erupts just behind him, illuminating the entire trench with a violent, fiery flash. Sparks, smoke, and debris are blasted past him in a chaotic, dramatic wave (possibly in slow-motion). The soldier braces his body against the concussion, flinching but heroically holding his ground. He tightens his grip on his rifle, his knuckles white, as dirt rains down from above. His eyes never leave his target, showcasing unwavering focus amidst the chaos. The camera holds a tight, tense medium shot on him, capturing the raw intensity of the moment. Cinematic, blockbuster war movie, gritty, high-action, dramatic lighting, palpable tension, high-stakes, realistic
A martial arts grandmaster, with flowing white hair and a focused expression, executes a rapid sequence of Tai Chi-inspired wushu movements. His body shifts with precise, powerful fluidity, channeling an inner, glowing cyan energy that intensifies around his rapidly moving hands and forearms. Each strike and block is imbued with explosive, controlled force. The setting remains a sleek, futuristic sci-fi corridor, with vibrant, pulsating neon blue and orange lights accentuating the speed and power of his actions. The lighting is dramatic and high-contrast, highlighting muscle definition and dynamic motion. Cinematic, action-packed, powerful, intense, high-energy, ethereal glow, motion blur on limbs.
Hailuo 2.3 Pro is MiniMax's most advanced image-to-video generation model. It turns static images into cinematic, full-HD 1080p videos with realistic, seamless motion and physics-aware animation. Built for creators, studios, and digital storytellers, it combines next-generation motion rendering with precise visual fidelity. Each generated clip is 5 seconds long.
| Duration | Resolution | Cost per Job |
|---|---|---|
| 5 seconds | 1080p | $0.49 |
“Nothing else feels as cinematic straight out of the box — it just moves naturally.”
“The motion and lighting look like real camera work. Perfect for film concepts and game trailers.”
“The best image-to-video balance so far — clean details, zero flicker, amazing physics.”
Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/minimax/hailuo-2.3/i2v-pro with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status flips to completed, then read the output URL from data.outputs[0]. Examples for Hailuo 2.3 I2v Pro below.
# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/minimax/hailuo-2.3/i2v-pro" \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $WAVESPEED_API_KEY" \
-d '{
"image": "https://example.com/your-input.jpg",
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"enable_prompt_expansion": true
}'
# Response includes a prediction id. Poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
-H "Authorization: Bearer $WAVESPEED_API_KEY"
# When status is "completed", read the output from data.outputs[0].

// npm install wavespeed
const WaveSpeed = require('wavespeed');
const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env

// CommonJS modules have no top-level await, so run the call inside an async function.
async function main() {
  const result = await client.run("minimax/hailuo-2.3/i2v-pro", {
    "image": "https://example.com/your-input.jpg",
    "prompt": "A cinematic shot of a city at sunset, soft golden light",
    "enable_prompt_expansion": true
  });
  console.log(result.outputs[0]); // → URL of the generated output
}

main().catch(console.error);

# pip install wavespeed
import wavespeed
output = wavespeed.run(
"minimax/hailuo-2.3/i2v-pro",
{
"image": "https://example.com/your-input.jpg",
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"enable_prompt_expansion": true
}
)
print(output["outputs"][0])  # → URL of the generated output

Hailuo 2.3 I2v Pro is a MiniMax model for video generation from images, exposed as a REST API on WaveSpeedAI. You can call it programmatically or try it from the playground above.
POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/minimax/minimax-hailuo-2.3-i2v-pro.
Hailuo 2.3 I2v Pro starts at $0.49 per run. That figure is the base price; the final charge scales with the parameters you set in the form (such as output size, length, or count, where the model exposes them), so a higher-quality or larger output can cost more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.
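As a rough budgeting aid, the arithmetic behind the "~20 runs per $10" figure can be sketched as follows. This assumes every run is billed at the flat $0.49 base price; the helper names `runs_for_budget` and `cost_for_runs` are illustrative and not part of any WaveSpeedAI SDK.

```python
from decimal import Decimal


def runs_for_budget(budget_usd: str, price_per_run_usd: str = "0.49") -> int:
    """Whole number of runs a budget covers at a flat per-run price (assumed)."""
    budget = Decimal(budget_usd)
    price = Decimal(price_per_run_usd)
    if price <= 0:
        raise ValueError("price_per_run_usd must be positive")
    # Floor division: a partially funded run does not count.
    return int(budget // price)


def cost_for_runs(runs: int, price_per_run_usd: str = "0.49") -> Decimal:
    """Total cost of a number of runs at the same flat price."""
    return Decimal(price_per_run_usd) * runs


print(runs_for_budget("10"))  # 20
print(cost_for_runs(20))      # 9.80
```

Decimal arithmetic is used instead of floats so that cent values stay exact. If a request is billed above the base price, treat these numbers as an upper bound on run count.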
Key inputs: `prompt`, `image`, `enable_prompt_expansion`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/minimax/minimax-hailuo-2.3-i2v-pro.
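For readers not using the SDK, here is a minimal sketch of how the three key inputs map onto a raw HTTP request, using only the Python standard library. The URL, headers, and field names come from the cURL example on this page; the function name `build_submit_request` and the default for `enable_prompt_expansion` are assumptions made for illustration. The request is only constructed here, not sent.

```python
import json
import urllib.request

ENDPOINT = "https://api.wavespeed.ai/api/v3/minimax/hailuo-2.3/i2v-pro"


def build_submit_request(api_key, image, prompt, enable_prompt_expansion=True):
    """Build (but do not send) the POST request for a new prediction."""
    payload = {
        "image": image,    # URL of the source image
        "prompt": prompt,  # text description of the desired motion
        "enable_prompt_expansion": enable_prompt_expansion,
    }
    return urllib.request.Request(
        ENDPOINT,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


req = build_submit_request(
    "YOUR_API_KEY",
    "https://example.com/your-input.jpg",
    "A cinematic shot of a city at sunset, soft golden light",
)
print(req.get_method(), req.full_url)
print(json.loads(req.data))
```

Passing `req` to `urllib.request.urlopen` would submit the job. Check the API reference for the authoritative types and defaults before relying on this shape.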
Average end-to-end generation time on WaveSpeedAI is around 192 seconds per request — measured across recent runs. Queue time scales with global demand; live status is visible in the prediction record.
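With generation averaging around 192 seconds, a polling loop needs a timeout comfortably above that figure. The sketch below shows one way to structure it. It assumes the result JSON carries the status at `data.status` and that a `"failed"` status exists; only `"completed"` and `data.outputs[0]` are confirmed by this page. `wait_for_output` and its `fetch_result` callback are hypothetical names, and the callback stands in for whatever HTTP call fetches the prediction result.

```python
import time


def wait_for_output(fetch_result, timeout_s=600.0, interval_s=5.0,
                    sleep=time.sleep, clock=time.monotonic):
    """Poll until the prediction completes, then return the first output URL.

    fetch_result: zero-argument callable returning the parsed result JSON.
    """
    deadline = clock() + timeout_s
    while True:
        data = fetch_result()["data"]
        status = data.get("status")  # assumed location of the status field
        if status == "completed":
            return data["outputs"][0]
        if status == "failed":  # assumed failure status name
            raise RuntimeError(f"prediction failed: {data}")
        if clock() >= deadline:
            raise TimeoutError(f"no result after {timeout_s} seconds")
        sleep(interval_s)


# Demo with canned responses instead of real HTTP calls.
responses = iter([
    {"data": {"status": "processing", "outputs": []}},
    {"data": {"status": "completed",
              "outputs": ["https://example.com/video.mp4"]}},
])
url = wait_for_output(lambda: next(responses), sleep=lambda s: None)
print(url)
```

Injecting `fetch_result`, `sleep`, and `clock` keeps the loop testable without network access. In real use, `fetch_result` would issue the GET request against the prediction result endpoint shown in the cURL example.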
Commercial usage rights depend on the model's license, set by its provider (MiniMax). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.