Hailuo 2.3 is a text-to-video model creating physics-aware 768p videos with 2.5× efficiency and 85% complex instruction response rate. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.
Idle
$0.28per run·~35 / $10
Camera: An extreme low-angle, shaky cam shot, looking up as a massive alien dreadnought descends onto a barren red planet. The camera is pushed back slightly by intense wind and vibration. Effect: The landing thrusters kick up a colossal plume of red dust and sand, creating a shockwave. The intense plasma exhaust from the engines superheats the ground, causing it to visibly vitrify (turn to glass). Sounds: An extremely deep, chest-thumping bass rumble (LFE), mixed with a high-pitched engine whine and storm-like wind howling. The grating sound of metal on rock. Mood: Epic, awe-inspiring, oppressive, and ominous. Lighting: Harsh, unnatural blue light from the ship's engines contrasts with the dim, dusty red daylight of the planet, creating stark, moving shadows.
Camera: A slow, ethereal crane shot begins high above, looking down as a ballerina begins an elegant pirouette on a shimmering, moonlit stage. The camera slowly descends and gently orbits her as she spins. Effect: Her white tutu shimmers with a faint, magical stardust glow that gently drifts upwards. As she extends her arm, a delicate, translucent ripple effect emanates from her fingertips. Sounds: Soft, ambient orchestral music with light, echoing piano notes. A gentle chime sound effect as the stardust appears, and a subtle whoosh with the ripple. Mood: Graceful, enchanting, and otherworldly. Like a dance within a dream. Lighting: Soft, cool moonlight, subtle spotlight on the dancer, with a magical, diffused glow from the stage floor.
Camera: A stable, slight downward close-up on a young person's face. They are leaning against a window, and their gaze slowly drifts from the window to the middle distance. Effect: The person's eyes become glossy and wet, but no tears fall. They bite their lower lip for a moment, and their brow furrows slightly. Raindrops slowly trace paths down the window glass, reflecting on their skin. Sounds/Voices: A soft, melancholic piano melody. The patter of rain against the glass. A single, barely audible sigh. Mood: Melancholy, contemplative, vulnerable, and introspective. Lighting: Soft, cool, diffused natural light from the overcast window, giving their skin a pale and somber tone.
Camera: An extreme macro "probe lens" shot that travels over the intricate gears of a luxury watch. The camera slowly rotates as the gears turn. Effect: The tiny gears and ruby jewels of the watch mechanism turn in perfect, mesmerizing synchronization. A single ray of light creates a "star gleam" sparkle on a polished screw head. Sounds/Voices: A crisp, clean, rhythmic "tick-tock" sound, slightly reverberated. A soft, sophisticated classical music swell (e.g., a single cello note). Mood: Elegant, precise, luxurious, and mesmerizing. Lighting: Clean, high-key studio light that highlights every metallic texture. The "star gleam" is the key lighting effect.
Camera: Fast dolly-in as three cartoon ghosts burst into frame from different sides, bumping into each other mid-air. Effect: Their glowing blue trails collide, forming the word "BOO CREW!" in neon ghost-light with a quick sparkle explosion. Voices: Each ghost shouts “BOO!” in different tones — one deep, one squeaky, one delayed — followed by giggles. Mood: Chaotic and funny, like a ghost boy band intro. Lighting: Electric blue glows, cartoon bounce light, subtle smoke bursts.
Hailuo 2.3 Standard is the latest generation of AI video creation models, featuring advanced physics rendering and cinematic-grade scene transitions. Built for both creators and professionals, it combines high fidelity, reliability, and cost efficiency, outperforming many closed or premium video generation systems.
| Duration | Cost per Job | Max Resolution |
|---|---|---|
| 6 seconds | $0.23 | 768p |
| 10 seconds | $0.56 | 768p |
Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/minimax/hailuo-2.3/t2v-standard with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status flips to completed, then read the output URL from data.outputs[0]. Examples for Hailuo 2.3 T2v Standard below.
# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/minimax/hailuo-2.3/t2v-standard" \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $WAVESPEED_API_KEY" \
-d '{
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"duration": 6,
"enable_prompt_expansion": true
}'
# Response includes a prediction id. Poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
-H "Authorization: Bearer $WAVESPEED_API_KEY"
# When status is "completed", read the output from data.outputs[0].// npm install wavespeed
const WaveSpeed = require('wavespeed');
const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env
const result = await client.run("minimax/hailuo-2.3/t2v-standard", {
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"duration": 6,
"enable_prompt_expansion": true
});
console.log(result.outputs[0]); // → URL of the generated output# pip install wavespeed
import wavespeed
output = wavespeed.run(
"minimax/hailuo-2.3/t2v-standard",
{
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"duration": 6,
"enable_prompt_expansion": true
}
)
print(output["outputs"][0]) # → URL of the generated outputHailuo 2.3 T2v Standard is a MiniMax model for video generation, exposed as a REST API on WaveSpeedAI. Hailuo 2.3 is a text-to-video model creating physics-aware 768p videos with 2.5× efficiency and 85% complex instruction response rate. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing. You can call it programmatically or try it from the playground above.
POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/minimax/minimax-hailuo-2.3-t2v-standard.
Hailuo 2.3 T2v Standard starts at $0.28 per run. That figure is the base price — the final charge scales with the parameters you set in the form (output size, length, count, references, or whatever knobs this model exposes), so a higher-quality or larger output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.
Key inputs: `prompt`, `duration`, `enable_prompt_expansion`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/minimax/minimax-hailuo-2.3-t2v-standard.
Average end-to-end generation time on WaveSpeedAI is around 140 seconds per request — measured across recent runs. Queue time scales with global demand; live status is visible in the prediction record.
Commercial usage rights depend on the model's license, set by its provider (MiniMax). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.