Minimax Hailuo 02 Standard

Hailuo 02 Standard - MiniMax's next-generation AI video model with 2.5x efficiency improvement, 85% complex instruction response rate, and industry-leading cost-effectiveness for generating high-quality videos.

Features

Hailuo 02 Standard - Professional AI Video Model

Hailuo 02 Standard is MiniMax's professional-grade AI video generation model and a significant upgrade from Hailuo 01. It currently ranks #2 globally in both image-to-video and text-to-video benchmarks, ahead of Kuaishou's Kling and Google's Veo3 and behind only ByteDance's recently released Seedance 1.0.

🚀 Model Highlights

Industry-Leading Performance

2.5x Efficiency Boost: Both training and inference efficiency improved by a factor of 2.5
3x Model Parameters: Significantly enhanced model capacity
4x Training Data: Massive dataset expansion for superior quality
85% Complex Instruction Response Rate: Exceptional understanding of intricate prompts

Architectural Innovation

Hailuo 02 Standard features a completely redesigned DiT (Diffusion Transformer) architecture, abandoning the previous framework for a more efficient and powerful system that delivers:

Enhanced temporal consistency
Superior motion dynamics
Exceptional physical realism

Professional Quality

25 FPS Smooth Output: Cinema-grade frame rate
Professional Color Grading: Enhanced color accuracy and depth

🎯 Key Features

Advanced Capabilities

Extreme Physics Simulation: Generates complex physical scenarios like acrobatics, fluid dynamics, and intricate movements
Cinematic Camera Control: Professional camera movements including panning, tilting, tracking, and complex trajectories
Multi-Style Support: From photorealistic to artistic, anime to documentary styles
Consistent Character Generation: Maintains character appearance throughout the video

💡 Application Scenarios

Film & Television Production

Rapidly generate complex VFX shots, including acrobatics, fantasy scenes, and challenging physical performances, dramatically reducing production costs and time.

Advertising & Creative

Provide brands with cost-effective, high-quality video content that meets diverse creative requirements while maintaining professional standards.

Content Creation

Empower creators and influencers to produce engaging video content efficiently, enhancing productivity without compromising quality.

Educational Entertainment

Generate educational animations, virtual performances, and engaging content that combines learning with entertainment value.

Corporate Communications

Offer SMEs affordable promotional videos that elevate brand image and market competitiveness without breaking the budget.

📊 Technical Specifications

Video Duration: 6 seconds by default, or 10 seconds via the duration parameter (with plans for extended duration)
Frame Rate: 25 fps
Supported Formats: MP4, MOV
Input Types: Text prompts, reference images
Processing Time: Optimized for rapid generation

🔧 Usage Guidelines

Best Practices

Detailed Prompts: Leverage the 85% complex instruction response rate with comprehensive descriptions
High-Quality References: Use clear, high-resolution images for image-to-video generation
Style Consistency: Specify desired artistic style for coherent results
Physics Descriptions: Take advantage of advanced physics capabilities with specific motion descriptions

Limitations

Current maximum duration: 10 seconds (the duration parameter accepts 6 or 10; the default is 6)
Output quality depends on input prompt/image quality
Designed for creative synthesis, not documentary accuracy

🛡️ Responsible Use

This model must not be used for:

Generating harmful, illegal, or deceptive content
Creating non-consensual or inappropriate material
Violating privacy or intellectual property rights
Spreading misinformation or propaganda
Any activity violating local or international laws

🌟 Why Choose Hailuo 02 Standard?

Performance Leader: #2 globally, surpassing established competitors
Technical Excellence: 2.5x efficiency with 3x parameters
Versatility: Handles extreme complexity with ease
Future-Ready: Continuous improvements and feature expansions

Experience professional-grade AI video generation with Hailuo 02 Standard - where cutting-edge technology meets practical efficiency.

Authentication

For authentication details, please refer to the Authentication Guide.

API Endpoints

Submit Task & Query Result


# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/minimax/hailuo-02/standard" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
    "prompt": "A graceful female ballet dancer performs under warm, golden light in a spacious sunlit studio with tall windows. She wears a flowing white tutu and soft pink pointe shoes, her arms extended elegantly as she spins mid-air in a grand jeté. The polished wooden floor reflects soft rays of sunlight. Dust particles shimmer in the air, adding a magical, uplifting mood. The color palette is bright and pastel—light peach, cream, and sky blue—evoking a dreamy, cinematic atmosphere full of energy and elegance.",
    "duration": 6,
    "enable_prompt_expansion": true
}'

# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"
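
The same two calls can be sketched in Python using only the standard library. This is a minimal illustration, not an official client: the helper names `build_submit_request` and `build_result_request` are hypothetical, while the endpoints, headers, and body fields are copied from the curl commands above. The requests are only constructed here, not sent.

```python
import json
import urllib.request

API_BASE = "https://api.wavespeed.ai/api/v3"  # taken from the curl examples


def build_submit_request(api_key, prompt, duration=6, enable_prompt_expansion=True):
    """Build (but do not send) the POST request that submits a task."""
    body = {
        "prompt": prompt,
        "duration": duration,
        "enable_prompt_expansion": enable_prompt_expansion,
    }
    return urllib.request.Request(
        f"{API_BASE}/minimax/hailuo-02/standard",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def build_result_request(api_key, request_id):
    """Build (but do not send) the GET request that fetches a task's result."""
    return urllib.request.Request(
        f"{API_BASE}/predictions/{request_id}/result",
        headers={"Authorization": f"Bearer {api_key}"},
        method="GET",
    )
```

To actually send a request, pass it to `urllib.request.urlopen` and decode the JSON body, for example `json.load(urllib.request.urlopen(req))`.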

Parameters

Task Submission Parameters

Request Parameters

Parameter	Type	Required	Default	Range	Description
prompt	string	Yes	A graceful female ballet dancer performs under warm, golden light in a spacious sunlit studio with tall windows. She wears a flowing white tutu and soft pink pointe shoes, her arms extended elegantly as she spins mid-air in a grand jeté. The polished wooden floor reflects soft rays of sunlight. Dust particles shimmer in the air, adding a magical, uplifting mood. The color palette is bright and pastel—light peach, cream, and sky blue—evoking a dreamy, cinematic atmosphere full of energy and elegance.	-	Text description of the video to generate.
image	string	No	-	-	Image used as the first frame of the generated video. Accepts a Base64-encoded string in data:image/jpeg;base64,{data} format or a URL accessible via the public network. The image must meet the following conditions: format is JPG/JPEG/PNG; aspect ratio is greater than 2:5 and less than 5:2; short side is greater than 300px; file size does not exceed 20MB.
duration	integer	No	6	6, 10	Video duration in seconds
enable_prompt_expansion	boolean	No	true	-	When enabled, the model automatically optimizes the incoming prompt to improve generation quality.
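
The constraints in the table can be checked on the client before a task is submitted. The sketch below is illustrative only: `validate_duration` and `validate_image` are hypothetical helpers, the caller is assumed to already know the image's format, pixel dimensions, and byte size, and "20MB" is read as 20 × 1024 × 1024 bytes, which the table does not specify.

```python
ALLOWED_DURATIONS = (6, 10)               # "Range" column for `duration`
ALLOWED_FORMATS = {"jpg", "jpeg", "png"}  # JPG/JPEG/PNG
MAX_IMAGE_BYTES = 20 * 1024 * 1024        # assumption: "20MB" read as 20 MiB


def validate_duration(duration):
    """Return a list of problems with the `duration` value (empty if valid)."""
    if duration not in ALLOWED_DURATIONS:
        return [f"duration must be one of {ALLOWED_DURATIONS}, got {duration!r}"]
    return []


def validate_image(fmt, width, height, size_bytes):
    """Return a list of problems with a first-frame image (empty if valid)."""
    problems = []
    if fmt.lower() not in ALLOWED_FORMATS:
        problems.append(f"unsupported format: {fmt}")
    # Aspect ratio (width:height) must lie strictly between 2:5 and 5:2.
    # Cross-multiplying keeps the comparison in exact integer arithmetic.
    if not (5 * width > 2 * height and 2 * width < 5 * height):
        problems.append(f"aspect ratio {width}:{height} is outside (2:5, 5:2)")
    if min(width, height) <= 300:
        problems.append("short side must be greater than 300px")
    if size_bytes > MAX_IMAGE_BYTES:
        problems.append("file size exceeds 20MB")
    return problems
```

Returning a list of problems rather than raising on the first one lets a caller report every violation at once.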

Response Parameters

Parameter	Type	Description
code	integer	HTTP status code (e.g., 200 for success)
message	string	Status message (e.g., “success”)
data.id	string	Unique identifier for the prediction (the task ID)
data.model	string	Model ID used for the prediction
data.outputs	array	Array of URLs to the generated content (empty when status is not `completed`)
data.urls	object	Object containing related API endpoints
data.urls.get	string	URL to retrieve the prediction result
data.has_nsfw_contents	array	Array of boolean values indicating NSFW detection for each output
data.status	string	Status of the task: `created`, `processing`, `completed`, or `failed`
data.created_at	string	ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.error	string	Error message (empty if no error occurred)
data.timings	object	Object containing timing details
data.timings.inference	integer	Inference time in milliseconds
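
As a rough sketch of how these fields might be read from a parsed submission response: the helper name `extract_task` is hypothetical, and the sample payload is invented for illustration (its ID, model, and URL values are placeholders). Only the field names come from the table above.

```python
def extract_task(response):
    """Return (task_id, poll_url, status) from a parsed submission response."""
    if response.get("code") != 200:
        raise RuntimeError(f"submission failed: {response.get('message')}")
    data = response["data"]
    return data["id"], data["urls"]["get"], data["status"]


# Invented example payload; every value here is a placeholder.
sample = {
    "code": 200,
    "message": "success",
    "data": {
        "id": "example-task-id",
        "model": "example-model-id",
        "outputs": [],
        "urls": {"get": "https://api.wavespeed.ai/api/v3/predictions/example-task-id/result"},
        "has_nsfw_contents": [],
        "status": "created",
        "created_at": "2023-04-01T12:34:56.789Z",
        "error": "",
        "timings": {"inference": 0},
    },
}

task_id, poll_url, status = extract_task(sample)
```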

Result Query Parameters

Result Request Parameters

Parameter	Type	Required	Default	Description
id	string	Yes	-	Task ID
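
The status values listed for `data.status` (`created`, `processing`, `completed`, `failed`) suggest that a client queries the result endpoint repeatedly until the task reaches a final state. One possible polling loop is sketched below. The function name `wait_for_result`, the 2-second interval, and the 300-second timeout are assumptions rather than documented values, and `fetch` is a caller-supplied function that returns the parsed JSON response for a task ID.

```python
import time


def wait_for_result(fetch, task_id, interval=2.0, timeout=300.0,
                    sleep=time.sleep, clock=time.monotonic):
    """Call fetch(task_id) until the task is `completed` or `failed`.

    Returns the `data` object of the final response. `sleep` and `clock`
    are injectable so the loop can be exercised without real waiting.
    """
    deadline = clock() + timeout
    while True:
        data = fetch(task_id)["data"]
        if data["status"] in ("completed", "failed"):
            return data
        if clock() >= deadline:
            raise TimeoutError(
                f"task {task_id} still {data['status']} after {timeout}s"
            )
        sleep(interval)
```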

Result Response Parameters

Parameter	Type	Description
code	integer	HTTP status code (e.g., 200 for success)
message	string	Status message (e.g., “success”)
data	object	The prediction data object containing all details
data.id	string	Unique identifier for the prediction
data.model	string	Model ID used for the prediction
data.outputs	array	Array of URLs to the generated content (empty when status is not `completed`)
data.urls	object	Object containing related API endpoints
data.urls.get	string	URL to retrieve the prediction result
data.has_nsfw_contents	array	Array of boolean values indicating NSFW detection for each output
data.status	string	Status of the task: `created`, `processing`, `completed`, or `failed`
data.created_at	string	ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.error	string	Error message (empty if no error occurred)
data.timings	object	Object containing timing details
data.timings.inference	integer	Inference time in milliseconds
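
A minimal sketch of interpreting the `data` object of a result response follows. The helper name `safe_outputs` is hypothetical. It assumes that `has_nsfw_contents` is index-aligned with `outputs`, as "for each output" implies, and it treats an output with no corresponding flag as unflagged, which is an assumption of this example rather than documented behavior.

```python
def safe_outputs(data):
    """Return output URLs of a completed task that were not flagged as NSFW."""
    if data["status"] == "failed":
        raise RuntimeError(data.get("error") or "task failed")
    if data["status"] != "completed":
        return []  # outputs stay empty until the task completes
    outputs = data.get("outputs") or []
    flags = list(data.get("has_nsfw_contents") or [])
    # Assumption: pad missing flags with False so every URL has a flag.
    flags += [False] * (len(outputs) - len(flags))
    return [url for url, nsfw in zip(outputs, flags) if not nsfw]
```

For a completed task with outputs `["a.mp4", "b.mp4"]` and flags `[False, True]`, the function returns `["a.mp4"]`.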
