Minimax Video-01
Generate 6-second videos from text prompts or images (also known as Hailuo). To make a video with a consistent character, supply a subject reference and use the S2V-01 model.
Features
minimax/video-01 is an advanced AI-native video generation model developed by MiniMax and hosted on WaveSpeedAI. This model enables the creation of high-definition videos at 720p resolution and 25fps, featuring cinematic camera movements such as panning, tilting, and tracking. It supports text-to-video, image-to-video, and subject-to-video modes, allowing users to generate dynamic content based on text descriptions or reference images.
Key Features
- High-Resolution Output: Produces 720p videos with cinematic quality, featuring smooth camera movements such as panning, tilting, and tracking.
- Text Responsiveness: Delivers precise alignment with complex prompts, ensuring outputs match user expectations.
- Style Versatility: Supports a wide range of artistic and realistic styles, enabling diverse creative expressions.
- Efficiency: Rapid generation of visually striking content, with current support for videos up to 6 seconds long (with plans to extend duration in future updates).
- Subject Reference: Users can upload a reference image to generate videos with consistent character appearances, making it easy to create personalized content.
ComfyUI
minimax/video-01 is also available on ComfyUI, which provides local inference through a node-based workflow for flexible and efficient video generation on your system.
Limitations
- Video Duration: Currently supports generating videos up to 6 seconds long; future updates aim to extend this duration.
- Input Sensitivity: The quality and consistency of generated videos depend significantly on the quality of the input text or image; subtle variations may lead to output variability.
- Creative Focus: Designed for creative video synthesis; not intended for generating factually accurate or reliable content.
Out-of-Scope Use
The model and its derivatives may not be used in any way that violates applicable national, federal, state, local, or international law or regulation, including but not limited to:
- Exploiting, harming, or attempting to exploit or harm minors, including solicitation, creation, acquisition, or dissemination of child exploitative content.
- Generating or disseminating verifiably false information with the intent to harm others.
- Creating or distributing personally identifiable information that could be used to harm an individual.
- Harassing, abusing, threatening, stalking, or bullying individuals or groups.
- Producing non-consensual nudity or illegal pornographic content.
- Making fully automated decisions that adversely affect an individual’s legal rights or create binding obligations.
- Facilitating large-scale disinformation campaigns.
Accelerated Inference
Our accelerated inference approach uses optimization technology from WaveSpeedAI. It reduces computational overhead and latency, enabling rapid video generation without compromising quality. The system is designed to handle large-scale inference tasks efficiently while balancing speed and accuracy for real-time applications. For further details, please refer to the blog post.
Authentication
For authentication details, please refer to the Authentication Guide.
API Endpoints
Submit Task & Query Result
# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v2/minimax/video-01" \
  --header "Content-Type: application/json" \
  --header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
  --data-raw '{
    "prompt": "astronaut in a yellow suit slowly walks through a futuristic airlock door, heading into deep space. Cinematic sci-fi atmosphere, smooth slow motion",
    "enable_prompt_expansion": true
  }'
# Get the result (set requestId to the data.id value returned by the submit call)
curl --location --request GET "https://api.wavespeed.ai/api/v2/predictions/${requestId}/result" \
  --header "Authorization: Bearer ${WAVESPEED_API_KEY}"
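The two calls above are typically combined into a submit-then-poll loop. The following is a minimal sketch of that pattern in Python using only the standard library. The helper names (`http_json`, `generate_video`), the polling interval, and the poll limit are assumptions made for illustration and are not part of the API; the endpoints, headers, and response fields are taken from this page.

```python
import json
import time
import urllib.request

API_BASE = "https://api.wavespeed.ai/api/v2"


def http_json(method, url, api_key, body=None):
    """Send one request with a Bearer token and return the decoded JSON body."""
    data = json.dumps(body).encode("utf-8") if body is not None else None
    headers = {"Authorization": f"Bearer {api_key}"}
    if data is not None:
        headers["Content-Type"] = "application/json"
    req = urllib.request.Request(url, data=data, headers=headers, method=method)
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


def generate_video(prompt, api_key, request=http_json,
                   poll_interval=2.0, max_polls=150):
    """Submit a task, then poll the result endpoint until it finishes.

    `request` is injectable so the loop can be exercised without network
    access; `poll_interval` and `max_polls` are illustrative defaults.
    """
    submitted = request(
        "POST",
        f"{API_BASE}/minimax/video-01",
        api_key,
        {"prompt": prompt, "enable_prompt_expansion": True},
    )
    task_id = submitted["data"]["id"]
    result_url = f"{API_BASE}/predictions/{task_id}/result"

    for _ in range(max_polls):
        data = request("GET", result_url, api_key)["data"]
        if data["status"] == "completed":
            return data["outputs"]  # list of URLs to the generated video(s)
        if data["status"] == "failed":
            raise RuntimeError(data.get("error") or "task failed")
        time.sleep(poll_interval)  # status is still created or processing
    raise TimeoutError(f"task {task_id} did not finish after {max_polls} polls")
```

To try it against the live service, read the key from the `WAVESPEED_API_KEY` environment variable and call `generate_video(prompt, key)`. Passing a stub as `request` lets you check the control flow offline.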
Parameters
Task Submission Parameters
Request Parameters
Parameter | Type | Required | Default | Range | Description |
---|---|---|---|---|---|
prompt | string | Yes | astronaut in a yellow suit slowly walks through a futuristic airlock door, heading into deep space. Cinematic sci-fi atmosphere, smooth slow motion | - | Text prompt for generation |
image | string | No | - | - | First frame image for video generation. The output video will have the same aspect ratio as this image. |
enable_prompt_expansion | boolean | No | true | - | Whether to use the prompt optimizer |
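As an illustration of how the request parameters above fit together, here is a small sketch that assembles the JSON body. The function name `build_payload` is hypothetical. The sketch passes `image` through unchanged because the table only types it as a string; what form that string must take (for example a URL) is not stated here, so treat any value you pass as an assumption to verify.

```python
def build_payload(prompt, image=None, enable_prompt_expansion=True):
    """Build the JSON body for a minimax/video-01 task submission.

    `prompt` is required; `image` (first frame) and
    `enable_prompt_expansion` are optional, matching the table above.
    """
    if not isinstance(prompt, str) or not prompt.strip():
        raise ValueError("prompt is required and must be a non-empty string")

    payload = {
        "prompt": prompt,
        "enable_prompt_expansion": bool(enable_prompt_expansion),
    }
    if image is not None:
        # Passed through as-is; the accepted string format is not
        # documented on this page.
        payload["image"] = image
    return payload
```

Omitting `image` yields a text-to-video request; including it yields an image-to-video request whose output follows the aspect ratio of that first frame.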
Response Parameters
Parameter | Type | Description |
---|---|---|
code | integer | HTTP status code (e.g., 200 for success) |
message | string | Status message (e.g., “success”) |
data.id | string | Unique identifier for the prediction (the task ID) |
data.model | string | Model ID used for the prediction |
data.outputs | array | Array of URLs to the generated content (empty when status is not completed) |
data.urls | object | Object containing related API endpoints |
data.urls.get | string | URL to retrieve the prediction result |
data.has_nsfw_contents | array | Array of boolean values indicating NSFW detection for each output |
data.status | string | Status of the task: created, processing, completed, or failed |
data.created_at | string | ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”) |
data.error | string | Error message (empty if no error occurred) |
data.timings | object | Object containing timing details |
data.timings.inference | integer | Inference time in milliseconds |
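A sketch of reading the submission response, assuming it has been decoded into a Python dict with the fields listed above. The `Submission` class and `parse_submission` function are hypothetical names, and the sample values in the usage note are made up for illustration.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Optional


@dataclass
class Submission:
    task_id: str
    status: str
    poll_url: Optional[str]
    created_at: Optional[datetime]


def parse_submission(response):
    """Extract the fields needed to poll for a result later."""
    if response.get("code") != 200:
        raise RuntimeError(f"submission failed: {response.get('message')}")

    data = response["data"]
    created = data.get("created_at")
    if created:
        # Older Python versions reject a trailing "Z", so map it to an
        # explicit UTC offset before parsing.
        created = datetime.fromisoformat(created.replace("Z", "+00:00"))

    return Submission(
        task_id=data["id"],
        status=data["status"],
        poll_url=(data.get("urls") or {}).get("get"),
        created_at=created or None,
    )
```

For example, a response whose `data.id` is `"abc123"` and whose `data.status` is `"created"` produces a `Submission` with `task_id == "abc123"`; the `poll_url` field carries `data.urls.get` when the server includes it.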
Result Query Parameters
Result Request Parameters
Parameter | Type | Required | Default | Description |
---|---|---|---|---|
id | string | Yes | - | Task ID |
Result Response Parameters
Parameter | Type | Description |
---|---|---|
code | integer | HTTP status code (e.g., 200 for success) |
message | string | Status message (e.g., “success”) |
data | object | The prediction data object containing all details |
data.id | string | Unique identifier for the prediction |
data.model | string | Model ID used for the prediction |
data.outputs | array | Array of URLs to the generated content (empty when status is not completed) |
data.urls | object | Object containing related API endpoints |
data.urls.get | string | URL to retrieve the prediction result |
data.has_nsfw_contents | array | Array of boolean values indicating NSFW detection for each output |
data.status | string | Status of the task: created, processing, completed, or failed |
data.created_at | string | ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”) |
data.error | string | Error message (empty if no error occurred) |
data.timings | object | Object containing timing details |
data.timings.inference | integer | Inference time in milliseconds |
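Once a result response reports a finished task, the output URLs, NSFW flags, and timing can be combined into a single summary. The sketch below assumes the response has been decoded into a Python dict with the fields listed above and that `has_nsfw_contents` is index-aligned with `outputs`, as its description suggests. The function name `summarize_result` and the shape of the returned dict are illustrative choices.

```python
def summarize_result(response):
    """Summarize a result response from the predictions endpoint."""
    data = response["data"]
    status = data["status"]

    if status == "failed":
        raise RuntimeError(data.get("error") or "task failed")
    if status != "completed":
        # created or processing: nothing to report yet
        return {"status": status, "videos": [], "inference_seconds": None}

    outputs = data.get("outputs") or []
    flags = list(data.get("has_nsfw_contents") or [])
    # Pad with None if the server returned fewer flags than outputs.
    flags += [None] * (len(outputs) - len(flags))

    inference_ms = (data.get("timings") or {}).get("inference")
    return {
        "status": status,
        "videos": [{"url": u, "nsfw": f} for u, f in zip(outputs, flags)],
        # data.timings.inference is documented in milliseconds
        "inference_seconds": inference_ms / 1000 if inference_ms is not None else None,
    }
```

Raising on `failed` and returning an empty summary for `created` or `processing` keeps the caller's polling loop simple: it can retry until `videos` is non-empty or an exception surfaces.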