MiniMax Image-01 Text-to-Image
MiniMax Image-01 Text-to-Image is a powerful AI image generation model that creates high-quality images from text descriptions. Part of the MiniMax image-01 family, this model excels at understanding natural language prompts and generating diverse, creative visuals across multiple styles and scenarios. Perfect for content creators, designers, marketers, and developers building AI-powered applications.
Key Features
-
Natural Language Understanding
Simply describe what you want to see in plain text (up to 1500 characters), and the model generates corresponding images with impressive accuracy and creativity.
-
Flexible Image Dimensions
Specify exact pixel dimensions from 512×512 to 2048×2048 pixels (must be divisible by 8) for precise control over output size. Common sizes include 1024×1024, 1280×720, 1152×864, and more.
-
Prompt Optimization
Built-in prompt optimizer automatically enhances your text descriptions for better generation results, making it easier to achieve professional-quality outputs even with simple prompts.
-
Batch Generation
Generate up to 9 images in a single request, perfect for exploring creative variations and selecting the best result for your needs.
-
Reproducible Results
Use seed values to generate consistent results across multiple runs, essential for iterative refinement and maintaining consistency in production workflows.
-
Multiple Output Formats
Receive generated images as direct URLs (24-hour expiration) or Base64-encoded data for immediate embedding in your applications.
Use Cases
- Content Creation: Generate unique visuals for blogs, articles, social media posts, and marketing materials
- Concept Art: Quickly visualize ideas, characters, scenes, and environments for creative projects
- Product Mockups: Create product visualizations, packaging designs, and promotional imagery
- Marketing & Advertising: Generate eye-catching visuals for campaigns, ads, and promotional content
- Game Development: Create concept art, textures, backgrounds, and character designs
- E-commerce: Generate product lifestyle images, backgrounds, and contextual scenes
- Education: Create custom illustrations for educational materials, presentations, and courses
- Prototyping: Rapidly generate visual concepts for UI/UX design and app development
Supported Formats & Dimensions
Output Dimensions:
- Width/Height range: 512 to 2048 pixels
- Must be divisible by 8
- Common sizes: 1024×1024 (square), 1280×720 (widescreen), 1152×864 (standard), 1248×832 (photo), 832×1248 (portrait photo), 864×1152 (portrait), 720×1280 (mobile/vertical), 1344×576 (ultra-wide)
Output Formats:
- URL (default): Direct links to generated images, valid for 24 hours
- Base64: Encoded image data for direct embedding
How to Use
Basic Text-to-Image Generation
-
Write Your Prompt
- Describe the image you want in the
prompt field (max 1500 characters)
- Be specific about subjects, style, composition, lighting, and mood
- Example: "A serene mountain landscape at sunset with purple and orange skies, snow-capped peaks, and a crystal-clear lake reflecting the colors"
-
Select Image Size
- Specify dimensions using the
size parameter like "1024 * 1024" or "1280 * 720"
- Choose dimensions that match your use case (square for social media, widescreen for presentations, etc.)
-
Configure Options
num_images: Set 1-9 to generate multiple variations (default: 1)
prompt_optimizer: Enable for automatic prompt enhancement (recommended for beginners)
seed: Use a specific number for reproducible results
-
Generate
- Submit your request and receive generated images as URLs or Base64 strings
- Images are typically ready within seconds
Advanced Tips
Prompt Writing Best Practices:
- Start with the main subject, then add details about style, lighting, composition
- Use descriptive adjectives: "vibrant", "moody", "minimalist", "photorealistic"
- Specify artistic styles: "oil painting", "digital art", "watercolor", "3D render"
- Include lighting details: "golden hour", "studio lighting", "dramatic shadows"
- Mention camera angles: "aerial view", "close-up", "wide angle"
Using Seeds for Consistency:
- Generate an image you like and note its seed value
- Use the same seed with modified prompts to create variations
- Perfect for maintaining consistent style across multiple images
Batch Generation Strategy:
- Generate 4-9 variations in one request to explore different interpretations
- Compare results and select the best output
- More cost-effective than multiple single-image requests
API Parameters
- prompt (required): Text description of desired image (max 1500 chars)
- size: Image dimensions (e.g., "10241024", "1280720")
- num_images: Number of images to generate (1-9, default: 1)
- seed: Random seed for reproducible results (integer)
- prompt_optimizer: Enable automatic prompt enhancement (boolean)
- enable_base64_output: Return Base64 instead of URLs (boolean)
- enable_sync_mode: Wait for generation to complete before returning (boolean)
Pricing
- $0.0035 per image
- Generate multiple images in one request for efficient batch processing
- Total cost = $0.0035 × number of images generated
- Example: Generating 9 variations costs only $0.0315
Output Format
Generations return as:
- URLs (default): Direct links to generated images hosted on WaveSpeedAI (valid for 24 hours)
- Base64 (optional): Encoded image data for direct embedding in applications
Response includes:
- Unique request ID for tracking
- Generation status (created, processing, completed, failed)
- Output array with generated image URLs or Base64 data
- NSFW content detection flags for each image
- Creation timestamp
- Success/failure counts
Best Practices
- Be Descriptive: More detailed prompts generally produce better results
- Use Prompt Optimizer: Enable it if you're new to AI image generation
- Generate Multiple Variations: Use num_images > 1 to explore different interpretations
- Iterate with Seeds: Find a good result, then use its seed to create variations
- Choose Appropriate Dimensions: Select dimensions that match your use case
- Test Different Styles: Experiment with artistic styles in your prompts
- Save Successful Prompts: Keep a library of prompts that work well for future use
Example Prompts
Photorealistic:
"A professional product photo of a luxury watch on a marble surface, studio lighting, shallow depth of field, commercial photography style"
Artistic:
"An impressionist oil painting of a Parisian café in autumn, warm colors, loose brushstrokes, golden afternoon light"
Conceptual:
"A futuristic cityscape at night with neon lights, flying vehicles, cyberpunk aesthetic, rain-slicked streets, dramatic perspective"
Character:
"A friendly robot character with a round body, expressive LED eyes, metallic blue finish, standing in a modern laboratory, 3D render style"
Related Models
Also available on WaveSpeedAI:
- minimax/image-01/image-to-image - Transform existing images with text prompts