MiniMax Speech-02-Turbo
Convert text to natural, expressive speech with MiniMax Speech-02-Turbo. This advanced text-to-speech model offers 17+ preset voices, custom voice cloning support, and emotional expression control — perfect for voiceovers, content creation, and audio production.
Why It Sounds Great
- Natural speech: Human-like intonation, rhythm, and expression.
- 17+ preset voices: Wide variety of characters from casual to professional.
- Custom voice cloning: Use your own trained voice IDs for personalized output.
- Emotion control: Add emotional expression like happy, sad, or neutral.
- Voice tuning: Adjust speed, volume, and pitch for perfect delivery.
- Audio quality options: Configure sample rate, bitrate, and format.
Parameters
| Parameter | Required | Description |
|---|
| text | Yes | The text you want to convert to speech. |
| voice_id | Yes | Voice to use — preset ID or custom trained voice. |
| speed | No | Speech speed multiplier. Default: 1. |
| volume | No | Volume level. Default: 1. |
| pitch | No | Pitch adjustment. Default: 0. |
| emotion | No | Emotional tone: happy, sad, angry, neutral, etc. |
| english_normalization | No | Improves number-reading in English text. |
| sample_rate | No | Audio sample rate (e.g., 22050, 44100). |
| bitrate | No | Audio bitrate quality. |
| channel | No | Audio channels (mono/stereo). |
| format | No | Output format (mp3, wav, etc.). |
| language_boost | No | Boost specific language pronunciation. |
Available Preset Voices
| Voice ID | Character |
|---|
| Wise_Woman | Mature, thoughtful female |
| Friendly_Person | Warm, approachable |
| Inspirational_girl | Motivating young female |
| Deep_Voice_Man | Rich, deep male voice |
| Calm_Woman | Soothing, relaxed female |
| Casual_Guy | Laid-back male |
| Lively_Girl | Energetic young female |
| Patient_Man | Steady, reassuring male |
| Young_Knight | Youthful, heroic male |
| Determined_Man | Strong, resolute male |
| Lovely_Girl | Sweet, pleasant female |
| Decent_Boy | Polite young male |
| Imposing_Manner | Authoritative presence |
| Elegant_Man | Refined, sophisticated male |
| Abbess | Wise, spiritual female |
| Sweet_Girl_2 | Gentle, charming female |
| Exuberant_Girl | Excited, enthusiastic female |
| Energetic_Girl | Vibrant, dynamic female |
How to Use
- Enter your text — type or paste the content to convert.
- Select voice — choose a preset voice or enter your custom voice ID.
- Adjust settings (optional) — tune speed, volume, pitch, and emotion.
- Configure audio (optional) — set sample rate, bitrate, and format.
- Run — click the button to generate.
- Download — preview and save your audio file.
Pricing
Per-character billing based on text length.
| Text Length | Cost |
|---|
| 1,000 characters | $0.03 |
| 5,000 characters | $0.15 |
| 10,000 characters | $0.30 |
Best Use Cases
- Voiceovers — Create professional narration for videos and presentations.
- Audiobooks — Generate natural-sounding book narration.
- Content Creation — Add voice to social media videos and podcasts.
- E-learning — Produce educational audio content at scale.
- Accessibility — Convert written content to audio format.
- Character Voices — Create distinct voices for games and animations.
Custom Voice Cloning
Train your own voice for personalized output:
Voice Clone Training
Pro Tips for Best Results
- Match voice character to content tone — use Calm_Woman for meditation, Energetic_Girl for ads.
- Use emotion parameter to add expressiveness: "happy" for upbeat, "neutral" for professional.
- Adjust speed slightly (0.9-1.1) for more natural pacing.
- Enable english_normalization when text contains numbers or abbreviations.
- Test different voices with the same text to find the perfect match.
- For long content, break into paragraphs for more natural pacing.
Notes
- Pricing is based on character count, not audio duration.
- Custom voice IDs require prior voice clone training.
- Processing time scales with text length.
- Multiple output formats available for different use cases.