Home/Explore/Speech Generation/elevenlabs/multilingual-v1
text-to-audio

text-to-audio

ElevenLabs Multilingual V1 | Multilingual Text To Speech Model | WaveSpeedAI

elevenlabs/multilingual-v1

ElevenLabs Multilingual V1 provides natural-sounding multilingual text-to-speech across many languages. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

This parameter supports English text normalization, which improves performance in number-reading scenarios.

Idle

Your request will cost $0.1 per run.

For $1 you can run this model approximately 10 times.

ExamplesView all

README

ElevenLabs — Multilingual V1 Text-to-Speech

Multilingual V1 turns text into natural, expressive speech across multiple languages. It delivers clean pronunciation, smooth pacing, and controllable tone—great for voiceovers, narrations, learning content, and product videos.

🎧 Key Features

  • Multilingual synthesis with automatic accent handling
  • Humanlike intonation and timing; clear number/date reading
  • Tone controls via similarity and stability
  • Speaker boost for crisper English numerals and units
  • Large built-in voice library (see the voice list)

💰 Pricing

  • $0.10 per 1,000 characters
  • If the input length is less than 1000 characters, it will be counted as 1000 characters to pay.

🚀 How to Use

  1. Enter your text in the text field.
  2. Set voice_id to a built-in voice name (for example: Callum, Alice, Elli). For more options, use the voice list above.
  3. Optional controls • similarity: 0–1 (higher = closer to the base voice’s timbre) • stability: 0–1 (higher = more consistent delivery) • use_speaker_boost: improves English number and unit reading
  4. Click Run to generate and preview your audio.

📝 Notes

  • Works best with clear punctuation and short sentences; split very long text into segments.
  • voice_id must match a valid ID, if you see an invalid-voice error, pick one from the voice list.
  • use_speaker_boost is especially helpful for English financial, time, and measurement reads.