Nano Banana 2Nano Banana 2 is live
WaveSpeed.ai
Inicio/Explorar/Speech Generation/elevenlabs/flash-v2.5
text-to-audio

text-to-audio

ElevenLabs Flash V2.5 Text To Speech Model

elevenlabs/flash-v2.5

ElevenLabs Flash v2.5 is a text-to-speech model on WaveSpeedAI, billed at $0.05 per 1000 characters for generated speech. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Input
This parameter supports English text normalization, which improves performance in number-reading scenarios.

Idle

Tu solicitud costará $0.05 por ejecución.

Con $1 puedes ejecutar este modelo aproximadamente 20 veces.

EjemplosVer todo

README

ElevenLabs — Flash v2.5 Text-to-Speech

ElevenLabs Flash V2.5 text-to-speech model Generates natural-sounding speech from written text. Delivers clear pronunciation, smooth pacing, and expressive tone—ideal for voiceovers, narration, and digital content. We offer a rich, built-in library of multi-lingual voices.

🎧 Key Features

  • Fast generation with consistent, humanlike intonation and timing
  • Multilingual capability with strong English number/date reading
  • Fine control of timbre and delivery via similarity and stability
  • Speaker Boost for crisper English numerals, times, and measurements
  • Large built-in voice library; supports your custom voice IDs. See voice list here

💰 Pricing

  • $0.05 per 1,000 characters
  • If the input length is less than 1000 characters, it will be counted as 1000 characters to pay.

🚀 How to Use

  1. Enter your script in the text field.
  2. Set voice_id to a built-in or custom voice (for example: Gigi, Callum, Alice). See the full catalog in the voice list above.
  3. Tune delivery with the optional controls • similarity: 0–1 (higher = closer to the base voice’s timbre) • stability: 0–1 (higher = more consistent delivery) • use_speaker_boost: improves English number and unit reading
  4. Click Run to synthesize and preview your audio.

📝 Notes

  • Output format on the platform is MP3.
  • Split very long text into smaller paragraphs for more stable prosody.
  • Punctuation guides rhythm—prefer clear sentences over run-ons.
  • voice_id must be valid; if you see a voice error, pick one from the official voice list.
  • For financial, time, or measurement content, keep use_speaker_boost enabled for best readability.