Vidu Contest
WaveSpeed.ai
Startseite/Entdecken/Speech Generation/inworld/1.5-mini/text-to-speech
text-to-audio

text-to-audio

Inworld 1.5 Mini

inworld/1.5-mini/text-to-speech

Inworld 1.5 Mini delivers high-quality text-to-speech synthesis with 56+ multilingual voices, adjustable speaking rate, and natural-sounding audio output. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Input

Idle

Ihre Anfrage kostet $0.005 pro Durchlauf.

Für $1 können Sie dieses Modell ungefähr 200 Mal ausführen.

BeispieleAlle anzeigen

README

Inworld 1.5 Mini Text-to-Speech

Inworld 1.5 Mini is a lightweight, ultra-affordable text-to-speech model that converts written text into natural speech. It offers the same voice selection, speaking rate, and expressiveness controls as the Max model — at half the cost. Perfect for high-volume workflows, prototyping, and budget-conscious production.

Why Choose This?

  • Ultra-low cost Just $0.005 per 1,000 characters — the most affordable option for text-to-speech at scale.

  • Voice selection Choose from a library of distinct voice identities to match your brand, character, or use case.

  • Speaking rate control Adjust the speed of speech to suit narration, dialogue, announcements, or any delivery style.

  • Temperature control Fine-tune expressiveness — lower values for consistent delivery; higher values for more dynamic, varied speech.

  • Fast processing Lightweight architecture delivers quick turnaround, ideal for real-time or high-volume pipelines.

Parameters

ParameterRequiredDescription
textYesThe text content to convert to speech
voice_idNoVoice preset to use (e.g., Hades)
speaking_rateNoSpeed of speech (default: 1)
temperatureNoExpressiveness level (default: 1)

How to Use

  1. Enter your text — type or paste the content you want converted to speech.
  2. Select a voice — choose a voice preset from the voice_id dropdown.
  3. Adjust speaking rate — slide to control how fast or slow the speech is delivered.
  4. Adjust temperature — slide to control the expressiveness and variation in delivery.
  5. Run — submit and download the generated audio.

Pricing

CharactersCost
Up to 1,000$0.005
Up to 2,000$0.010
Up to 5,000$0.025
Up to 10,000$0.050

Billing Rules

  • Rate: $0.005 per 1,000 characters
  • Rounding: character count is rounded up to the next 1,000

Best Use Cases

  • High-Volume Production — Generate large batches of audio at minimal cost.
  • Prototyping & Testing — Quickly preview voiceovers before committing to final production.
  • Chatbots & Virtual Assistants — Add voice output to conversational AI at scale.
  • Content Accessibility — Convert written content to audio affordably for wider audiences.
  • Game & App Dialogue — Generate character voice lines for interactive experiences on a budget.

Pro Tips

  • Use Mini for drafting and iteration, then switch to Max for final production if higher quality is needed.
  • Keep speaking_rate around 1 for natural pacing; adjust lower for dramatic reads, higher for quick announcements.
  • Lower temperature gives more predictable, consistent output — great for automated systems.
  • Break long texts into logical paragraphs for better pacing and natural pauses.

Notes

  • Text is the only required field.
  • Billing is based on character count, rounded up to the nearest 1,000.
  • For maximum voice quality, consider Inworld 1.5 Max.