Vidu Contest
WaveSpeed.ai
Home/Explore/Speech Generation/inworld/inworld-1.5-mini/text-to-speech
text-to-audio

text-to-audio

Inworld 1.5 Mini

inworld/inworld-1.5-mini/text-to-speech

Inworld 1.5 Mini delivers high-quality text-to-speech synthesis with 56+ multilingual voices, adjustable speaking rate, and natural-sounding audio output. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Input

Idle

Your request will cost $0.005 per run.

For $1 you can run this model approximately 200 times.

ExamplesView all

README

Inworld 1.5 Mini Text-to-Speech

Inworld 1.5 Mini is a lightweight, ultra-affordable text-to-speech model that converts written text into natural speech. It offers the same voice selection, speaking rate, and expressiveness controls as the Max model — at half the cost. Perfect for high-volume workflows, prototyping, and budget-conscious production.

Why Choose This?

  • Ultra-low cost Just $0.005 per 1,000 characters — the most affordable option for text-to-speech at scale.

  • Multilingual voice library 65+ voices across 14 languages including English, Chinese, Japanese, Korean, and more.

  • Speaking rate control Adjust the speed of speech to suit narration, dialogue, announcements, or any delivery style.

  • Temperature control Fine-tune expressiveness — lower values for consistent delivery; higher values for more dynamic, varied speech.

  • Fast processing Lightweight architecture delivers quick turnaround, ideal for real-time or high-volume pipelines.

Parameters

ParameterRequiredDescription
textYesThe text content to convert to speech
voice_idNoVoice preset to use (see Available Voices below)
speaking_rateNoSpeed of speech (default: 1)
temperatureNoExpressiveness level (default: 1)

Available Voices

English

Voice IDDescription
AlexEnergetic and expressive mid-range male voice, with a mildly nasal quality
AshleyA warm, natural female voice
CraigOlder British male with a refined and articulate voice
DeborahGentle and elegant female voice
DennisMiddle-aged man with a smooth, calm and friendly voice
EdwardMale with a fast-talking, emphatic and streetwise tone
ElizabethProfessional middle-aged woman, perfect for narrations and voiceovers
HadesCommanding and gruff male voice, think an omniscient narrator or castle guard
JuliaQuirky, high-pitched female voice that delivers lines with playful energy
PixieHigh-pitched, childlike female voice with a squeaky quality
MarkEnergetic, expressive man with a rapid-fire delivery
OliviaYoung, British female with an upbeat, friendly tone
PriyaEven-toned female voice with an Indian accent
RonaldConfident, British man with a deep, gravelly voice
SarahFast-talking young adult woman, with a questioning and curious tone
ShaunFriendly, dynamic male voice great for conversations
TheodoreGravelly male voice, with a time-worn quality
TimothyLively, upbeat American male voice
WendyPosh, middle-aged British female voice
DominusRobotic, deep male voice with a menacing quality. Perfect for villains
HanaBright, expressive young female voice, perfect for storytelling and gaming
CliveBritish-accented male voice with a calm, cordial quality
CarterEnergetic, mature radio announcer-style male voice
BlakeRich, intimate male voice, perfect for audiobooks and romantic content
LunaCalm, relaxing female voice, perfect for meditations and sleep stories

Chinese

Voice IDDescription
YichenA calm, flat young adult male Chinese voice
XiaoyinA youthful Chinese female voice with a gentle, sweet voice
XinyiA Chinese woman with a neutral tone, perfect for narrations
JingAn energetic, fast-paced young Chinese female

Japanese

Voice IDDescription
AsukaFriendly, young adult Japanese female voice
SatoshiDramatic, expressive male Japanese voice filled with energy

Korean

Voice IDDescription
HyunwooYoung adult Korean male voice
MinjiEnergetic, friendly young Korean female voice
SeojunClear, deep mature Korean male voice
YoonaKorean woman with a gentle, soothing voice

French

Voice IDDescription
AlainDeep, smooth middle-aged male French voice. Composed and calm
HélèneMiddle-aged French woman, with a smooth, musical, and graceful voice
MathieuA French male voice carrying a nasal quality
ÉtienneCalm young adult French male

German

Voice IDDescription
JohannaA calm older German female with a low, smoky voice
JosefAn articulate German male voice with an announcer-like quality

Spanish

Voice IDDescription
DiegoSpanish-speaking male voice with a soothing, gentle quality
LupitaVibrant, energetic young Spanish-speaking female voice
MiguelA calm adult Spanish-speaking male voice, perfect for storytelling
RafaelMiddle-aged Spanish-speaking male with a deep, composed voice

Portuguese

Voice IDDescription
HeitorComposed Portuguese-speaking male voice with a neutral tone
MaitêMiddle-aged Portuguese-speaking female voice

Italian

Voice IDDescription
GianniDeep, smooth Italian male voice that speaks rapidly
OriettaCalm adult female Italian voice, with a soothing cadence

Dutch

Voice IDDescription
ErikOlder Dutch male voice with a weathered edge
KatrienDutch woman with an expressive voice
LennartA confident Dutch male voice. Calm and relaxed
LoreClear, calm Dutch female voice, great for narrations

Polish

Voice IDDescription
SzymonPolish adult male voice with a warm, friendly quality
WojciechA middle-aged Polish male voice

Russian

Voice IDDescription
SvetlanaSoft, high-pitched female voice, with a slightly breathy quality
ElenaClear, mid-range female voice, with a neutral, informational tone
DmitryDeep, gravelly male voice, with a commanding and narrative tone
NikolaiDeep, resonant male voice, with a clear, theatrical quality

Hindi

Voice IDDescription
RiyaProfessional and clean female voice, polished and approachable
ManojClear, professional Hindi male voice. Great for narrations

Hebrew

Voice IDDescription
YaelMid-range female Hebrew voice, suitable for narrations
OrenSteady male Hebrew voice, great for podcasts and voiceovers

Arabic

Voice IDDescription
NourPolished female Arabic voice with a friendly tone
OmarBright, confident Arabic male voice, great for announcements

How to Use

  1. Enter your text — type or paste the content you want converted to speech.
  2. Select a voice — choose a voice preset from the voice_id dropdown.
  3. Adjust speaking rate — slide to control how fast or slow the speech is delivered.
  4. Adjust temperature — slide to control the expressiveness and variation in delivery.
  5. Run — submit and download the generated audio.

Pricing

CharactersCost
Up to 1,000$0.005
Up to 2,000$0.010
Up to 5,000$0.025
Up to 10,000$0.050

Billing Rules

  • Rate: $0.005 per 1,000 characters
  • Rounding: character count is rounded up to the next 1,000

Best Use Cases

  • High-Volume Production — Generate large batches of audio at minimal cost.
  • Prototyping & Testing — Quickly preview voiceovers before committing to final production.
  • Chatbots & Virtual Assistants — Add voice output to conversational AI at scale.
  • Content Accessibility — Convert written content to audio affordably for wider audiences.
  • Game & App Dialogue — Generate character voice lines for interactive experiences on a budget.
  • Multilingual Content — Create audio content in 14 languages from a single API.

Pro Tips

  • Use Mini for drafting and iteration, then switch to Max for final production if higher quality is needed.
  • Keep speaking_rate around 1 for natural pacing; adjust lower for dramatic reads, higher for quick announcements.
  • Lower temperature gives more predictable, consistent output — great for automated systems.
  • Break long texts into logical paragraphs for better pacing and natural pauses.
  • Match voice language to your text language for best pronunciation and intonation.

Notes

  • Text is the only required field.
  • Billing is based on character count, rounded up to the nearest 1,000.
  • For maximum voice quality, consider Inworld 1.5 Max.

Related Models