Inworld 1.5 Mini Text To Speech

Inworld 1.5 Mini delivers high-quality text-to-speech synthesis with 65+ multilingual voices, adjustable speaking rate, and natural-sounding audio output. It is available through a ready-to-use REST inference API with fast processing, no cold starts, and affordable pricing.

Features

Inworld 1.5 Mini Text-to-Speech

Inworld 1.5 Mini is a lightweight, ultra-affordable text-to-speech model that converts written text into natural speech. It offers the same voice selection, speaking rate, and expressiveness controls as the Max model — at half the cost. Perfect for high-volume workflows, prototyping, and budget-conscious production.

Need higher quality? Try Inworld 1.5 Max Text-to-Speech

Why Choose This?

Ultra-low cost: $0.005 per 1,000 characters, an affordable option for text-to-speech at scale.
Multilingual voice library: 65+ voices across 15 languages, including English, Chinese, Japanese, Korean, and more.
Speaking rate control: Adjust the speed of speech to suit narration, dialogue, announcements, or any delivery style.
Temperature control: Fine-tune expressiveness. Lower values give consistent delivery; higher values give more dynamic, varied speech.
Fast processing: Lightweight architecture delivers quick turnaround, suited to real-time or high-volume pipelines.

Parameters

Parameter	Required	Description
text	Yes	The text content to convert to speech
voice_id	No	Voice preset to use (default: Alex; see Available Voices below)
speaking_rate	No	Speed of speech (default: 1; range: 0.5 to 1.5)
temperature	No	Expressiveness level (default: 1; range: 0.7 to 1.5)

Available Voices

English

Voice ID	Description
Alex	Energetic and expressive mid-range male voice, with a mildly nasal quality
Ashley	A warm, natural female voice
Craig	Older British male with a refined and articulate voice
Deborah	Gentle and elegant female voice
Dennis	Middle-aged man with a smooth, calm and friendly voice
Edward	Male with a fast-talking, emphatic and streetwise tone
Elizabeth	Professional middle-aged woman, perfect for narrations and voiceovers
Hades	Commanding and gruff male voice, think an omniscient narrator or castle guard
Julia	Quirky, high-pitched female voice that delivers lines with playful energy
Pixie	High-pitched, childlike female voice with a squeaky quality
Mark	Energetic, expressive man with a rapid-fire delivery
Olivia	Young, British female with an upbeat, friendly tone
Priya	Even-toned female voice with an Indian accent
Ronald	Confident, British man with a deep, gravelly voice
Sarah	Fast-talking young adult woman, with a questioning and curious tone
Shaun	Friendly, dynamic male voice great for conversations
Theodore	Gravelly male voice, with a time-worn quality
Timothy	Lively, upbeat American male voice
Wendy	Posh, middle-aged British female voice
Dominus	Robotic, deep male voice with a menacing quality. Perfect for villains
Hana	Bright, expressive young female voice, perfect for storytelling and gaming
Clive	British-accented male voice with a calm, cordial quality
Carter	Energetic, mature radio announcer-style male voice
Blake	Rich, intimate male voice, perfect for audiobooks and romantic content
Luna	Calm, relaxing female voice, perfect for meditations and sleep stories

Chinese

Voice ID	Description
Yichen	A calm, flat young adult male Chinese voice
Xiaoyin	A youthful Chinese female voice with a gentle, sweet voice
Xinyi	A Chinese woman with a neutral tone, perfect for narrations
Jing	An energetic, fast-paced young Chinese female

Japanese

Voice ID	Description
Asuka	Friendly, young adult Japanese female voice
Satoshi	Dramatic, expressive male Japanese voice filled with energy

Korean

Voice ID	Description
Hyunwoo	Young adult Korean male voice
Minji	Energetic, friendly young Korean female voice
Seojun	Clear, deep mature Korean male voice
Yoona	Korean woman with a gentle, soothing voice

French

Voice ID	Description
Alain	Deep, smooth middle-aged male French voice. Composed and calm
Hélène	Middle-aged French woman, with a smooth, musical, and graceful voice
Mathieu	A French male voice carrying a nasal quality
Étienne	Calm young adult French male

German

Voice ID	Description
Johanna	A calm older German female with a low, smoky voice
Josef	An articulate German male voice with an announcer-like quality

Spanish

Voice ID	Description
Diego	Spanish-speaking male voice with a soothing, gentle quality
Lupita	Vibrant, energetic young Spanish-speaking female voice
Miguel	A calm adult Spanish-speaking male voice, perfect for storytelling
Rafael	Middle-aged Spanish-speaking male with a deep, composed voice

Portuguese

Voice ID	Description
Heitor	Composed Portuguese-speaking male voice with a neutral tone
Maitê	Middle-aged Portuguese-speaking female voice

Italian

Voice ID	Description
Gianni	Deep, smooth Italian male voice that speaks rapidly
Orietta	Calm adult female Italian voice, with a soothing cadence

Dutch

Voice ID	Description
Erik	Older Dutch male voice with a weathered edge
Katrien	Dutch woman with an expressive voice
Lennart	A confident Dutch male voice. Calm and relaxed
Lore	Clear, calm Dutch female voice, great for narrations

Polish

Voice ID	Description
Szymon	Polish adult male voice with a warm, friendly quality
Wojciech	A middle-aged Polish male voice

Russian

Voice ID	Description
Svetlana	Soft, high-pitched female voice, with a slightly breathy quality
Elena	Clear, mid-range female voice, with a neutral, informational tone
Dmitry	Deep, gravelly male voice, with a commanding and narrative tone
Nikolai	Deep, resonant male voice, with a clear, theatrical quality

Hindi

Voice ID	Description
Riya	Professional and clean female voice, polished and approachable
Manoj	Clear, professional Hindi male voice. Great for narrations

Hebrew

Voice ID	Description
Yael	Mid-range female Hebrew voice, suitable for narrations
Oren	Steady male Hebrew voice, great for podcasts and voiceovers

Arabic

Voice ID	Description
Nour	Polished female Arabic voice with a friendly tone
Omar	Bright, confident Arabic male voice, great for announcements

How to Use

Enter your text — type or paste the content you want converted to speech.
Select a voice — choose a voice preset from the voice_id dropdown.
Adjust speaking rate — slide to control how fast or slow the speech is delivered.
Adjust temperature — slide to control the expressiveness and variation in delivery.
Run — submit and download the generated audio.

Pricing

Characters	Cost
Up to 1,000	$0.005
Up to 2,000	$0.010
Up to 5,000	$0.025
Up to 10,000	$0.050

Billing Rules

Rate: $0.005 per 1,000 characters
Rounding: character count is rounded up to the next 1,000
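
As a rough illustration of the billing rule above, the following sketch estimates the cost of a single request. The function name `estimate_cost` is hypothetical, and the sketch assumes that rounding is applied per request and that every character (including spaces and punctuation) counts; neither detail is confirmed on this page.

```python
from decimal import Decimal

RATE_PER_BLOCK = Decimal("0.005")  # USD per 1,000 characters, from the pricing table
BLOCK_SIZE = 1000


def estimate_cost(text: str) -> Decimal:
    """Estimate the cost of one request (assumes per-request rounding)."""
    # Ceiling division: round the character count up to the next 1,000.
    blocks = -(-len(text) // BLOCK_SIZE)
    return blocks * RATE_PER_BLOCK


print(estimate_cost("a" * 1000))  # 0.005
print(estimate_cost("a" * 1001))  # 0.010
```

Under these assumptions, a 1,001-character request is billed the same as a 2,000-character one, which matches the second row of the pricing table.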

Best Use Cases

High-Volume Production: Generate large batches of audio at minimal cost.
Prototyping & Testing: Quickly preview voiceovers before committing to final production.
Chatbots & Virtual Assistants: Add voice output to conversational AI at scale.
Content Accessibility: Convert written content to audio affordably for wider audiences.
Game & App Dialogue: Generate character voice lines for interactive experiences on a budget.
Multilingual Content: Create audio content in 15 languages from a single API.

Pro Tips

Use Mini for drafting and iteration, then switch to Max for final production if higher quality is needed.
Keep speaking_rate around 1 for natural pacing; adjust lower for dramatic reads, higher for quick announcements.
Lower temperature gives more predictable, consistent output — great for automated systems.
Break long texts into logical paragraphs for better pacing and natural pauses.
Match voice language to your text language for best pronunciation and intonation.
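
One possible reading of the paragraph tip above is to split a long script on blank lines before synthesis, so each paragraph can be reviewed or submitted on its own. The helper name `split_paragraphs` is hypothetical. Note that if billing is rounded per request (an assumption), submitting many short paragraphs separately may cost more than one combined request.

```python
import re


def split_paragraphs(text: str) -> list[str]:
    """Split text on blank lines and drop empty paragraphs."""
    return [part.strip() for part in re.split(r"\n\s*\n", text) if part.strip()]


script = "Welcome to the show.\n\nToday we cover three topics.\n\n\nLet's begin."
for paragraph in split_paragraphs(script):
    print(paragraph)
```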

Notes

Text is the only required field.
Billing is based on character count, rounded up to the next 1,000.
For maximum voice quality, consider Inworld 1.5 Max.

Related Models

Inworld 1.5 Max Text-to-Speech: Higher-quality voices at $0.01 per 1,000 characters.

Authentication

For authentication details, please refer to the Authentication Guide.

API Endpoints

Submit Task & Query Result


# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/inworld/inworld-1.5-mini/text-to-speech" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
    "text": "Hello, this is a sample sentence for text-to-speech.",
    "voice_id": "Alex",
    "speaking_rate": 1,
    "temperature": 1
}'

# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"
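
The two curl calls above can be sketched in Python using only the standard library. The URLs and headers are taken from the curl commands; the helper names (`build_submit_request`, `build_result_request`, `send`) are hypothetical, and using `data.id` from the submission response as the request ID is an assumption based on the response tables.

```python
import json
import urllib.request

BASE_URL = "https://api.wavespeed.ai/api/v3"
SUBMIT_URL = f"{BASE_URL}/inworld/inworld-1.5-mini/text-to-speech"


def build_submit_request(api_key, text, voice_id="Alex", speaking_rate=1, temperature=1):
    """Build the POST request that submits a text-to-speech task."""
    payload = {
        "text": text,
        "voice_id": voice_id,
        "speaking_rate": speaking_rate,
        "temperature": temperature,
    }
    return urllib.request.Request(
        SUBMIT_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def build_result_request(api_key, request_id):
    """Build the GET request that fetches the result of a submitted task."""
    return urllib.request.Request(
        f"{BASE_URL}/predictions/{request_id}/result",
        headers={"Authorization": f"Bearer {api_key}"},
        method="GET",
    )


def send(request):
    """Send a request and decode the JSON response body."""
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.loads(response.read().decode("utf-8"))


# Usage (performs real network calls, so it is left commented out):
# submitted = send(build_submit_request(api_key, "Hello from Inworld 1.5 Mini."))
# result = send(build_result_request(api_key, submitted["data"]["id"]))
```

Building the request separately from sending it keeps the payload and headers easy to inspect without touching the network.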

Parameters

Task Submission Parameters

Request Parameters

Parameter	Type	Required	Default	Range	Description
text	string	Yes	-	-	The text content to convert to speech.
voice_id	string	No	Alex	Alex, Ashley, Craig, Deborah, Dennis, Edward, Elizabeth, Hades, Julia, Pixie, Mark, Olivia, Priya, Ronald, Sarah, Shaun, Theodore, Timothy, Wendy, Dominus, Hana, Clive, Carter, Blake, Luna, Yichen, Xiaoyin, Xinyi, Jing, Erik, Katrien, Lennart, Lore, Alain, Hélène, Mathieu, Étienne, Johanna, Josef, Gianni, Orietta, Asuka, Satoshi, Hyunwoo, Minji, Seojun, Yoona, Szymon, Wojciech, Heitor, Maitê, Diego, Lupita, Miguel, Rafael, Svetlana, Elena, Dmitry, Nikolai, Riya, Manoj, Yael, Oren, Nour, Omar	The voice to use for speech generation.
speaking_rate	number	No	1	0.5 ~ 1.5	The speed of speaking.
temperature	number	No	1	0.7 ~ 1.5	The temperature to use for the generation. A higher value means more randomness in the output.
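
A minimal sketch of client-side validation against the ranges in the table above. The function name `build_payload` is hypothetical; the sketch assumes the range bounds are inclusive and that blank text should be rejected, and it does not check `voice_id` against the voice list.

```python
def build_payload(text, voice_id="Alex", speaking_rate=1.0, temperature=1.0):
    """Validate parameters against the documented ranges and return a request body."""
    if not isinstance(text, str) or not text.strip():
        raise ValueError("text is required and must be a non-empty string")
    if not 0.5 <= speaking_rate <= 1.5:  # assumed inclusive
        raise ValueError("speaking_rate must be between 0.5 and 1.5")
    if not 0.7 <= temperature <= 1.5:  # assumed inclusive
        raise ValueError("temperature must be between 0.7 and 1.5")
    return {
        "text": text,
        "voice_id": voice_id,
        "speaking_rate": speaking_rate,
        "temperature": temperature,
    }


print(build_payload("Good morning.", voice_id="Olivia", speaking_rate=1.2))
```

Checking values before submission avoids spending a round trip on a request the API would likely reject.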

Response Parameters

Parameter	Type	Description
code	integer	HTTP status code (e.g., 200 for success)
message	string	Status message (e.g., “success”)
data.id	string	Unique identifier for the prediction (the task ID)
data.model	string	Model ID used for the prediction
data.outputs	array	Array of URLs to the generated content (empty when status is not `completed`)
data.urls	object	Object containing related API endpoints
data.urls.get	string	URL to retrieve the prediction result
data.status	string	Status of the task: `created`, `processing`, `completed`, or `failed`
data.created_at	string	ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.error	string	Error message (empty if no error occurred)
data.timings	object	Object containing timing details
data.timings.inference	integer	Inference time in milliseconds
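
To illustrate how the submission response fields above might be read, the sketch below pulls out the task ID and the result URL. The function name `task_reference` is hypothetical, every value in `sample` is made up for illustration, and treating any `code` other than 200 as a failure is an assumption.

```python
def task_reference(response):
    """Return (task_id, result_url) from a submission response."""
    if response.get("code") != 200:  # assumption: non-200 means the submission failed
        raise RuntimeError(f"submission failed: {response.get('message')}")
    data = response["data"]
    return data["id"], data["urls"]["get"]


# Hypothetical response shaped after the table above; all values are invented.
sample = {
    "code": 200,
    "message": "success",
    "data": {
        "id": "task-123",
        "outputs": [],
        "urls": {"get": "https://api.wavespeed.ai/api/v3/predictions/task-123/result"},
        "status": "created",
        "error": "",
    },
}

task_id, result_url = task_reference(sample)
print(task_id)
print(result_url)
```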

Result Request Parameters

Parameter	Type	Required	Default	Description
id	string	Yes	-	Task ID

Result Response Parameters

Parameter	Type	Description
code	integer	HTTP status code (e.g., 200 for success)
message	string	Status message (e.g., “success”)
data	object	The prediction data object containing all details
data.id	string	Unique identifier for the prediction (the task ID that was queried)
data.model	string	Model ID used for the prediction
data.outputs	array	Array of URLs to the generated content (empty when status is not `completed`)
data.urls	object	Object containing related API endpoints
data.urls.get	string	URL to retrieve the prediction result
data.status	string	Status of the task: `created`, `processing`, `completed`, or `failed`
data.created_at	string	ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.error	string	Error message (empty if no error occurred)
data.timings	object	Object containing timing details
data.timings.inference	integer	Inference time in milliseconds
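
Because `data.status` moves through `created` and `processing` before reaching `completed` or `failed`, a client typically polls the result endpoint. The sketch below shows one way to do that; `wait_for_outputs` and its parameters are hypothetical, the polling interval and attempt limit are arbitrary choices, and `fetch_result` stands for any callable that returns the decoded result response.

```python
import time


def wait_for_outputs(fetch_result, interval=1.0, max_attempts=60, sleep=time.sleep):
    """Call fetch_result() until the task completes or fails; return the output URLs."""
    for _ in range(max_attempts):
        data = fetch_result().get("data") or {}
        status = data.get("status")
        if status == "completed":
            return list(data.get("outputs") or [])
        if status == "failed":
            raise RuntimeError(data.get("error") or "task failed")
        sleep(interval)  # status is still created or processing
    raise TimeoutError("task did not finish within the allotted attempts")


# Simulated responses so the example runs without network access.
fake_responses = iter([
    {"data": {"status": "processing", "outputs": []}},
    {"data": {"status": "completed", "outputs": ["https://example.com/audio.mp3"]}},
])
print(wait_for_outputs(lambda: next(fake_responses), sleep=lambda seconds: None))
```

Passing the fetch function in as an argument keeps the loop independent of any particular HTTP client.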
