Mirelo AI SFX 1.6 Text to Audio
Mirelo SFX 1.6 Text to Audio is a fast AI audio generation model that creates sound effects and ambient audio directly from text prompts, with optional seamless ambience looping. It is available as a ready-to-use REST inference API for sound effect generation, game audio, video production, cinematic sound design, background ambience, loopable audio assets, and professional audio workflows, with simple integration, no cold starts, and affordable pricing.
Features
Mirelo AI SFX 1.6 Text-to-Audio
Mirelo AI SFX 1.6 Text-to-Audio generates sound effects, ambience, and short audio clips from natural-language prompts. It supports loop-friendly ambience mode, multiple variations, flexible duration control, and optional doubled loop output for seamless background audio workflows.
Why Choose This?
- Prompt-based audio generation: Generate sound effects or ambient audio directly from a text description.
- Flexible duration control: Choose the target duration for the generated clip, from short effects to longer ambient beds.
- Loop-friendly ambience mode: Enable `ambience` to generate audio designed for seamless looping.
- Multiple variations: Generate up to 4 different versions in one request with `num_samples`.
- Optional doubled loop output: When using ambience mode, enable `double_output` to concatenate the loop with itself for a longer seamless result.
- Production-ready API: Useful for games, film, podcasts, background ambience, sound design, and content production workflows.
Parameters
| Parameter | Required | Description |
|---|---|---|
| text_prompt | Yes | Text prompt describing the sound effect or ambient audio to generate. Minimum length: 4 characters. |
| duration | No | Target duration in seconds. Range: 0.1–60. Default: 10. |
| ambience | No | When true, generate and stitch the result so the tile loops seamlessly. Default: false. |
| double_output | No | Only used when ambience is true: concatenate the loop with itself for a 2x-length output. Default: false. |
| num_samples | No | Number of variations to generate. Range: 1–4. Default: 1. |
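The limits in the table can be checked on the client before a request is sent. The following is a minimal sketch, not part of any official SDK: the function name `build_payload` is hypothetical, and rejecting `double_output` without `ambience` is a design choice of this example (the table only says the flag is unused in that case).

```python
from typing import Any


def build_payload(
    text_prompt: str,
    duration: float = 10,
    ambience: bool = False,
    double_output: bool = False,
    num_samples: int = 1,
) -> dict[str, Any]:
    """Validate inputs against the documented limits and return the request body."""
    if len(text_prompt) < 4:
        raise ValueError("text_prompt must be at least 4 characters long")
    if not 0.1 <= duration <= 60:
        raise ValueError("duration must be between 0.1 and 60 seconds")
    if not isinstance(num_samples, int) or not 1 <= num_samples <= 4:
        raise ValueError("num_samples must be an integer from 1 to 4")
    if double_output and not ambience:
        # Assumption of this sketch: fail loudly instead of sending a flag
        # that the table says is only used when ambience is true.
        raise ValueError("double_output requires ambience=True")
    return {
        "text_prompt": text_prompt,
        "duration": duration,
        "ambience": ambience,
        "double_output": double_output,
        "num_samples": num_samples,
    }
```

Calling `build_payload("rain on a tin roof", duration=5, ambience=True)` returns a dictionary that can be serialized as the JSON request body.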
How to Use
- Write your prompt: describe the sound, mood, texture, or environment you want.
- Set duration: choose how long the generated audio should be.
- Enable ambience (optional): turn this on if you want a seamless loopable result.
- Enable double output (optional): when using ambience, use this to produce a doubled loop.
- Set number of samples: choose how many variations you want, from 1 to 4.
- Submit: run the model and download the generated audio.
Example Prompt
Dark cinematic ambience with distant thunder, soft low-frequency rumble, subtle wind, and evolving tension
Pricing
Pricing is based on generated duration and number of samples.
| Duration | 1 Sample | 2 Samples | 3 Samples | 4 Samples |
|---|---|---|---|---|
| 1s | $0.01 | $0.02 | $0.03 | $0.04 |
| 5s | $0.05 | $0.10 | $0.15 | $0.20 |
| 10s | $0.10 | $0.20 | $0.30 | $0.40 |
| 20s | $0.20 | $0.40 | $0.60 | $0.80 |
| 30s | $0.30 | $0.60 | $0.90 | $1.20 |
| 60s | $0.60 | $1.20 | $1.80 | $2.40 |
Billing Rules
- Pricing is $0.01 per generated second, per sample.
- `text_prompt` does not affect pricing.
- This pricing assumes `ambience` and `double_output` do not change billing unless your backend explicitly makes them billable.
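The pricing table follows from the per-second rate multiplied by the duration and the sample count. Below is a rough sketch of that arithmetic; the helper name `estimate_cost` is hypothetical, it uses `Decimal` to avoid floating-point drift, and it assumes (as the rules above do) that `ambience` and `double_output` are not billed. How fractional cents are rounded is not stated here, so the sketch does not round.

```python
from decimal import Decimal

# Rate taken from the billing rules: $0.01 per generated second, per sample.
PRICE_PER_SECOND = Decimal("0.01")


def estimate_cost(duration: float, num_samples: int = 1) -> Decimal:
    """Return the estimated price in USD for one request (unrounded)."""
    # Convert via str() so that 0.1 becomes Decimal("0.1"), not a binary approximation.
    return PRICE_PER_SECOND * Decimal(str(duration)) * num_samples
```

For example, `estimate_cost(10, 2)` equals `Decimal("0.20")`, matching the 10s row with 2 samples in the table.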
Best Use Cases
- Sound effects — Generate short custom SFX for games, apps, and media.
- Ambient loops — Create seamless background beds for environments and scenes.
- Content production — Add generated sound design to videos, podcasts, or social content.
- Creative prototyping — Explore multiple sound directions quickly with several variations.
- Game and app audio — Produce loopable background textures and interactive sound assets.
Pro Tips
- Be specific in your prompt about texture, mood, environment, and intensity.
- Use `ambience` when the output needs to loop smoothly.
- Turn on `double_output` only when you want a longer looped deliverable.
- Increase `num_samples` when you want multiple creative options from the same prompt.
- Start with shorter durations for testing, then scale up once the direction feels right.
Notes
- `text_prompt` is required.
- `duration` supports 0.1–60 seconds.
- `num_samples` supports 1–4.
- `double_output` only applies when `ambience` is enabled.
- Pricing is based on requested generation duration and sample count.
Related Models
- Mirelo AI SFX 1.6 Extend Audio — Extend an existing audio clip with newly generated continuation.
- Mirelo AI SFX 1.6 Inpaint Audio — Regenerate a selected segment inside an existing audio clip.
- Other Mirelo AI sound generation workflows — Useful when you need continuation or localized audio editing instead of fresh generation.
Authentication
For authentication details, please refer to the Authentication Guide.
API Endpoints
Submit Task & Query Result
# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/mirelo-ai/sfx-1.6/text-to-audio" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
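--data-raw '{
    "text_prompt": "Dark cinematic ambience with distant thunder, soft low-frequency rumble, subtle wind, and evolving tension",
    "duration": 10,
    "ambience": false,
    "double_output": false,
    "num_samples": 1
}'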
# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"
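The same submit-then-poll flow can be sketched in Python with only the standard library. This is an illustrative outline rather than an official client: the function names, the one-second polling interval, the 120-poll limit, and the 30-second timeout are all assumptions, while the URLs, headers, and response fields mirror the curl commands and the response tables in this document.

```python
import json
import time
import urllib.request

API_BASE = "https://api.wavespeed.ai/api/v3"
MODEL_PATH = "mirelo-ai/sfx-1.6/text-to-audio"


def build_submit_request(api_key: str, payload: dict) -> urllib.request.Request:
    """Build the POST request that submits a generation task."""
    return urllib.request.Request(
        f"{API_BASE}/{MODEL_PATH}",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def build_result_request(api_key: str, request_id: str) -> urllib.request.Request:
    """Build the GET request that fetches the result of a task."""
    return urllib.request.Request(
        f"{API_BASE}/predictions/{request_id}/result",
        headers={"Authorization": f"Bearer {api_key}"},
        method="GET",
    )


def send(request: urllib.request.Request) -> dict:
    """Send a request and decode the JSON response body."""
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)


def generate(api_key: str, payload: dict, poll_seconds: float = 1.0, max_polls: int = 120) -> list:
    """Submit a task, then poll until it completes or fails."""
    task = send(build_submit_request(api_key, payload))["data"]
    for _ in range(max_polls):
        result = send(build_result_request(api_key, task["id"]))["data"]
        if result["status"] == "completed":
            return result["outputs"]
        if result["status"] == "failed":
            raise RuntimeError(result.get("error") or "generation failed")
        time.sleep(poll_seconds)  # assumed interval; tune for your workload
    raise TimeoutError("task did not finish within the polling budget")
```

A call such as `generate(os.environ["WAVESPEED_API_KEY"], {"text_prompt": "distant thunder", "duration": 5})` (after `import os`) would return the list of output URLs once the task reports `completed`.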
Parameters
Task Submission Parameters
Request Parameters
| Parameter | Type | Required | Default | Range | Description |
|---|---|---|---|---|---|
| text_prompt | string | Yes | - | - | Text prompt describing the sound effect or ambient audio to generate. |
| duration | number | No | 10 | 0.1 ~ 60 | Target duration in seconds. |
| ambience | boolean | No | false | - | When true, generate and stitch the result so the tile loops seamlessly. |
| double_output | boolean | No | false | - | Only used when ambience is true: concatenate the loop with itself for a 2x-length output. |
| num_samples | integer | No | 1 | 1 ~ 4 | Number of variations to generate. |
Response Parameters
| Parameter | Type | Description |
|---|---|---|
| code | integer | HTTP status code (e.g., 200 for success) |
| message | string | Status message (e.g., “success”) |
| data.id | string | Unique identifier for the prediction (the task ID) |
| data.model | string | Model ID used for the prediction |
| data.outputs | array | Array of URLs to the generated content (empty when status is not completed) |
| data.urls | object | Object containing related API endpoints |
| data.urls.get | string | URL to retrieve the prediction result |
| data.status | string | Status of the task: created, processing, completed, or failed |
| data.created_at | string | ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”) |
| data.error | string | Error message (empty if no error occurred) |
| data.timings | object | Object containing timing details |
| data.timings.inference | integer | Inference time in milliseconds |
Result Request Parameters
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
| id | string | Yes | - | Task ID |
Result Response Parameters
| Parameter | Type | Description |
|---|---|---|
| code | integer | HTTP status code (e.g., 200 for success) |
| message | string | Status message (e.g., “success”) |
| data | object | The prediction data object containing all details |
| data.id | string | Unique identifier of the prediction being retrieved |
| data.model | string | Model ID used for the prediction |
| data.outputs | array | Array of URLs to the generated content (empty when status is not completed) |
| data.urls | object | Object containing related API endpoints |
| data.urls.get | string | URL to retrieve the prediction result |
| data.status | string | Status of the task: created, processing, completed, or failed |
| data.created_at | string | ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”) |
| data.error | string | Error message (empty if no error occurred) |
| data.timings | object | Object containing timing details |
| data.timings.inference | integer | Inference time in milliseconds |
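The fields in the result response can be reduced to a small helper that returns the output URLs only when the task has completed. This is a sketch under the field names listed in the table above; the function name `extract_outputs` is hypothetical, and returning an empty list for tasks that are still `created` or `processing` is a choice made for this example.

```python
def extract_outputs(body: dict) -> list:
    """Return output URLs from a result response body.

    Raises RuntimeError for failed tasks and returns an empty list
    while the task is still created or processing.
    """
    data = body.get("data") or {}
    status = data.get("status")
    if status == "failed":
        raise RuntimeError(data.get("error") or "task failed")
    if status != "completed":
        return []
    return list(data.get("outputs") or [])
```

With `num_samples` greater than 1, the returned list would be expected to hold one URL per variation, although the table does not state this explicitly.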