Heartmula Transcribe Lyrics

Playground

HeartMuLa Transcribe extracts lyrics from audio files using advanced AI. Supports multilingual transcription. Ready-to-use REST inference API with best performance, no coldstarts, and affordable pricing.

Features

HeartMuLa Transcribe Lyrics

HeartMuLa Transcribe Lyrics is an AI-powered audio transcription model that extracts lyrics from music tracks. Upload a song and the model automatically recognizes and transcribes the vocal content into text.

Why Choose This?

Automatic lyrics extraction Transcribes vocals from any music track into readable text.
Simple one-input workflow Just upload an audio file — no additional configuration needed.
Fast processing Get transcribed lyrics in seconds.
Versatile audio support Works with various audio formats and music styles.

Parameters

Parameter	Required	Description
audio	Yes	Music audio file to transcribe (URL or upload)

How to Use

Upload your audio — provide the music track you want to transcribe.
Run — submit and receive the transcribed lyrics.

Pricing

Output	Cost
Per transcription	$0.05

Best Use Cases

Lyrics Transcription — Extract lyrics from songs for reference or annotation.
Music Production — Transcribe vocal recordings for editing and review.
Content Creation — Get text versions of song lyrics for subtitles or captions.
Music Analysis — Extract lyrics for study, review, or documentation.
Karaoke Preparation — Generate lyrics text from audio tracks.

Pro Tips

Use high-quality audio files with clear vocals for best transcription accuracy.
Works best when vocals are prominent and not heavily mixed with instrumentals.
Combine with HeartMuLa Generate Music to create songs then verify lyrics.

Notes

Only audio is required.
Output is a JSON object containing the transcribed lyrics text.
Ensure uploaded audio URLs are publicly accessible.
Transcription accuracy depends on vocal clarity and audio quality.

HeartMuLa Generate Music — Generate complete songs from lyrics and style tags.

Authentication

For authentication details, please refer to the Authentication Guide.

API Endpoints

Submit Task & Query Result


# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/wavespeed-ai/heartmula/transcribe-lyrics" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{}'

# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"

Parameters

Task Submission Parameters

Request Parameters

Parameter	Type	Required	Default	Range	Description
audio	string	Yes	-	-	URL to the audio file to transcribe lyrics from.

Response Parameters

Parameter	Type	Description
code	integer	HTTP status code (e.g., 200 for success)
message	string	Status message (e.g., “success”)
data.id	string	Unique identifier for the prediction, Task Id
data.model	string	Model ID used for the prediction
data.outputs	array	Array of URLs to the generated content (empty when status is not `completed`)
data.urls	object	Object containing related API endpoints
data.urls.get	string	URL to retrieve the prediction result
data.has_nsfw_contents	array	Array of boolean values indicating NSFW detection for each output
data.status	string	Status of the task: `created`, `processing`, `completed`, or `failed`
data.created_at	string	ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.error	string	Error message (empty if no error occurred)
data.timings	object	Object containing timing details
data.timings.inference	integer	Inference time in milliseconds

Result Request Parameters

Parameter	Type	Required	Default	Description
id	string	Yes	-	Task ID

Result Response Parameters

Parameter	Type	Description
code	integer	HTTP status code (e.g., 200 for success)
message	string	Status message (e.g., “success”)
data	object	The prediction data object containing all details
data.id	string	Unique identifier for the prediction, the ID of the prediction to get
data.model	string	Model ID used for the prediction
data.outputs	object	Array of transcription results (empty when status is not completed).
data.urls	object	Object containing related API endpoints
data.urls.get	string	URL to retrieve the prediction result
data.status	string	Status of the task: `created`, `processing`, `completed`, or `failed`
data.created_at	string	ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.error	string	Error message (empty if no error occurred)
data.timings	object	Object containing timing details
data.timings.inference	integer	Inference time in milliseconds

Heartmula Generate Music Hidream E1 Full

Heartmula Transcribe Lyrics

Playground

Features

HeartMuLa Transcribe Lyrics

Why Choose This?

Parameters

How to Use

Pricing

Best Use Cases

Pro Tips

Notes

Related Models

Authentication

API Endpoints

Submit Task & Query Result

Parameters

Task Submission Parameters

Request Parameters

Response Parameters

Result Request Parameters

Result Response Parameters