Heartmula Transcribe Lyrics

HeartMuLa Transcribe extracts lyrics from audio files and supports multilingual transcription. It is served as a ready-to-use REST inference API with no cold starts and affordable pricing.

Features

HeartMuLa Transcribe Lyrics

HeartMuLa Transcribe Lyrics is an AI-powered audio transcription model that extracts lyrics from music tracks. Upload a song and the model automatically recognizes and transcribes the vocal content into text.


Why Choose This?

  • Automatic lyrics extraction: Transcribes vocals from any music track into readable text.

  • Simple one-input workflow: Just upload an audio file; no additional configuration is needed.

  • Fast processing: Get transcribed lyrics in seconds.

  • Versatile audio support: Works with various audio formats and music styles.


Parameters

| Parameter | Required | Description |
| --- | --- | --- |
| audio | Yes | Music audio file to transcribe (URL or upload) |

How to Use

  1. Upload your audio — provide the music track you want to transcribe.
  2. Run — submit and receive the transcribed lyrics.

Pricing

| Output | Cost |
| --- | --- |
| Per transcription | $0.05 |
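
As a quick worked example of the flat per-transcription price, the sketch below estimates the cost of a batch. The helper name `estimate_cost` is hypothetical; only the $0.05 figure comes from the pricing above, and the sketch assumes every submitted track is billed once.

```python
from decimal import Decimal

# Price per transcription in USD, taken from the pricing section.
PRICE_PER_TRANSCRIPTION = Decimal("0.05")


def estimate_cost(num_tracks: int) -> Decimal:
    """Return the estimated cost in USD for transcribing num_tracks files."""
    if num_tracks < 0:
        raise ValueError("num_tracks must be non-negative")
    return PRICE_PER_TRANSCRIPTION * num_tracks


print(estimate_cost(200))  # prints 10.00
```

Using `Decimal` rather than `float` keeps the currency arithmetic exact.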

Best Use Cases

  • Lyrics Transcription — Extract lyrics from songs for reference or annotation.
  • Music Production — Transcribe vocal recordings for editing and review.
  • Content Creation — Get text versions of song lyrics for subtitles or captions.
  • Music Analysis — Extract lyrics for study, review, or documentation.
  • Karaoke Preparation — Generate lyrics text from audio tracks.

Pro Tips

  • Use high-quality audio files with clear vocals for best transcription accuracy.
  • Works best when vocals are prominent and not heavily mixed with instrumentals.
  • Combine with HeartMuLa Generate Music to create songs, then verify the lyrics.

Notes

  • Only the audio parameter is required.
  • Output is a JSON object containing the transcribed lyrics text.
  • Ensure uploaded audio URLs are publicly accessible.
  • Transcription accuracy depends on vocal clarity and audio quality.

Authentication

For authentication details, please refer to the Authentication Guide.

API Endpoints

Submit Task & Query Result


# Submit the task (the audio URL is a placeholder; use a publicly accessible file)
curl --location --request POST "https://api.wavespeed.ai/api/v3/wavespeed-ai/heartmula/transcribe-lyrics" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{"audio": "https://example.com/song.mp3"}'

# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"
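
The two curl calls above can be mirrored in Python. The following is a minimal sketch using only the standard library. The function names are introduced here for illustration and are not part of any official client, and the sketch assumes the submission body is a JSON object carrying the required `audio` field.

```python
import json
import urllib.request

API_BASE = "https://api.wavespeed.ai/api/v3"
SUBMIT_URL = f"{API_BASE}/wavespeed-ai/heartmula/transcribe-lyrics"


def build_submit_request(audio_url: str, api_key: str) -> urllib.request.Request:
    """Build the POST request that submits a transcription task."""
    body = json.dumps({"audio": audio_url}).encode("utf-8")
    return urllib.request.Request(
        SUBMIT_URL,
        data=body,
        method="POST",
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
    )


def build_result_request(request_id: str, api_key: str) -> urllib.request.Request:
    """Build the GET request that fetches the result for a task ID."""
    return urllib.request.Request(
        f"{API_BASE}/predictions/{request_id}/result",
        method="GET",
        headers={"Authorization": f"Bearer {api_key}"},
    )


def send(request: urllib.request.Request) -> dict:
    """Send a request and decode the JSON response body."""
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)
```

A typical call would be `send(build_submit_request(audio_url, api_key))`, followed by `send(build_result_request(task_id, api_key))` with the `data.id` value from the first response. Keeping request construction separate from sending makes the headers and body easy to inspect without touching the network.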

Parameters

Task Submission Parameters

Request Parameters

| Parameter | Type | Required | Default | Range | Description |
| --- | --- | --- | --- | --- | --- |
| audio | string | Yes | - | - | URL to the audio file to transcribe lyrics from. |
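
Since `audio` is the only request field and is expected to be a URL, a client can sanity-check it before submitting. The sketch below uses a hypothetical helper, `build_payload`, which is not part of the API. It assumes only `http` and `https` URLs are acceptable, and it cannot confirm that the URL is actually publicly accessible.

```python
import json
from urllib.parse import urlparse


def build_payload(audio_url: str) -> str:
    """Return the JSON request body, rejecting values that are not http(s) URLs."""
    parsed = urlparse(audio_url)
    # Assumption: the service fetches the file over HTTP(S), so other schemes
    # and bare file names are rejected up front.
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"audio must be an http(s) URL, got {audio_url!r}")
    return json.dumps({"audio": audio_url})
```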

Response Parameters

| Parameter | Type | Description |
| --- | --- | --- |
| code | integer | HTTP status code (e.g., 200 for success) |
| message | string | Status message (e.g., “success”) |
| data.id | string | Unique identifier for the prediction (the task ID) |
| data.model | string | Model ID used for the prediction |
| data.outputs | array | Array of URLs to the generated content (empty when status is not completed) |
| data.urls | object | Object containing related API endpoints |
| data.urls.get | string | URL to retrieve the prediction result |
| data.has_nsfw_contents | array | Array of boolean values indicating NSFW detection for each output |
| data.status | string | Status of the task: created, processing, completed, or failed |
| data.created_at | string | ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”) |
| data.error | string | Error message (empty if no error occurred) |
| data.timings | object | Object containing timing details |
| data.timings.inference | integer | Inference time in milliseconds |
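
To show how these fields fit together, the sketch below pulls the task ID and polling URL out of a submission response. The `SubmittedTask` type and `parse_submission` function are hypothetical names, and the sample dictionary uses placeholder values shaped like the documented fields, not real API output.

```python
from typing import NamedTuple


class SubmittedTask(NamedTuple):
    task_id: str
    status: str
    result_url: str


def parse_submission(response: dict) -> SubmittedTask:
    """Extract the fields needed for polling from a submission response."""
    if response.get("code") != 200:
        raise RuntimeError(f"submission failed: {response.get('message')}")
    data = response["data"]
    return SubmittedTask(
        task_id=data["id"],
        status=data["status"],
        result_url=data["urls"]["get"],
    )


# Placeholder response; every value here is illustrative.
sample = {
    "code": 200,
    "message": "success",
    "data": {
        "id": "task-123",
        "outputs": [],
        "urls": {"get": "https://api.wavespeed.ai/api/v3/predictions/task-123/result"},
        "status": "created",
        "error": "",
    },
}
task = parse_submission(sample)
print(task.task_id, task.status)  # prints: task-123 created
```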

Result Request Parameters

| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| id | string | Yes | - | Task ID |

Result Response Parameters

| Parameter | Type | Description |
| --- | --- | --- |
| code | integer | HTTP status code (e.g., 200 for success) |
| message | string | Status message (e.g., “success”) |
| data | object | The prediction data object containing all details |
| data.id | string | Unique identifier of the prediction being retrieved |
| data.model | string | Model ID used for the prediction |
| data.outputs | object | Array of transcription results (empty when status is not completed). |
| data.urls | object | Object containing related API endpoints |
| data.urls.get | string | URL to retrieve the prediction result |
| data.status | string | Status of the task: created, processing, completed, or failed |
| data.created_at | string | ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”) |
| data.error | string | Error message (empty if no error occurred) |
| data.timings | object | Object containing timing details |
| data.timings.inference | integer | Inference time in milliseconds |
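
Because a task passes through `created` and `processing` before reaching `completed` or `failed`, a client usually polls the result endpoint until a final status appears. Below is a minimal sketch of such a loop. The `fetch` callable, the polling interval, and the timeout are all assumptions made for illustration; the documentation does not prescribe a polling strategy.

```python
import time
from typing import Callable


def wait_for_result(
    fetch: Callable[[], dict],
    interval: float = 1.0,
    timeout: float = 60.0,
    sleep: Callable[[float], None] = time.sleep,
) -> object:
    """Poll fetch() until the task completes, fails, or the timeout passes.

    fetch is any zero-argument callable that returns the decoded JSON of the
    result endpoint, for example a wrapper around an HTTP GET request.
    """
    deadline = time.monotonic() + timeout
    while True:
        data = fetch()["data"]
        status = data["status"]
        if status == "completed":
            return data["outputs"]
        if status == "failed":
            raise RuntimeError(data.get("error") or "task failed")
        # Still created or processing: give up once the deadline has passed.
        if time.monotonic() >= deadline:
            raise TimeoutError(f"task still {status!r} after {timeout} seconds")
        sleep(interval)
```

Passing `fetch` and `sleep` in as arguments keeps the loop independent of any particular HTTP library and lets it be exercised with canned responses.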
© 2025 WaveSpeedAI. All rights reserved.