WaveSpeedAI APIWavespeed AIImage Text Remover

Image Text Remover

Image Text Remover

Playground

Try it on WavespeedAI!

AI Image Text Remover that erases on-image text cleanly and reconstructs the background with stunning visual accuracy. Automatically removes captions, labels, subtitles, watermarks, annotations, or embedded UI text while preserving texture, lighting, and scene integrity. Ready-to-use REST inference API, fast performance, no coldstarts, and affordable pricing.

Features

WaveSpeedAI Image Text Remover

Remove unwanted text from any image with pixel-level reconstruction. WaveSpeedAI Image Text Remover automatically finds text regions and rebuilds the underlying background so the edit looks natural — no blurry blocks, no jagged edges.


What is it?

WaveSpeedAI Image Text Remover is an end-to-end image cleanup tool:

  • Input: an image containing text (captions, labels, subtitles, UI text, stickers, overlays)
  • Output: a clean image with the text removed and background filled in
  • Workflow: text detection → intelligent mask generation → background inpainting

It’s designed for cleaning social posts, product photos, posters, screenshots, UI/UX mockups, marketing materials, and any image you want to reuse without visible text.


Why it stands out

  • High-fidelity inpainting Reconstructs texture, lighting, depth, and structure to blend the edited area into the original scene with minimal artifacts.

  • Fully automatic workflow Just upload your image and run the model—no need for manual masking, brushing, or Photoshop skills.

  • Text-focused cleanup Works especially well on captions, subtitles, labels, annotations, and embedded UI strings; also effective on many watermarks and overlays.

  • Flexible output formats Choose JPEG, PNG, or WEBP so the result fits directly into your web, app, or design pipeline.

  • Batch and API friendly Suitable for e-commerce asset cleanup, marketing automation, dataset preparation, and production-scale image processing.


Limits and performance

  • Input formats: JPEG, PNG, WEBP
  • Output formats: JPEG / PNG / WEBP (selectable)
  • Typical processing time: a few seconds per image, depending on resolution and queue load
  • Best cases: clearly readable text that is not extremely stylized or heavily blended into complex textures

Pricing

  • Each processed image costs $0.15.

How to use

  1. Upload an image or paste a publicly accessible image URL.
  2. Choose your preferred output_format (JPEG, PNG, or WEBP).
  3. Click run to start automatic text detection and removal.
  4. Download your cleaned, text-free image from the result panel or dashboard.

Pro tips for best quality

  • Use the highest-resolution version of the image you have; more pixels mean better inpainting.
  • Flat or simple backgrounds are easiest to reconstruct; complex patterns may occasionally need light manual touch-up.
  • For semi-transparent watermarks or overlays, higher resolution and good contrast between text and background improve results.

Notes

  • If you provide an image URL instead of uploading, make sure it is publicly accessible; a valid image will show a preview before you run the job.
  • Extremely stylized, warped, or deeply textured text may not be removed perfectly in one pass.

Authentication

For authentication details, please refer to the Authentication Guide.

API Endpoints

Submit Task & Query Result


# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/wavespeed-ai/image-text-remover" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
    "output_format": "jpeg",
    "enable_base64_output": false,
    "enable_sync_mode": false
}'

# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"

Parameters

Task Submission Parameters

Request Parameters

ParameterTypeRequiredDefaultRangeDescription
imagestringYes-The URL of the image to generate an image from.
output_formatstringNojpegjpeg, png, webpThe format of the output image.
enable_base64_outputbooleanNofalse-If enabled, the output will be encoded into a BASE64 string instead of a URL. This property is only available through the API.
enable_sync_modebooleanNofalse-If set to true, the function will wait for the result to be generated and uploaded before returning the response. It allows you to get the result directly in the response. This property is only available through the API.

Response Parameters

ParameterTypeDescription
codeintegerHTTP status code (e.g., 200 for success)
messagestringStatus message (e.g., “success”)
data.idstringUnique identifier for the prediction, Task Id
data.modelstringModel ID used for the prediction
data.outputsarrayArray of URLs to the generated content (empty when status is not completed)
data.urlsobjectObject containing related API endpoints
data.urls.getstringURL to retrieve the prediction result
data.has_nsfw_contentsarrayArray of boolean values indicating NSFW detection for each output
data.statusstringStatus of the task: created, processing, completed, or failed
data.created_atstringISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.errorstringError message (empty if no error occurred)
data.timingsobjectObject containing timing details
data.timings.inferenceintegerInference time in milliseconds

Result Request Parameters

© 2025 WaveSpeedAI. All rights reserved.