Image Text Remover
Playground
Try it on WavespeedAI!AI Image Text Remover that erases on-image text cleanly and reconstructs the background with stunning visual accuracy. Automatically removes captions, labels, subtitles, watermarks, annotations, or embedded UI text while preserving texture, lighting, and scene integrity. Ready-to-use REST inference API, fast performance, no coldstarts, and affordable pricing.
Features
WaveSpeedAI Image Text Remover
Remove unwanted text from any image with pixel-level reconstruction. WaveSpeedAI Image Text Remover automatically finds text regions and rebuilds the underlying background so the edit looks natural — no blurry blocks, no jagged edges.
What is it?
WaveSpeedAI Image Text Remover is an end-to-end image cleanup tool:
- Input: an image containing text (captions, labels, subtitles, UI text, stickers, overlays)
- Output: a clean image with the text removed and background filled in
- Workflow: text detection → intelligent mask generation → background inpainting
It’s designed for cleaning social posts, product photos, posters, screenshots, UI/UX mockups, marketing materials, and any image you want to reuse without visible text.
Why it stands out
-
High-fidelity inpainting Reconstructs texture, lighting, depth, and structure to blend the edited area into the original scene with minimal artifacts.
-
Fully automatic workflow Just upload your image and run the model—no need for manual masking, brushing, or Photoshop skills.
-
Text-focused cleanup Works especially well on captions, subtitles, labels, annotations, and embedded UI strings; also effective on many watermarks and overlays.
-
Flexible output formats Choose JPEG, PNG, or WEBP so the result fits directly into your web, app, or design pipeline.
-
Batch and API friendly Suitable for e-commerce asset cleanup, marketing automation, dataset preparation, and production-scale image processing.
Limits and performance
- Input formats: JPEG, PNG, WEBP
- Output formats: JPEG / PNG / WEBP (selectable)
- Typical processing time: a few seconds per image, depending on resolution and queue load
- Best cases: clearly readable text that is not extremely stylized or heavily blended into complex textures
Pricing
- Each processed image costs $0.15.
How to use
- Upload an image or paste a publicly accessible image URL.
- Choose your preferred output_format (JPEG, PNG, or WEBP).
- Click run to start automatic text detection and removal.
- Download your cleaned, text-free image from the result panel or dashboard.
Pro tips for best quality
- Use the highest-resolution version of the image you have; more pixels mean better inpainting.
- Flat or simple backgrounds are easiest to reconstruct; complex patterns may occasionally need light manual touch-up.
- For semi-transparent watermarks or overlays, higher resolution and good contrast between text and background improve results.
Notes
- If you provide an image URL instead of uploading, make sure it is publicly accessible; a valid image will show a preview before you run the job.
- Extremely stylized, warped, or deeply textured text may not be removed perfectly in one pass.
Related tools
- WaveSpeedAI Image Translator: Translate on-image text into 30+ languages while preserving layout.
- WaveSpeedAI Image Eraser: General-purpose object removal and background reconstruction for non-text elements.
Authentication
For authentication details, please refer to the Authentication Guide.
API Endpoints
Submit Task & Query Result
# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/wavespeed-ai/image-text-remover" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
"output_format": "jpeg",
"enable_base64_output": false,
"enable_sync_mode": false
}'
# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"
Parameters
Task Submission Parameters
Request Parameters
| Parameter | Type | Required | Default | Range | Description |
|---|---|---|---|---|---|
| image | string | Yes | - | The URL of the image to generate an image from. | |
| output_format | string | No | jpeg | jpeg, png, webp | The format of the output image. |
| enable_base64_output | boolean | No | false | - | If enabled, the output will be encoded into a BASE64 string instead of a URL. This property is only available through the API. |
| enable_sync_mode | boolean | No | false | - | If set to true, the function will wait for the result to be generated and uploaded before returning the response. It allows you to get the result directly in the response. This property is only available through the API. |
Response Parameters
| Parameter | Type | Description |
|---|---|---|
| code | integer | HTTP status code (e.g., 200 for success) |
| message | string | Status message (e.g., “success”) |
| data.id | string | Unique identifier for the prediction, Task Id |
| data.model | string | Model ID used for the prediction |
| data.outputs | array | Array of URLs to the generated content (empty when status is not completed) |
| data.urls | object | Object containing related API endpoints |
| data.urls.get | string | URL to retrieve the prediction result |
| data.has_nsfw_contents | array | Array of boolean values indicating NSFW detection for each output |
| data.status | string | Status of the task: created, processing, completed, or failed |
| data.created_at | string | ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”) |
| data.error | string | Error message (empty if no error occurred) |
| data.timings | object | Object containing timing details |
| data.timings.inference | integer | Inference time in milliseconds |