Phota Edit

Phota Edit

Playground

Try it on WavespeedAI!

Phota Edit transforms existing images using natural language instructions. Supports up to 10 reference images, 1K and 4K resolutions, and batch output up to 4 images. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Features

Phota Edit

Phota Edit is an AI-powered image editing model that transforms photos using natural language instructions. Describe the change you want — swap clothing, alter backgrounds, adjust styles, add or remove objects — and the model applies precise, context-aware edits while preserving the rest of the image. Supports multiple input images, batch output, and flexible aspect ratio control.


Why Choose This?

  • Natural-language editing Describe your edit in plain text — no masks, no manual selections, no technical knowledge required.

  • Multi-image input Upload multiple reference images to provide richer visual context for complex edits.

  • 4K output support Generate high-resolution edited results for print, production, or archival use.

  • Flexible aspect ratio control Output in auto, 1:1, 16:9, 4:3, 3:4, or 9:16 to match your target platform.

  • Multiple output formats Export in JPEG, PNG, or WebP for any downstream workflow.

  • Batch output Generate multiple edited variations in a single run using the num_images parameter.


Parameters

ParameterRequiredDescription
promptYesText description of the desired edit.
imagesNoOne or more source images to edit (URL or file upload). Click Add Item for more.
resolutionNoOutput resolution: 1K (default) or 4K.
num_imagesNoNumber of edited output variations to generate per run. Default: 1.
aspect_ratioNoOutput aspect ratio: auto (default), 1:1, 16:9, 4:3, 3:4, 9:16.
output_formatNoOutput file format: jpeg (default), png, or webp.

How to Use

  1. Write your prompt — describe exactly what should change in the image (e.g., “Change the man’s clothes to beach casual wear and put on sunglasses.”).
  2. Upload your image(s) — provide one or more source images via URL or drag-and-drop. Click Add Item to add more.
  3. Select resolution — 1K for standard output, 4K for high-resolution results.
  4. Set num_images (optional) — generate multiple variations in one run.
  5. Choose aspect ratio — use auto to preserve the source ratio, or select a specific format.
  6. Choose output format — jpeg, png, or webp based on your delivery needs.
  7. Submit — generate and download your edited image.

Pricing

ResolutionCost per Image
1K$0.09
4K$0.18

Billing Rules

  • 1K: $0.09 per image
  • 4K: $0.18 per image (2× base price)
  • Total cost = cost per image × num_images

Best Use Cases

  • Fashion & Apparel — Swap clothing, accessories, or outfit styles on model photos without reshooting.
  • E-commerce — Edit product images to showcase different colors, settings, or variants from a single source.
  • Marketing & Advertising — Update visual assets quickly — change backgrounds, props, or styling to match campaign needs.
  • Portrait Retouching — Make targeted appearance changes while preserving the overall look and feel.
  • Creative Concepting — Rapidly iterate on visual ideas and styling directions for client review.

Pro Tips

  • Be specific and descriptive in your prompt — the more detail you provide, the more accurate the edit.
  • Upload multiple images when you want the model to reference additional context, such as a target style or specific elements.
  • Use 4K output for final production assets and 1K for rapid iteration and testing.
  • Use PNG output for lossless results when editing images with text, graphics, or sharp edges.
  • Enable sync mode in API workflows where you need the result returned directly without polling.

Notes

  • Only prompt is required; all other parameters are optional.
  • Ensure image URLs are publicly accessible if using links rather than direct uploads.
  • Please ensure your content complies with WaveSpeed AI’s usage policies.

  • Phota Enhance — Restore and upscale images without editing using AI-powered detail reconstruction.

Authentication

For authentication details, please refer to the Authentication Guide.

API Endpoints

Submit Task & Query Result


# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/wavespeed-ai/phota/edit" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
    "resolution": "1K",
    "num_images": 1,
    "aspect_ratio": "auto",
    "output_format": "jpeg",
    "enable_sync_mode": false,
    "enable_base64_output": false
}'

# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"

Parameters

Task Submission Parameters

Request Parameters

ParameterTypeRequiredDefaultRangeDescription
promptstringYes-Text description of the desired image.
imagesarrayNo[]1 ~ 10 itemsList of URLs of input images for editing. The maximum number of images is 10.
resolutionstringNo1K1K, 4KResolution of the generated image.
num_imagesintegerNo11 ~ 4Number of images to generate.
aspect_ratiostringNoautoauto, 1:1, 16:9, 4:3, 3:4, 9:16Aspect ratio of the generated image.
output_formatstringNojpegjpeg, png, webpThe format of the generated image.
enable_sync_modebooleanNofalse-If set to true, the function will wait for the result to be generated and uploaded before returning the response. It allows you to get the result directly in the response. This property is only available through the API.
enable_base64_outputbooleanNofalse-If enabled, the output will be encoded into a BASE64 string instead of a URL. This property is only available through the API.

Response Parameters

ParameterTypeDescription
codeintegerHTTP status code (e.g., 200 for success)
messagestringStatus message (e.g., “success”)
data.idstringUnique identifier for the prediction, Task Id
data.modelstringModel ID used for the prediction
data.outputsarrayArray of URLs to the generated content (empty when status is not completed)
data.urlsobjectObject containing related API endpoints
data.urls.getstringURL to retrieve the prediction result
data.has_nsfw_contentsarrayArray of boolean values indicating NSFW detection for each output
data.statusstringStatus of the task: created, processing, completed, or failed
data.created_atstringISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.errorstringError message (empty if no error occurred)
data.timingsobjectObject containing timing details
data.timings.inferenceintegerInference time in milliseconds

Result Request Parameters

ParameterTypeRequiredDefaultDescription
idstringYes-Task ID

Result Response Parameters

ParameterTypeDescription
codeintegerHTTP status code (e.g., 200 for success)
messagestringStatus message (e.g., “success”)
dataobjectThe prediction data object containing all details
data.idstringUnique identifier for the prediction, the ID of the prediction to get
data.modelstringModel ID used for the prediction
data.outputsstringArray of URLs to the generated content (empty when status is not completed).
data.urlsobjectObject containing related API endpoints
data.urls.getstringURL to retrieve the prediction result
data.statusstringStatus of the task: created, processing, completed, or failed
data.created_atstringISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.errorstringError message (empty if no error occurred)
data.timingsobjectObject containing timing details
data.timings.inferenceintegerInference time in milliseconds
© 2025 WaveSpeedAI. All rights reserved.