Vidu Contest
WaveSpeed.ai
首页/探索/Flux Image Tools/wavespeed-ai/flux-kontext-dev/multi-ultra-fast
image-to-image

image-to-image

Flux Kontext Multi Ultra-Fast Experimental Dev Model

wavespeed-ai/flux-kontext-dev/multi-ultra-fast

Experimental FLUX.1 Kontext [dev] - Multi-Ultra-Fast endpoint with native multi-image handling for batch and multi-view inputs. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Input

Hint: You can drag and drop a file or click to upload

preview

Hint: You can drag and drop a file or click to upload

preview
width
height
1280 × 720 px
Range: 256 - 1536
If enabled, the output will be encoded into a BASE64 string instead of a URL. This property is only available through the API.
If set to true, the function will wait for the result to be generated and uploaded before returning the response. It allows you to get the result directly in the response. This property is only available through the API.

Idle

The boy put on sunglasses.

您的请求将花费 $0.025 每次运行。

使用 $1 您可以运行此模型大约 40 次。

还有一件事:

示例查看全部

The boy put on sunglasses.
The dog is playing with a ball
This woman is eating cake.
Let the woman drink the juice.
Let the woman take the bag.
Luffy stands in front of the house.
The boy is riding a horse.
Little girl holding a cabbage doll.
The man is holding a flower.
The girl is sitting on a chair.

README

FLUX Kontext Dev Multi Ultra Fast — wavespeed-ai/flux-kontext-dev/multi-ultra-fast

FLUX.1 Kontext Dev Multi Ultra Fast is a low-latency, multi-image editing model designed for fast, instruction-based image editing with richer context. Provide up to 4 reference images plus a text instruction, and the model performs controlled edits while using the references to improve consistency across identity, style, and scene—optimized for rapid iteration and production workflows.

Key capabilities

  • Ultra-fast multi-image contextual editing with up to 4 reference images
  • Stronger consistency by grounding edits in multiple references (identity, outfit, style, lighting, background)
  • Supports both local edits and global transformations
  • Ideal for iterative workflows: quick refinements with minimal drift

Typical use cases

  • Multi-reference character consistency for portraits and creatives
  • Product/branding edits using multiple references (logo + label + lighting + packaging)
  • Background swaps with better subject matching (lighting, shadows, perspective)
  • Text edits that must follow reference typography and layout
  • Rapid A/B iteration for marketing assets and creative variations

Pricing

$0.025 per generation.

If you generate multiple outputs in one run, total cost = num_images × $0.025 Example: num_images = 4 → $0.10

Inputs and outputs

Input:

  • Up to 4 reference images (upload or public URLs)
  • One edit instruction (prompt)

Output:

  • One or more edited images (controlled by num_images)

Parameters

  • prompt: Edit instruction describing what to change and what to preserve
  • images: Up to 4 reference images
  • width / height: Output resolution
  • num_inference_steps: More steps can improve fidelity but increases latency
  • guidance_scale: Higher values follow the prompt more strongly; too high may over-edit
  • num_images: Number of variations generated per run
  • seed: Fixed value for reproducibility; -1 for random
  • output_format: jpeg or png
  • enable_base64_output: Return BASE64 instead of a URL (API only)
  • enable_sync_mode: Wait for generation and return results directly (API only)

Prompting guide

Assign clear roles to references to avoid conflicts:

Template: Use reference 1 for [identity]. Use reference 2 for [outfit/material]. Use reference 3 for [style/lighting]. Use reference 4 for [background/scene]. Keep [must-preserve]. Change [edit request]. Match [lighting/shadows/perspective].

Example prompts

  • Use reference 1 for face identity and reference 2 for hairstyle. Keep the pose from the base image. Replace the background with a clean studio setup and match shadow direction.
  • Use reference 1 for the product shape and reference 2 for the label design. Replace the label text with “WaveSpeedAI”, keeping font style, perspective, and print texture consistent.
  • Use reference 3 as the style guide (soft illustration look) and reference 4 for lighting mood (sunset). Preserve identity from reference 1 and keep composition unchanged.

Best practices

  • Use high-quality references with clear subjects and minimal occlusion.
  • Give each reference a purpose (identity vs. style vs. scene) for more reliable results.
  • Iterate with one change per run for tighter control.
  • Fix seed for stable comparisons across prompt variants.