WaveSpeed.ai
ホーム/探索/Flux Image Tools/wavespeed-ai/flux-kontext-pro/multi
image-to-image

image-to-image

Flux Kontext Pro Multi

wavespeed-ai/flux-kontext-pro/multi

Experimental FLUX.1 Kontext [pro] with multi-image handling to combine context from multiple images for richer output. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Input

Hint: You can drag and drop a file or click to upload

preview

Hint: You can drag and drop a file or click to upload

preview
If set to true, the function will wait for the result to be generated and uploaded before returning the response. It allows you to get the result directly in the response. This property is only available through the API.

Idle

The boy is holding a gun in his hand.

このリクエストには1回あたりで$0.04の費用がかかります。

$1でおよそ25回実行できます。

もうひとつお知らせ::

サンプルすべて表示

The boy is holding a gun in his hand.
A girl with a hat and sunglasses is in the garden.
The clown stands in front of the house.
Little girl holding flowers in her hands.
The girl is wearing a hippocampus brooch.
The boy put on sunglasses.
The girl puts on sunglasses.
The horse is carrying fruits.
The girl happily hugs the doll.
Santa Claus is standing in front of the tree.
A boy wears a chain around his neck.

README

FLUX Kontext Pro Multi — wavespeed-ai/flux-kontext-pro/multi

FLUX Kontext Pro Multi is a fast, reliable multi-image model for context-guided generation and editing. Provide a text prompt plus up to 5 reference images, and the model uses them to improve identity consistency, style alignment, and scene coherence—ideal for practical production workflows that need strong control at a lower cost.

Key capabilities

  • Multi-image contextual generation with up to 5 reference images
  • Strong identity and style consistency by grounding outputs in references
  • Reliable composition control for everyday creative and marketing use
  • Efficient for iterative workflows and rapid A/B exploration

Typical use cases

  • Character consistency using multiple portraits, outfits, or angles
  • Product and branding consistency (packaging + logo + lighting references)
  • Style steering with multiple exemplars (art style + texture + mood)
  • Scene creation guided by reference frames
  • Marketing creatives that need predictable, repeatable visual direction

Pricing

$0.04 per image.

Total cost = num_images × $0.04 Example: num_images = 4 → $0.16

Inputs and outputs

Input:

  • prompt (required): Instruction describing what to generate and how to use the references
  • images (required): Up to 5 reference images (upload or public URLs)

Output:

  • One or more generated images (based on your num_images setting, if available in your interface)

Parameters

  • prompt (required): The instruction for generation or editing
  • images (required): Up to 5 reference images
  • seed: Fixed value for reproducibility; leave empty/random for variation
  • guidance_scale: Prompt adherence strength (higher = stricter; too high may over-constrain)
  • aspect_ratio: Output aspect ratio (e.g., 16:9, 1:1, 9:16)

Prompting guide (multi-reference)

Assign roles to references to reduce ambiguity:

Template: Use image 1 for identity. Use image 2 for outfit/material. Use image 3 for style. Use image 4 for lighting. Use image 5 for background/scene. Generate the shot described below. Keep the key traits unchanged.

Example prompts

  • Use image 1 for the person’s identity and image 2 for outfit details. Use image 3 for visual style. Create a 16:9 cinematic medium shot in a rainy city street at night. Match lighting and reflections. Keep face structure and expression consistent.
  • Use image 1 for the product shape and image 2 for label layout. Use image 3 for lighting mood. Generate a clean studio product shot with realistic shadows and crisp edges. Keep branding placement consistent.
  • Use images 1–2 as identity references from different angles. Create a neutral-background portrait with softbox lighting and natural skin texture. Keep proportions realistic and avoid exaggerated stylization.

Best practices

  • Use high-quality references (sharp, well-lit, minimal occlusion)
  • Avoid conflicting references unless you explicitly state which reference dominates (identity vs. style vs. scene)
  • Keep guidance_scale moderate and let references do most of the steering
  • Fix seed when you need stable iteration and consistent comparisons
  • Choose aspect_ratio intentionally to avoid awkward cropping or stretched composition