Vidu Q3 與 Q3 Pro 模型 5 折 · 僅限 WaveSpeedAI | 5月20日 – 6月2日
首頁/探索/Higgsfield/Soul/Image To Image

Soul Image to Image

higgsfield /

SOUL is a realistic image-to-image engine for sophisticated visuals that, with Soul ID, preserves character consistency across scenes. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-image
輸入

拖放檔案或點擊上傳

preview

就緒

Harsh flash, timestamp corner, and raw textures. Feels like 2007 captured forever

$0.09每次運行·~11 / $1

下一步:

示例查看全部

Harsh flash, timestamp corner, and raw textures. Feels like 2007 captured forever

Harsh flash, timestamp corner, and raw textures. Feels like 2007 captured forever

Echoed figures, mirrored forms, or repeated poses-surreal symmetry that bends perception

Echoed figures, mirrored forms, or repeated poses-surreal symmetry that bends perception

Exaggerated depth and playful proportions. Face or object up close, background stretched to infinity

Exaggerated depth and playful proportions. Face or object up close, background stretched to infinity

subway

subway

Make the person with Carema become a giant person

Make the person with Carema become a giant person

相關模型

README

Higgsfield SOUL Image-to-Image

Higgsfield SOUL Image-to-Image is a high-quality image transformation model built for realism, tasteful aesthetics, and strong character consistency. It is designed for reference-driven image editing workflows where you want to restyle, reframe, or evolve a subject while preserving identity, structure, and visual coherence.

Why Choose This?

  • Elegant realism Produce believable lighting, natural skin and hair, and clean material rendering without an over-processed look.

  • Strong character consistency Keep the same identity across different scenes, poses, outfits, and lighting conditions.

  • Faithful image-to-image editing Preserve core composition and structure while changing styling, color, mood, wardrobe, or background.

  • Art-direction friendly Respond well to cinematic language such as lens cues, depth of field, rim light, palette direction, and grading intent.

  • Wide stylistic range Move between photoreal, editorial, painterly, or graphic looks without heavy prompt engineering.

Parameters

ParameterRequiredDescription
imageYesReference image used as the source for the transformation.
promptYesText instruction describing the desired transformation, style, setting, mood, or visual direction.
soul_idNoOptional Soul ID used to preserve identity consistency across multiple generations.
qualityNoOutput quality tier. Supported values: medium, high.
aspect_ratioNoOutput aspect ratio for the generated image.
strengthNoControls how closely the result follows the source image versus allowing freer restyling.

How to Use

  1. Upload a reference image — use a clean, well-lit image with minimal occlusion.
  2. Add a Soul ID (optional) — reuse the same ID if you want stronger identity continuity across multiple shots.
  3. Write your prompt — describe the subject, setting, outfit, props, lighting, lens feel, palette, and mood.
  4. Choose quality — use medium for lower cost or high for stronger final quality.
  5. Set aspect ratio — choose the composition format that fits your delivery needs.
  6. Adjust strength — use lower values for source-faithful edits and higher values for looser restyling.
  7. Submit — generate the image and iterate as needed.

Example Prompt

Turn this portrait into a cinematic editorial fashion shot at golden hour, soft rim light, clean skin texture, natural color grading, luxury wardrobe styling, realistic background depth

Pricing

Pricing depends on the selected quality tier.

QualityPrice per Image
Medium$0.09
High$0.19

Billing Rules

  • medium costs $0.09 per image
  • high costs $0.19 per image
  • Pricing depends only on the selected quality tier

Best Use Cases

  • Character lookbooks — Maintain one identity across many styled outputs.
  • Narrative keyframes — Generate consistent visual storytelling frames from a reference.
  • Brand mascots and recurring talent — Keep a recognizable subject across multiple campaigns.
  • Fashion and editorial sets — Explore wardrobe, lighting, and scene variations with consistency.
  • Advertising creatives — Produce campaign assets with stable character identity.
  • Storyboards and pre-visualization — Quickly develop coherent visual directions for production planning.

Pro Tips

  • Use a clean, front-facing, well-lit reference when identity preservation matters.
  • Keep prompts specific about what should change and what should remain consistent.
  • Reuse the same soul_id when building a sequence or visual set around the same subject.
  • Use lower strength when structure fidelity matters more than style variation.
  • Use high quality for final selects and medium when exploring directions more quickly.

Notes

  • This model is designed for image-to-image workflows with strong identity and style consistency.
  • Reusing the same soul_id can improve continuity across generations.
  • Better source images generally produce more stable and higher-quality results.
  • Pricing is fixed by quality tier.

Related Models

  • Other Higgsfield image generation workflows — Useful when you need text-to-image generation instead of reference-based editing.
  • Character-consistent generation workflows — Useful when maintaining identity across multiple scenes is the main priority.
無障礙:本網站使用的 AI 模型由第三方提供。

Soul Image To Image API — Quick start

Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/higgsfield/soul/image-to-image with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status flips to completed, then read the output URL from data.outputs[0]. Examples for Soul Image To Image below.

HTTP example
# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/higgsfield/soul/image-to-image" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY" \
  -d '{
    "prompt": "A cinematic shot of a city at sunset, soft golden light",
    "image": "https://example.com/your-input.jpg",
    "size": "1152*2048",
    "style": "Creatures",
    "strength": 1,
    "quality": "medium",
    "seed": 0
}'

# Response includes a prediction id. Poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY"

# When status is "completed", read the output from data.outputs[0].
Node.js example
// npm install wavespeed
const WaveSpeed = require('wavespeed');

const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env

const result = await client.run("higgsfield/soul/image-to-image", {
        "prompt": "A cinematic shot of a city at sunset, soft golden light",
        "image": "https://example.com/your-input.jpg",
        "size": "1152*2048",
        "style": "Creatures",
        "strength": 1,
        "quality": "medium",
        "seed": 0
});

console.log(result.outputs[0]); // → URL of the generated output
Python example
# pip install wavespeed
import wavespeed

output = wavespeed.run(
    "higgsfield/soul/image-to-image",
    {
    "prompt": "A cinematic shot of a city at sunset, soft golden light",
    "image": "https://example.com/your-input.jpg",
    "size": "1152*2048",
    "style": "Creatures",
    "strength": 1,
    "quality": "medium",
    "seed": 0
}
)

print(output["outputs"][0])  # → URL of the generated output

Soul Image To Image API — Frequently asked questions

What is the Soul Image To Image API?

Soul Image To Image is a Higgsfield model for image editing, exposed as a REST API on WaveSpeedAI. SOUL is a realistic image-to-image engine for sophisticated visuals that, with Soul ID, preserves character consistency across scenes. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing. You can call it programmatically or try it from the playground above.

How do I call the Soul Image To Image API?

POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/higgsfield/higgsfield-soul-image-to-image.

How much does Soul Image To Image cost per run?

Soul Image To Image starts at $0.090 per run. That figure is the base price — the final charge scales with the parameters you set in the form (output size, length, count, references, or whatever knobs this model exposes), so a higher-quality or larger output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.

What inputs does Soul Image To Image accept?

Key inputs: `prompt`, `image`, `size`, `seed`, `quality`, `strength`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/higgsfield/higgsfield-soul-image-to-image.

How long does Soul Image To Image take to generate?

Average end-to-end generation time on WaveSpeedAI is around 32 seconds per request — measured across recent runs. Queue time scales with global demand; live status is visible in the prediction record.

Can I use Soul Image To Image outputs commercially?

Commercial usage rights depend on the model's license, set by its provider (Higgsfield). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.