Seedance 2.0 20% TANIEJ | Twórz w Video Generator →

Nucleus Image Text to Image

wavespeed-ai /

Nucleus Image generates high-quality images from text prompts with flexible aspect ratios, adjustable inference steps, and classifier-free guidance. Supports negative prompts, reproducible seeds, and multiple output formats. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

text-to-image
Wejście
If enabled, the output will be encoded into a BASE64 string instead of a URL. This property is only available through the API.

Bezczynny

A young woman in a beige trench coat sits by the window of a nearly empty subway car at 11:47 PM, cradling a bouquet of white baby's breath that's begun to wilt. A faded lipstick print stains the rim of her half-finished coffee cup; her eyes gaze blankly at the tunnel lights streaking past. At the far end of the car, a man in black-framed glasses watches her quietly — a concert ticket slipping from between his fingers. Cold white fluorescent ceiling light, faint fog condensation on the window glass, subtle reflections, 35mm film grain, Kodak Portra 400 aesthetic, cinematic 2.39:1 widescreen.

$0.01za uruchomienie·~100 / $1

Dalej:

PrzykładyZobacz wszystkie

A young woman in a beige trench coat sits by the window of a nearly empty subway car at 11:47 PM, cradling a bouquet of white baby's breath that's begun to wilt. A faded lipstick print stains the rim of her half-finished coffee cup; her eyes gaze blankly at the tunnel lights streaking past. At the far end of the car, a man in black-framed glasses watches her quietly — a concert ticket slipping from between his fingers. Cold white fluorescent ceiling light, faint fog condensation on the window glass, subtle reflections, 35mm film grain, Kodak Portra 400 aesthetic, cinematic 2.39:1 widescreen.

A young woman in a beige trench coat sits by the window of a nearly empty subway car at 11:47 PM, cradling a bouquet of white baby's breath that's begun to wilt. A faded lipstick print stains the rim of her half-finished coffee cup; her eyes gaze blankly at the tunnel lights streaking past. At the far end of the car, a man in black-framed glasses watches her quietly — a concert ticket slipping from between his fingers. Cold white fluorescent ceiling light, faint fog condensation on the window glass, subtle reflections, 35mm film grain, Kodak Portra 400 aesthetic, cinematic 2.39:1 widescreen.

A Martian base greenhouse, 2089. An American female botanist in her 40s kneels on the soil, staring through her helmet visor at the first sunflower ever grown on another planet. Rust-red Martian dust clings to her white spacesuit sleeves; her name tag reads "Dr. Lin." Outside the greenhouse dome: an orange Martian horizon with two small moons rising. In her other hand, she grips a folded, well-worn family photograph. Grounded sci-fi realism, soft diffused light, emotional contrast between warmth and desolation.

A Martian base greenhouse, 2089. An American female botanist in her 40s kneels on the soil, staring through her helmet visor at the first sunflower ever grown on another planet. Rust-red Martian dust clings to her white spacesuit sleeves; her name tag reads "Dr. Lin." Outside the greenhouse dome: an orange Martian horizon with two small moons rising. In her other hand, she grips a folded, well-worn family photograph. Grounded sci-fi realism, soft diffused light, emotional contrast between warmth and desolation.

Powiązane modele

README

Nucleus Image Text-to-Image

Nucleus Image Text-to-Image generates high-quality images from text prompts with precise control over inference steps, guidance scale, and aspect ratio. An affordable, flexible text-to-image model built for creative and production workflows.

Why Choose This?

  • Fine-grained generation control Adjust inference steps (1–100) and guidance scale (0–20) to tune the balance between creativity and prompt adherence.

  • Negative prompt support Specify what to exclude from the output for more precise control over the result.

  • Multiple aspect ratio presets Choose from 1:1, 16:9, 9:16, 4:3, 3:4, 3:2, or 2:3 to match any platform or format.

  • Batch generation Generate up to 2 images per run for quick side-by-side comparison.

  • Reproducible results Use the seed parameter to lock in a specific output for consistent iteration.

  • Output format choice Export in PNG or JPEG based on your delivery requirements.

Parameters

ParameterRequiredDescription
promptYesText description of the image subject, style, and mood.
negative_promptNoElements to exclude from the generated image.
aspect_ratioNoOutput aspect ratio. Options: 1:1 (default), 16:9, 9:16, 4:3, 3:4, 3:2, 2:3.
num_imagesNoNumber of images to generate per run: 1 (default) or 2.
num_inference_stepsNoNumber of inference steps. Range: 1–100. Default: 50.
guidance_scaleNoClassifier-free guidance scale. Range: 0–20. Default: 8.
output_formatNoOutput file format: png (default) or jpeg.
seedNoRandom seed for reproducible results.

How to Use

  1. Write your prompt — describe the subject, scene, style, lighting, and mood.
  2. Add negative prompt (optional) — specify elements you want to exclude.
  3. Select aspect ratio — choose the format that fits your target platform.
  4. Set num_images (optional) — generate 1 or 2 images per run.
  5. Adjust inference steps and guidance scale (optional) — higher steps for more detail, higher guidance for stricter prompt adherence.
  6. Choose output format — png for lossless, jpeg for smaller file size.
  7. Set seed (optional) — fix the seed to reproduce a specific result.
  8. Submit — generate and download your image.

Pricing

Just $0.01 per image.

Best Use Cases

  • Rapid prototyping — Generate visual concepts quickly at very low cost for iteration and ideation.
  • Social media content — Create platform-optimized images across multiple aspect ratios.
  • High-volume workflows — Affordable per-image pricing makes it ideal for large-scale generation pipelines.
  • Creative exploration — Tune inference steps and guidance scale to explore different visual styles from the same prompt.
  • Developer integrations — Embed flexible, low-cost image generation into any app or workflow.

Pro Tips

  • Higher num_inference_steps (50–100) produces more detailed and refined results; lower values (10–20) are faster for quick drafts.
  • Increase guidance_scale for stricter prompt adherence; lower values allow more creative variation.
  • Use negative_prompt to avoid common artifacts like blurry faces, extra limbs, or unwanted styles.
  • Generate 2 images per run to quickly compare variations before committing to a final render.
  • Fix the seed while adjusting other parameters to isolate the effect of each change.

Notes

  • Only prompt is required; all other parameters are optional.
  • Maximum 2 images per generation run.
  • Please ensure your content complies with WaveSpeed AI's usage policies.
Dostępność:Ta strona korzysta z modeli AI udostępnianych przez podmioty trzecie.

Nucleus Image Text To Image API — Quick start

Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/wavespeed-ai/nucleus-image/text-to-image with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status flips to completed, then read the output URL from data.outputs[0]. Examples for Nucleus Image Text To Image below.

HTTP example
# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/wavespeed-ai/nucleus-image/text-to-image" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY" \
  -d '{
    "prompt": "A cinematic shot of a city at sunset, soft golden light",
    "negative_prompt": "blurry, low quality, distorted",
    "aspect_ratio": "1:1",
    "num_images": 1,
    "num_inference_steps": 50,
    "guidance_scale": 8,
    "output_format": "png",
    "enable_base64_output": false,
    "seed": 0
}'

# Response includes a prediction id. Poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY"

# When status is "completed", read the output from data.outputs[0].
Node.js example
// npm install wavespeed
const WaveSpeed = require('wavespeed');

const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env

const result = await client.run("wavespeed-ai/nucleus-image/text-to-image", {
        "prompt": "A cinematic shot of a city at sunset, soft golden light",
        "negative_prompt": "blurry, low quality, distorted",
        "aspect_ratio": "1:1",
        "num_images": 1,
        "num_inference_steps": 50,
        "guidance_scale": 8,
        "output_format": "png",
        "enable_base64_output": false,
        "seed": 0
});

console.log(result.outputs[0]); // → URL of the generated output
Python example
# pip install wavespeed
import wavespeed

output = wavespeed.run(
    "wavespeed-ai/nucleus-image/text-to-image",
    {
    "prompt": "A cinematic shot of a city at sunset, soft golden light",
    "negative_prompt": "blurry, low quality, distorted",
    "aspect_ratio": "1:1",
    "num_images": 1,
    "num_inference_steps": 50,
    "guidance_scale": 8,
    "output_format": "png",
    "enable_base64_output": false,
    "seed": 0
}
)

print(output["outputs"][0])  # → URL of the generated output

Nucleus Image Text To Image API — Frequently asked questions

What is the Nucleus Image Text To Image API?

Nucleus Image Text To Image is a WaveSpeedAI model for image generation, exposed as a REST API on WaveSpeedAI. Nucleus Image generates high-quality images from text prompts with flexible aspect ratios, adjustable inference steps, and classifier-free guidance. Supports negative prompts, reproducible seeds, and multiple output formats. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing. You can call it programmatically or try it from the playground above.

How do I call the Nucleus Image Text To Image API?

POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/wavespeed-ai/nucleus-image-text-to-image.

How much does Nucleus Image Text To Image cost per run?

Nucleus Image Text To Image starts at $0.010 per run. That figure is the base price — the final charge scales with the parameters you set in the form (output size, length, count, references, or whatever knobs this model exposes), so a higher-quality or larger output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.

What inputs does Nucleus Image Text To Image accept?

Key inputs: `prompt`, `aspect_ratio`, `seed`, `guidance_scale`, `num_inference_steps`, `negative_prompt`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/wavespeed-ai/nucleus-image-text-to-image.

How do I get started with the Nucleus Image Text To Image API?

Sign up for a free WaveSpeedAI account to claim starter credits, copy your API key from /accesskey, then call the endpoint shown in the API tab of the playground. The playground also auto-generates a code sample in Python, JavaScript, or cURL for the parameters you've set.

Can I use Nucleus Image Text To Image outputs commercially?

Commercial usage rights depend on the model's license, set by its provider (WaveSpeedAI). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.