
text-to-image
Idle

Sua solicitação custará $0.02 por execução.
Por $1 você pode executar este modelo aproximadamente 50 vezes.
Mais uma coisa::











Wan 2.1 is part of the Wan 2.1 foundation model suite, an advanced AI system developed to redefine video and image generation. This model focuses on text-to-image synthesis — transforming detailed written prompts into vivid, high-resolution visuals with cinematic precision.
🎨 SOTA Image Quality Built on Wan 2.1’s next-generation video foundation, this model produces exceptional still-frame quality with realistic lighting, texture, and depth.
🧠 Multilingual Understanding Supports both Chinese and English prompts, ensuring accurate and context-rich image generation across languages.
⚙️ Fine Control with Parameters
Adjustable inputs such as strength, width, and height provide creators with direct control over composition and style.
🪄 Powerful Visual Consistency Based on Wan-VAE, enabling coherent detail, color fidelity, and stylistic alignment across resolutions.
💰 Lightweight and Efficient High-quality generation at a base cost of just $0.02 per image, ideal for scalable creative workflows.
| Parameter | Description |
|---|---|
| prompt* | Text description of the image to be generated (supports CN/EN). |
| image | (Optional) Upload a reference image for guided generation. |
| strength | Controls how strongly the image follows the prompt or reference (0–1). |
| size (width / height) | Define custom output resolution; max recommended ratio 2:1. |
| seed | Fix for reproducibility or randomize for variation. |
| output_format | Choose from jpeg, png, or webp. |
Envision an ethereal and highly decorative portrait of an androgynous Elven Monarch, seated upon a throne carved from living iridescent wood within a moonlit glade. Intricate Art Nouveau details, luminous textures, soft-focus background, cinematic lighting.
| Metric | Price |
|---|---|
| Per image generated | $0.02 / image |