Qwen AI Models

Qwen multimodal models for image and video generation

All models

33 models
wavespeed-ai/qwen-image/text-to-image
text-to-image

Qwen-Image is a 20B MMDiT next-gen text-to-image model that generates images from text prompts. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

wavespeed-ai/qwen-image/text-to-image-lora
lora-support

Qwen-Image LoRA is a 20B MMDiT next-gen text-to-image model with LoRA support for fast customization and refined image generation. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

wavespeed-ai/qwen-image-lora-trainer
training

Train custom Qwen-Image LoRA models 10x faster. Style training, character training, object training. From concept to model in minutes, not hours. Upload a ZIP file containing images to start!

wavespeed-ai/qwen-image/edit
image-to-image

Qwen-Image-Edit is a 20B MMDiT image-to-image model offering precise bilingual (Chinese & English) text edits while preserving style. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

wavespeed-ai/qwen-image/edit-lora
lora-support

Qwen-Image-Edit LoRA (20B) enables bilingual Chinese/English image-to-image editing with style preservation, covering both semantic and appearance edits. Ready-to-use REST API, best performance, no cold starts, affordable pricing.

wavespeed-ai/qwen-image/edit-plus
image-to-image

Qwen-Image-Edit-Plus (2509) is a 20B MMDiT image editor with multi-image editing, single-image consistency, and native ControlNet support. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

wavespeed-ai/qwen-image/edit-plus-lora
lora-support

Qwen-Image-Edit-Plus (2509) is a 20B MMDiT image-to-image editor supporting multi-image edits, single-image consistency, and native ControlNet. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

alibaba/qwen-image/translate
image-to-image

Qwen Vision Translate offers OCR-based image understanding and multilingual in-image text translation for context-aware results. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

alibaba/qwen3-tts-flash
text-to-audio

Qwen3 TTS Flash: low-latency text-to-speech for English and Chinese with multiple voices, ideal for real-time dialogue. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

wavespeed-ai/jib-mix-qwen-image/text-to-image
text-to-image

Jib Mix Qwen is a next-gen text-to-image model optimized for producing natural, pretty faces with improved Asian facial rendering. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

wavespeed-ai/jib-mix-qwen-image/text-to-image-lora
lora-support

Jib Mix Qwen LoRA is a next-gen text-to-image model with LoRA support that specializes in natural, attractive faces and is particularly strong at rendering Asian facial features. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

wavespeed-ai/z-image/turbo
text-to-image

Z-Image-Turbo is a 6-billion-parameter text-to-image model that generates photorealistic images in under a second. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

wavespeed-ai/z-image/turbo-lora
lora-support

Z-Image-Turbo LoRA (6B) enables ultra-fast text-to-image generation with external LoRA support. Generate photorealistic images with sub-second latency while applying up to 3 LoRAs for custom styles. Ready-to-use REST API, best performance, no cold starts, affordable pricing.

wavespeed-ai/z-image/turbo-inpaint
image-to-image

Z-Image Turbo Inpaint delivers ultra-fast image inpainting with natural-language instructions—seamlessly fill, fix, or replace regions in your images with production-quality results. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

wavespeed-ai/qwen-image/layered
image-to-image

Qwen-Image Layered is a unified image-layer decomposition model for prompt-guided compositing. Provide points, boxes, or rough masks to isolate subjects and regions, and the model splits a single image into multiple RGBA layers with clean alpha, soft edges, and correct occlusion order. Ready-to-use REST inference API with fast response, no cold starts, and affordable pricing.

wavespeed-ai/qwen-image/edit-2511
image-to-image

Qwen Image Edit 2511 is a major upgrade over 2509 for real-world image editing and design. It delivers stronger edit consistency, robust multi-person identity/pose consistency, built-in LoRA styles, enhanced industrial/product design, and improved geometric reasoning for structure-preserving edits. Built for stable production use with a ready-to-use REST API, no cold starts, and predictable pricing.

wavespeed-ai/qwen-image/edit-2511-lora
lora-support

Qwen Image Edit 2511 LoRA is an enhanced version with custom LoRA support for personalized styles. It delivers stronger edit consistency, robust multi-person identity/pose consistency, custom LoRA styles, enhanced industrial/product design, and improved geometric reasoning for structure-preserving edits. Built for stable production use with a ready-to-use REST API, no cold starts, and predictable pricing.

wavespeed-ai/qwen-image/text-to-image-2512
text-to-image

Qwen Image 2512 is Qwen's latest text-to-image model with enhanced prompt understanding, superior text rendering, and versatile aspect ratio support. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

wavespeed-ai/qwen-image/text-to-image-2512-lora
lora-support

Qwen-Image-2512 LoRA is an enhanced 20B MMDiT text-to-image model with LoRA support for fast customization and refined image generation. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

wavespeed-ai/z-image-turbo/image-to-image
image-to-image

Z-Image-Turbo Image-to-Image is a 6 billion parameter model that enhances the quality of reference images (similar to upscaling) in sub-second time. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

wavespeed-ai/z-image-turbo/image-to-image-lora
lora-support

Z-Image-Turbo Image-to-Image LoRA transforms reference images with custom LoRA styles in sub-second time. Apply up to 3 LoRAs for personalized image transformation. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

wavespeed-ai/qwen-image-2512-lora-trainer
training

Qwen-Image-2512 LoRA Trainer lets you train custom LoRA models 10x faster with style, character, and object training. From concept to model in minutes, not hours—upload a ZIP file containing images to start. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

wavespeed-ai/z-image-turbo/controlnet
text-to-image

Z-Image-Turbo ControlNet generates images guided by structural control signals (depth, canny edge, pose) for precise composition control. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

wavespeed-ai/qwen-image/edit-multiple-angles
image-to-image

Generate specific camera angles from a single image using a 96-pose camera system. Control horizontal rotation, vertical tilt, and zoom to create front, side, back, and other views. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

wavespeed-ai/qwen3-tts/text-to-speech
text-to-audio

Qwen3 TTS: Multi-language, multi-voice text-to-speech synthesis with style control. Supports 11 languages and 9 voice characters. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

wavespeed-ai/qwen3-tts/voice-clone
audio-to-audio

Qwen3 TTS Voice Clone: Clone any voice from a reference audio and generate speech in that voice. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

wavespeed-ai/qwen3-tts/voice-design
text-to-audio

Qwen3 TTS Voice Design: Generate speech with custom voice characteristics described in natural language. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

wavespeed-ai/z-image/base
text-to-image

Z-Image-Base is a 6-billion-parameter text-to-image model with full CFG support. It supports negative prompting and fine-tuning for maximum control over image generation. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

wavespeed-ai/z-image/base-lora
lora-support

Z-Image-Base LoRA (6B) enables high-quality text-to-image generation with full CFG support and external LoRA support. It supports negative prompting while applying up to 3 LoRAs for custom styles. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

wavespeed-ai/z-image/base-lora-trainer
training

Z-Image Base LoRA Trainer – train custom image LoRA models from your own dataset, with zip uploads, auto-tuned defaults and fast iteration for brand, character or IP looks. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

wavespeed-ai/qwen-image-max/text-to-image
text-to-image

Qwen Image Max is a high-quality text-to-image model that supports Chinese and English prompts. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

wavespeed-ai/qwen-image-max/edit
image-to-image

Qwen Image Max Edit is an AI model for editing images with text prompts in both Chinese and English. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

wavespeed-ai/qwen-image/edit-2509-multiple-angles
image-to-image

Qwen Image Edit 2509 Multiple Angles is an AI image editing model that generates multiple-angle views of objects or scenes from a single image. Transform perspectives and create diverse viewpoints with text prompts. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

Qwen AI Models

Qwen multimodal models developed by Alibaba Cloud offer advanced capabilities in image and video generation. These models excel at creating high-quality visual content from text descriptions with a strong understanding of both Chinese and English prompts.

LoRA-ready Image Editing & Generation

  1. qwen-image/edit-plus-lora

Advanced image editing model with LoRA support, enabling precise style transfer, character customization, and high-fidelity local edits driven by text prompts.

  2. qwen-image/edit-lora

Lightweight edit model for LoRA-based style and character control, ideal for quick retouching, outfit changes, and consistent persona updates.

  3. qwen-image/text-to-image-lora

LoRA-enabled text-to-image generation that supports custom styles and characters while keeping strong prompt adherence and clean composition.

  4. jib-mix-qwen-image/text-to-image-lora

Mixed-style LoRA T2I model tuned for vivid anime and illustration aesthetics, combining sharp linework with rich color and expressive characters.

  5. qwen-image-lora-trainer

Training endpoint for building your own Qwen Image LoRA adapters from reference images, enabling personalized styles and characters across all LoRA-capable Qwen models.
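The trainer entries in the catalog ask for a ZIP file containing images. As a minimal sketch of preparing such an archive locally, the helper below packs the image files found in one folder into a flat ZIP. The function name `zip_training_images`, the accepted file extensions, and the flat archive layout are illustrative assumptions; the catalog does not specify the expected layout, so check the trainer's model page before relying on it.

```python
import zipfile
from pathlib import Path

# Extensions treated as images here. This set is an assumption for the
# sketch, not something stated in the catalog.
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg", ".webp"}


def zip_training_images(image_dir: str, zip_path: str) -> list[str]:
    """Pack every image file directly inside image_dir into zip_path.

    Returns the sorted list of file names written to the archive.
    """
    src = Path(image_dir)
    names = sorted(
        p.name
        for p in src.iterdir()
        if p.is_file() and p.suffix.lower() in IMAGE_SUFFIXES
    )
    if not names:
        raise ValueError(f"no image files found in {image_dir}")
    with zipfile.ZipFile(zip_path, "w", compression=zipfile.ZIP_DEFLATED) as archive:
        for name in names:
            # Store each file at the archive root (flat layout, assumed).
            archive.write(src / name, arcname=name)
    return names
```

Calling `zip_training_images("my_style_images", "my_style.zip")` (both paths hypothetical) would produce an archive ready to upload; non-image files in the folder are skipped.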

Base Image Editing

  1. qwen-image/edit-plus

Enhanced image editing model for high-quality global and local edits, improving lighting, realism, and detail while preserving subject identity.

  2. qwen-image/edit

General-purpose edit model for everyday photo and artwork adjustments, ideal for quick fixes, background tweaks, and light retouching.

  3. qwen-image/edit-2511

High-consistency image editing model for reliable multi-subject, identity-preserving edits, delivering reduced drift, stronger geometric control, and cleaner, product-grade results for iterative production workflows.

  4. qwen-image/edit-2511-lora

LoRA-enhanced editing model built on the 2511 backbone. It enables style injection, character customization, and fine-tuned aesthetic control while preserving the core stability of production-grade edits.

  5. qwen-image-max/edit

Advanced image editing model offering precise object manipulation, seamless background replacement, and intelligent style transfer, while preserving high-fidelity details and natural lighting.

Base Text-to-Image Generation

  1. qwen-image/text-to-image

Core T2I model that generates clean, realistic images from text prompts, suitable for product shots, portraits, and general creative use.

  2. jib-mix-qwen-image/text-to-image

Stylized T2I variant blending anime and illustration styles, producing vibrant, character-focused art with strong visual appeal.

  3. qwen-image/text-to-image-2512

Next-generation text-to-image model with enhanced prompt adherence, refined detail rendering, and improved compositional accuracy, engineered for photorealistic outputs and complex multi-element scene generation.

  4. qwen-image-max/text-to-image

Premium text-to-image model delivering exceptional detail, superior photorealism, and complex scene coherence. Designed for professional-grade generation with advanced lighting, texture rendering, and precise compositional control.

Utilities & Audio

  1. qwen-image/translate

Image translation utility that reads charts, UI screenshots, and text-heavy graphics, then outputs translated content while preserving layout semantics.

  2. qwen3-tts family

Fast text-to-speech models for natural-sounding voice previews, optimized for low latency in assistants, demos, and real-time applications.

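Qwen AI Models API: Pricing and Performance

Run any model in the Qwen AI Models family through a single REST API. Billing is per generation, with no subscription and no minimum spend, on infrastructure that offers 99.9% availability and industry-leading latency.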
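To illustrate what "a single REST API for every model" can look like from the client side, here is a minimal sketch that builds (but does not send) a POST request whose path is the model ID from the catalog. Everything about the wire format is an assumption for illustration: the base URL is a placeholder, and the URL layout, Bearer authentication header, and `prompt` field are hypothetical and should be checked against the provider's API reference.

```python
import json
import urllib.request


def build_generation_request(
    base_url: str, model_id: str, api_key: str, inputs: dict
) -> urllib.request.Request:
    """Build (but do not send) a POST request for one model.

    Assumed for illustration: the URL is base_url + "/" + model_id, the key
    travels in a Bearer header, and inputs are sent as a JSON body.
    """
    url = f"{base_url.rstrip('/')}/{model_id.strip('/')}"
    body = json.dumps(inputs).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
    )


# Switching models only changes the model ID; the calling code stays the same.
req = build_generation_request(
    "https://api.example.invalid/v1",  # placeholder, not a real endpoint
    "wavespeed-ai/qwen-image/text-to-image",
    "YOUR_API_KEY",
    {"prompt": "a red bicycle leaning against a brick wall"},  # hypothetical field
)
print(req.get_method(), req.full_url)
```

Sending the request would be a call to `urllib.request.urlopen(req)` once the real base URL and input schema are known.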

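Why run Qwen AI Models on WaveSpeedAI

Transparent pricing

Every model in the Qwen AI Models family has a per-call price. Prices are listed on each model's page, and there are no additional platform fees.

Optimized for low latency

Most image models in the Qwen AI Models family complete in under 2 seconds. Video and 3D models run several times faster than self-hosted setups.

99.9% availability

Multi-region failover and automatic retries keep your production traffic online, even during provider outages.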
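The availability point above mentions multi-region failover and automatic retries. The platform's own mechanism is not described on this page, so the following is only a sketch of the general pattern: retry a failed call a few times, then move on to the next region. The function name `call_with_failover`, the region names, and the choice of `ConnectionError` as the retryable error are all hypothetical.

```python
import time
from typing import Callable, Optional, Sequence


def call_with_failover(
    regions: Sequence[str],
    send: Callable[[str], str],
    retries_per_region: int = 2,
    backoff_seconds: float = 0.0,
) -> str:
    """Try each region in order, retrying a failed call before moving on."""
    last_error: Optional[Exception] = None
    for region in regions:
        for attempt in range(retries_per_region):
            try:
                return send(region)
            except ConnectionError as exc:
                last_error = exc
                # Exponential backoff: wait longer after each failed attempt.
                time.sleep(backoff_seconds * (2 ** attempt))
    raise RuntimeError("all regions failed") from last_error


def demo_send(region: str) -> str:
    """Stand-in for a real network call; the first region always fails."""
    if region == "region-a":
        raise ConnectionError("region-a unavailable")
    return f"ok from {region}"


print(call_with_failover(["region-a", "region-b"], demo_send))
```

In this demo the first region is tried twice, fails both times, and the call then succeeds against the second region.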

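Frequently asked questions

How much does the Qwen AI Models API cost?

Each model lists its own per-call price on its model page. We bill per successful generation, with no subscription fee or minimum spend.

How fast are Qwen AI Models on WaveSpeedAI?

Image models in this family typically complete in under 2 seconds. Video and 3D models depend on duration and resolution, but are typically several times faster than self-hosted runs.

Can I try the API without a credit card?

Yes. Every account receives $20 in free credits at sign-up, enough for hundreds of calls on most Qwen AI Models models.

Are there rate limits?

Standard accounts have generous concurrent-task limits. Enterprise plans offer custom RPM, higher concurrency, and dedicated capacity; contact sales for details.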
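The free-credit answer above can be made concrete with a little arithmetic: the number of calls a balance covers is the balance divided by the per-call price, rounded down. The sketch below uses the $20 figure from the FAQ; the per-call prices are hypothetical examples, since actual prices are listed only on each model's page.

```python
from decimal import Decimal


def calls_covered(credit_usd: str, price_per_call_usd: str) -> int:
    """Return how many whole calls a credit balance pays for."""
    credit = Decimal(credit_usd)
    price = Decimal(price_per_call_usd)
    if price <= 0:
        raise ValueError("price per call must be positive")
    # Floor division: a partial call cannot be purchased.
    return int(credit // price)


FREE_CREDIT_USD = "20"  # sign-up credit stated in the FAQ

# Hypothetical per-call prices, for illustration only.
for price in ("0.02", "0.05", "0.10"):
    print(f"${price} per call -> {calls_covered(FREE_CREDIT_USD, price)} calls")
```

Amounts are passed as strings and handled with `Decimal` so that prices such as 0.10 are not distorted by binary floating-point rounding.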