qwen/qwen3.6-flash
1,000,000 context · $0.25/M input tokens · $1.50/M output tokens
Qwen3.6 Flash is a fast, efficient multimodal language model from Alibaba’s Qwen 3.6 series. It supports text, image, and video inputs with a 1M-token context window and up to 64K output tokens. The model is designed for high-throughput chat, lightweight agent workflows, long-document understanding, visual reasoning, summarization, extraction, and cost-sensitive production workloads. It supports thinking mode, function calling, built-in tools, structured outputs, and batch calling.
Bayar sesuai pemakaian
Tanpa biaya di muka, bayar hanya sesuai penggunaan
Gunakan contoh kode berikut untuk integrasi dengan API kami:
from openai import OpenAI
client = OpenAI(
api_key="YOUR_API_KEY",
base_url="https://llm.wavespeed.ai/v1"
)
response = client.chat.completions.create(
model="qwen/qwen3.6-flash",
messages=[
{"role": "user", "content": "Hello!"}
]
)
print(response.choices[0].message.content)Qwen3.6 Flash is a fast, efficient multimodal language model from Alibaba’s Qwen 3.6 series. It supports text, image, and video inputs with a 1M-token context window, making it a strong fit for high-volume chat, lightweight agents, long-document workflows, visual understanding, summarization, and structured extraction.
| Specification | Value |
|---|---|
| Provider | alibaba |
| Model Type | Chat Completions model |
| Architecture | text+image+video->text |
| Context Window | 1,000,000 tokens |
| Max Input | 934,464 tokens |
| Max Output | 65,536 tokens |
| Thinking Budget | 128K tokens |
| Input | Text, Image, Video |
| Output | Text |
| Vision | Supported |
| Function Calling | Supported |
| Built-in Tools | Supported |
| Structured Outputs | Supported |
| Batch Calling | Supported |
Base URL: https://llm.wavespeed.ai/v1 API Endpoint: chat/completions Model ID: qwen/qwen3.6-flash
from openai import OpenAI
client = OpenAI(
api_key="YOUR_API_KEY",
base_url="https://llm.wavespeed.ai/v1"
)
response = client.chat.completions.create(
model="qwen/qwen3.6-flash",
messages=[{"role": "user", "content": "Hello!"}]
)
print(response.choices[0].message.content)qwen/qwen3.6-flash
Qwen3.6 Flash is a fast, efficient multimodal language model from Alibaba’s Qwen 3.6 series. It supports text, image, and video inputs with a 1M-token context window and up to 64K output tokens. The model is designed for high-throughput chat, lightweight agent workflows, long-document understanding, visual reasoning, summarization, extraction, and cost-sensitive production workloads. It supports thinking mode, function calling, built-in tools, structured outputs, and batch calling.
Input
$0.25 /M
Output
$1.5 /M
Konteks
1000K
Output Maks.
66K
Vision
Didukung
Penggunaan Tool
Didukung
Akses Qwen3.6 Flash melalui API terpadu kami — kompatibel dengan OpenAI, tanpa cold start, harga transparan.
Harga di WaveSpeedAI: $0.25 per juta token input dan $1.50 per juta token output. Prompt caching dan batch processing ditagih terpisah dan mengurangi biaya efektif pada beban kerja yang panjang dan berulang.
Qwen3.6 Flash mendukung hingga 1000K token konteks dengan hingga 66K token output per permintaan.
Ya. WaveSpeedAI menyediakan Qwen3.6 Flash melalui endpoint yang kompatibel dengan OpenAI di https://llm.wavespeed.ai/v1. Arahkan OpenAI SDK resmi ke base URL ini dengan API key WaveSpeedAI Anda — tanpa perubahan kode lainnya.
Masuk ke WaveSpeedAI, buat API key di Access Keys, lalu kirim permintaan ke https://llm.wavespeed.ai/v1/chat/completions dengan model id seperti ditampilkan di atas. Akun baru menerima kredit gratis untuk menguji Qwen3.6 Flash.