qwen/qwen3.6-max-preview
262,144 context · $1.30/M input tokens · $7.80/M output tokens
Qwen3.6-Max-Preview is a proprietary frontier model from Alibaba Cloud built on a sparse Mixture-of-Experts architecture with approximately 1T total parameters. It is optimized for high-end reasoning, agentic coding, tool use, instruction following, and complex text generation workflows. The model supports a 262K-token context window, up to 64K output tokens, thinking mode, function calling, and structured outputs, making it suitable for demanding production tasks that require stronger reasoning capability over raw throughput.
従量課金
初期費用なし、使った分だけお支払い
以下のコード例を使用して API と連携してください:
from openai import OpenAI
client = OpenAI(
api_key="YOUR_API_KEY",
base_url="https://llm.wavespeed.ai/v1"
)
response = client.chat.completions.create(
model="qwen/qwen3.6-max-preview",
messages=[
{"role": "user", "content": "Hello!"}
]
)
print(response.choices[0].message.content)Qwen3.6-Max-Preview is a proprietary frontier model from Alibaba Cloud built on a sparse Mixture-of-Experts architecture with approximately 1T total parameters. It is optimized for high-end reasoning, agentic coding, tool use, instruction following, and complex text generation workflows.
| Specification | Value |
|---|---|
| Provider | alibaba |
| Model Type | Chat Completions model |
| Architecture | Sparse Mixture-of-Experts |
| Parameters | approximately 1T |
| Modalities | text->text |
| Context Window | 262,144 tokens |
| Max Input | 196,608 tokens |
| Max Output | 65,536 tokens |
| Thinking Budget | 128K tokens |
| Input | Text |
| Output | Text |
| Vision | Not listed |
| Function Calling | Supported |
| Structured Outputs | Supported |
| Thinking Mode | Supported |
| Release | April 2026 |
| Token Type | Cost |
|---|---|
| Input | $1.04 per million tokens |
| Output | $6.24 per million tokens |
| Cache Write | $1.30 per million tokens |
Base URL: https://llm.wavespeed.ai/v1
API Endpoint: chat/completions
Model ID: qwen/qwen3.6-max-preview
from openai import OpenAI
client = OpenAI(
api_key="YOUR_API_KEY",
base_url="https://llm.wavespeed.ai/v1"
)
response = client.chat.completions.create(
model="qwen/qwen3.6-max-preview",
messages=[{"role": "user", "content": "Hello!"}]
)
print(response.choices[0].message.content)
curl https://llm.wavespeed.ai/v1/chat/completions \
-H "Content-Type: application/json" \
-H "Authorization: Bearer YOUR_API_KEY" \
-d '{
"model": "qwen/qwen3.6-max-preview",
"messages": [{"role": "user", "content": "Hello!"}]
}'
qwen/qwen3.6-max-preview
Qwen3.6-Max-Preview is a proprietary frontier model from Alibaba Cloud built on a sparse Mixture-of-Experts architecture with approximately 1T total parameters. It is optimized for high-end reasoning, agentic coding, tool use, instruction following, and complex text generation workflows. The model supports a 262K-token context window, up to 64K output tokens, thinking mode, function calling, and structured outputs, making it suitable for demanding production tasks that require stronger reasoning capability over raw throughput.
入力
$1.3 /M
出力
$7.8 /M
コンテキスト
262K
最大出力
66K
ツール利用
対応
統合APIを通じてQwen3.6 Max Previewにアクセス — OpenAI互換、コールドスタートなし、透明な料金。
WaveSpeedAI の料金: 入力 100 万トークンあたり $1.30、出力 100 万トークンあたり $7.80。プロンプトキャッシュとバッチ処理は別途料金で、長く反復的なワークロードでは実効コストを下げられます。
Qwen3.6 Max Preview はリクエストあたり最大 262K のコンテキストトークンと最大 66K の出力トークンをサポートします。
はい。WaveSpeedAI は OpenAI 互換エンドポイント https://llm.wavespeed.ai/v1 で Qwen3.6 Max Preview を提供します。公式 OpenAI SDK のベース URL をこちらに変更し WaveSpeedAI の API キーを設定するだけで利用可能です。
WaveSpeedAI にサインインし、Access Keys で API キーを作成して、上に表示されているモデル ID を指定して https://llm.wavespeed.ai/v1/chat/completions にリクエストを送信してください。新規アカウントには Qwen3.6 Max Preview を試用できる無料クレジットが付与されます。