z-ai/glm-5.2
Дата выпуска: 2026-06-17
1,048,576 context · $1.40/M input tokens · $4.40/M output tokens
GLM 5.2 is Z.ai’s most advanced reasoning model, built for long-context, agentic, and engineering-intensive workloads. With support for a 1M-token context window and configurable High/XHigh reasoning modes, it delivers state-of-the-art performance in coding, tool use, and complex task execution.From requirements gathering and architecture design to implementation, testing, and multi-platform deployment, GLM 5.2 can maintain project-level context and consistently follow engineering best practices throughout the entire software development lifecycle.
Оплата по факту использования
Никаких авансовых платежей — платите только за то, чем пользуетесь
Используйте следующие примеры кода для интеграции с нашим API:
from openai import OpenAI
client = OpenAI(
api_key="YOUR_API_KEY",
base_url="https://llm.wavespeed.ai/v1"
)
response = client.chat.completions.create(
model="z-ai/glm-5.2",
messages=[
{"role": "user", "content": "Hello!"}
]
)
print(response.choices[0].message.content)GLM 5.2 is Z.ai’s latest large-scale reasoning model, designed for long-context understanding, advanced coding, and complex agent workflows. With support for a 1M-token context window and configurable reasoning levels, it can maintain project-scale context across extended interactions, making it well-suited for software engineering, research, automation, and multi-step problem solving.
The model supports both High and XHigh reasoning modes, with XHigh enabling its maximum reasoning capability. GLM 5.2 excels at code generation, tool use, structured outputs, and long-horizon task execution, allowing developers to build sophisticated AI agents and automation systems that operate reliably over large amounts of context.
This model is available through the WaveSpeed AI OpenAI-compatible API and can be integrated into existing applications with minimal changes.
| Specification | Value |
|---|---|
| Provider | chatglm |
| Model Type | Chat Completions |
| Architecture | Text → Text |
| Context Window | 1,048,576 tokens |
| Max Input | 786,432 tokens |
| Max Output | 262,144 tokens |
| Input | Text |
| Output | Text |
| Function Calling | Supported |
| Structured Outputs | Supported |
Base URL
https://llm.wavespeed.ai/v1
Endpoint
POST /chat/completions
Model ID
z-ai/glm-5.2
z-ai/glm-5.2chatglmz-ai/glm-5.2
GLM 5.2 is Z.ai’s most advanced reasoning model, built for long-context, agentic, and engineering-intensive workloads. With support for a 1M-token context window and configurable High/XHigh reasoning modes, it delivers state-of-the-art performance in coding, tool use, and complex task execution.From requirements gathering and architecture design to implementation, testing, and multi-platform deployment, GLM 5.2 can maintain project-level context and consistently follow engineering best practices throughout the entire software development lifecycle.
Ввод
$1.4 /M
Вывод
$4.4 /M
Контекст
1049K
Макс. вывод
262K
Использование инструментов
Поддерживается
Доступ к GLM 5.2 через наш единый API — совместимость с OpenAI, без холодных стартов, прозрачные цены.
Цены на WaveSpeedAI: $1.40 за миллион входных токенов и $4.40 за миллион выходных токенов. Prompt caching и batch processing тарифицируются отдельно и снижают эффективную стоимость длинных повторяющихся нагрузок.
GLM 5.2 поддерживает до 1049K токенов контекста и до 262K токенов вывода на запрос.
Да. WaveSpeedAI предоставляет GLM 5.2 через OpenAI-совместимый endpoint по адресу https://llm.wavespeed.ai/v1. Направьте официальный OpenAI SDK на этот base URL с ключом API WaveSpeedAI — других изменений в коде не требуется.
Войдите в WaveSpeedAI, создайте API-ключ в Access Keys и отправьте запрос на https://llm.wavespeed.ai/v1/chat/completions с указанным выше model id. Новые аккаунты получают бесплатные кредиты для оценки GLM 5.2.