Seedance 2.0 15% DE DESCONTO | Crie no Video Generator →
chatglm
z-ai/glm-5.2

z-ai/glm-5.2

Data de lançamento: 2026-06-17

1,048,576 context · $1.40/M input tokens · $4.40/M output tokens

GLM 5.2 is Z.ai’s most advanced reasoning model, built for long-context, agentic, and engineering-intensive workloads. With support for a 1M-token context window and configurable High/XHigh reasoning modes, it delivers state-of-the-art performance in coding, tool use, and complex task execution.From requirements gathering and architecture design to implementation, testing, and multi-platform deployment, GLM 5.2 can maintain project-level context and consistently follow engineering best practices throughout the entire software development lifecycle.

Preços

Pagamento por uso

Sem custo inicial, pague apenas pelo que usar

Entrada$1.40 / M Tokens
Saída$4.40 / M Tokens
Cache Read$0.26 / M Tokens

Experimentar o modelo

z-ai/glm-5.2
Online
chatglm
Olá! Sou um assistente de IA útil. Em que posso ajudar?

Uso da API

Use os exemplos de código abaixo para integrar com nossa API:

from openai import OpenAI

client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://llm.wavespeed.ai/v1"
)

response = client.chat.completions.create(
    model="z-ai/glm-5.2",
    messages=[
        {"role": "user", "content": "Hello!"}
    ]
)

print(response.choices[0].message.content)

Introdução do modelo

Z.ai: GLM 5.2

GLM 5.2 is Z.ai’s latest large-scale reasoning model, designed for long-context understanding, advanced coding, and complex agent workflows. With support for a 1M-token context window and configurable reasoning levels, it can maintain project-scale context across extended interactions, making it well-suited for software engineering, research, automation, and multi-step problem solving.

The model supports both High and XHigh reasoning modes, with XHigh enabling its maximum reasoning capability. GLM 5.2 excels at code generation, tool use, structured outputs, and long-horizon task execution, allowing developers to build sophisticated AI agents and automation systems that operate reliably over large amounts of context.

This model is available through the WaveSpeed AI OpenAI-compatible API and can be integrated into existing applications with minimal changes.


Why Choose GLM 5.2

  • Massive 1M-token context window for large documents, repositories, and long-running workflows
  • Strong reasoning performance for coding, planning, and complex multi-step tasks
  • Optimized for agentic applications with function calling and tool-use support
  • Structured output generation for JSON-based workflows and schema-constrained responses
  • Flexible reasoning controls for balancing speed, cost, and reasoning depth
  • Competitive pricing for large-context production workloads

Key Features

  • Context Window: 1,048,576 tokens
  • Max Input: 786,432 tokens
  • Max Output: 262,144 tokens
  • Architecture: Text → Text
  • Function Calling: Supported
  • Structured Outputs: Supported
  • Reasoning Controls: Supported
  • Vision: Not listed
  • Audio Input: Not listed
  • Image Generation: Not listed

Specifications

SpecificationValue
Providerchatglm
Model TypeChat Completions
ArchitectureText → Text
Context Window1,048,576 tokens
Max Input786,432 tokens
Max Output262,144 tokens
InputText
OutputText
Function CallingSupported
Structured OutputsSupported

API Integration

Base URL

https://llm.wavespeed.ai/v1

Endpoint

POST /chat/completions

Model ID

z-ai/glm-5.2

Common Use Cases

  • AI coding assistants
  • Software engineering agents
  • Large-scale codebase analysis
  • Research and document intelligence
  • Workflow automation
  • Multi-agent systems
  • Structured data extraction
  • Long-context reasoning applications

Notes

  • Model ID: z-ai/glm-5.2
  • Provider: chatglm

Info

Provedorchatglm
Tipollm

Funcionalidades suportadas

Entrada
Texto
Saída
Texto
Contexto1,048,576
Saída máx.262,144
Vision-
Function Calling✓ Suportado

Guia de acesso à API

Base URLhttps://llm.wavespeed.ai/v1
API Endpointchat/completions
ID do modeloz-ai/glm-5.2

GLM 5.2 API

z-ai/glm-5.2

GLM 5.2 is Z.ai’s most advanced reasoning model, built for long-context, agentic, and engineering-intensive workloads. With support for a 1M-token context window and configurable High/XHigh reasoning modes, it delivers state-of-the-art performance in coding, tool use, and complex task execution.From requirements gathering and architecture design to implementation, testing, and multi-platform deployment, GLM 5.2 can maintain project-level context and consistently follow engineering best practices throughout the entire software development lifecycle.

Entrada

$1.4 /M

Saída

$4.4 /M

Contexto

1049K

Saída máx.

262K

Uso de ferramentas

Suportado

Experimente GLM 5.2 no WaveSpeedAI

Acesse GLM 5.2 através da nossa API unificada — compatível com OpenAI, sem inicializações a frio, preços transparentes.

Perguntas frequentes sobre GLM 5.2

Quanto custa GLM 5.2 via API?+

Preços no WaveSpeedAI: $1.40 por milhão de tokens de entrada e $4.40 por milhão de tokens de saída. Prompt caching e batch processing são cobrados separadamente e reduzem o custo efetivo em cargas longas e repetitivas.

Qual é a janela de contexto do GLM 5.2?+

GLM 5.2 suporta até 1049K tokens de contexto e até 262K tokens de saída por requisição.

GLM 5.2 é compatível com OpenAI?+

Sim. O WaveSpeedAI expõe o GLM 5.2 através de um endpoint compatível com OpenAI em https://llm.wavespeed.ai/v1. Aponte o SDK oficial da OpenAI para esta base URL com sua chave API do WaveSpeedAI — sem outras alterações no código.

Como começo a usar o GLM 5.2?+

Entre no WaveSpeedAI, crie uma chave API em Access Keys, então envie uma requisição para https://llm.wavespeed.ai/v1/chat/completions com o model id mostrado acima. Contas novas recebem créditos grátis para avaliar o GLM 5.2.

APIs LLM relacionadas