GLM 4.5 Air | Z.ai LLM API

Model Introduction

Z-Ai glm-4.5-air

GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-built for agent-centric applications. Like GLM-4.5, it adopts the Mixture-of-Experts (MoE) architecture but with a smaller parameter count. GLM-4.5-Air also supports hybrid inference modes, offering a "thinking mode" for advanced reasoning and tool use, and a "non-thinking mode" for real-time interaction. Users can control the reasoning behaviour with the reasoning enabled boolean.
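The switch between thinking and non-thinking mode could be set per request. The following is a minimal sketch, assuming the flag travels in the request body as a `reasoning` object with an `enabled` boolean. That field shape is an assumption inferred from the description above, and `build_chat_request` is a hypothetical helper, so verify both against the provider documentation.

```python
from typing import Any

MODEL_ID = "z-ai/glm-4.5-air"


def build_chat_request(prompt: str, thinking: bool) -> dict[str, Any]:
    """Build keyword arguments for client.chat.completions.create().

    ASSUMPTION: the reasoning toggle is sent as {"reasoning": {"enabled": bool}}.
    The OpenAI SDK forwards non-standard fields through its `extra_body` argument.
    """
    return {
        "model": MODEL_ID,
        "messages": [{"role": "user", "content": prompt}],
        "extra_body": {"reasoning": {"enabled": thinking}},
    }


# Thinking mode for a multi-step task, non-thinking mode for a quick reply.
deep = build_chat_request("Plan a three-step data migration.", thinking=True)
fast = build_chat_request("Say hello.", thinking=False)
```

Either dict can be unpacked into the SDK call, for example `client.chat.completions.create(**fast)`.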

Why It Stands Out

Mixture-of-Experts (MoE) architecture with a compact parameter size for efficient processing
131,072-token context window for long-document handling
Competitive pricing at $0.1 input / $0.9 output per million tokens

Key Features

Context Window: 131072 tokens
Max Output: 98304 tokens
Vision: Supported
Function Calling: Supported
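Function calling on an OpenAI-compatible endpoint usually means sending a tool schema and then executing whatever call the model returns. The sketch below assumes this endpoint follows the standard OpenAI `tools` format. `get_weather` and `run_tool_call` are hypothetical names used only for illustration.

```python
import json
from typing import Any, Callable

# Tool schema in the OpenAI "tools" format; get_weather is a hypothetical tool.
TOOLS = [
    {
        "type": "function",
        "function": {
            "name": "get_weather",
            "description": "Return the current weather for a city.",
            "parameters": {
                "type": "object",
                "properties": {"city": {"type": "string"}},
                "required": ["city"],
            },
        },
    }
]


def get_weather(city: str) -> str:
    # Placeholder body; a real tool would query a weather service here.
    return f"Weather lookup for {city} is not implemented in this sketch."


LOCAL_TOOLS: dict[str, Callable[..., str]] = {"get_weather": get_weather}


def run_tool_call(name: str, arguments_json: str) -> str:
    """Dispatch one tool call; arguments arrive as a JSON-encoded string."""
    if name not in LOCAL_TOOLS:
        raise ValueError(f"Unknown tool: {name}")
    arguments: dict[str, Any] = json.loads(arguments_json)
    return LOCAL_TOOLS[name](**arguments)
```

In use, `tools=TOOLS` would be passed to `client.chat.completions.create(...)`, and each entry in `response.choices[0].message.tool_calls` would be handled with `run_tool_call(call.function.name, call.function.arguments)`.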

Specifications

Specification	Value
Provider	Z-Ai
Model Type	Large Language Model (LLM)
Architecture	Mixture-of-Experts (MoE)
Context Window	131072 tokens
Max Output	98304 tokens
Input	Text
Output	Text
Vision	Supported
Function Calling	Supported
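The two token limits in the table can be combined into a simple pre-flight check. This is a rough sketch, assuming prompt and output tokens share the 131072-token context window (an assumption, not stated in the table); `clamp_max_tokens` is a hypothetical helper.

```python
CONTEXT_WINDOW = 131_072  # tokens, from the specification table
MAX_OUTPUT = 98_304       # tokens, from the specification table


def clamp_max_tokens(prompt_tokens: int, requested_output: int) -> int:
    """Return an output budget that respects both limits.

    ASSUMPTION: prompt and output tokens count against the same context window.
    """
    if prompt_tokens < 0 or requested_output < 0:
        raise ValueError("token counts must be non-negative")
    remaining = max(CONTEXT_WINDOW - prompt_tokens, 0)
    return min(requested_output, MAX_OUTPUT, remaining)
```

The result could be passed as `max_tokens` in a chat completion request; a value of 0 signals that the prompt alone already fills the window.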

Pricing

Token Type	Cost per Million Tokens
Input	$0.1
Output	$0.9
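Per-request cost follows directly from the table: tokens divided by one million, multiplied by the rate. A small sketch, with the table's rates as default arguments so that other rates can be passed in; `estimate_cost_usd` is a hypothetical helper.

```python
def estimate_cost_usd(
    input_tokens: int,
    output_tokens: int,
    input_rate: float = 0.1,   # USD per million input tokens, from the table
    output_rate: float = 0.9,  # USD per million output tokens, from the table
) -> float:
    """Estimate the cost of one request from its token counts."""
    return (input_tokens * input_rate + output_tokens * output_rate) / 1_000_000
```

For example, 200,000 input tokens and 50,000 output tokens come to roughly $0.065 at these rates.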

How to Use

Write your prompt — describe the task, provide context, and specify desired output format.
Submit — the model processes your request and returns the response.
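The three prompt ingredients named above (task, context, output format) can be assembled with a small helper. This is only one possible layout; the section labels and the `build_prompt` name are illustrative assumptions, not a format the model requires.

```python
def build_prompt(task: str, context: str = "", output_format: str = "") -> str:
    """Join the task, optional context, and optional output format into one prompt."""
    sections = [f"Task: {task.strip()}"]
    if context.strip():
        sections.append(f"Context: {context.strip()}")
    if output_format.strip():
        sections.append(f"Output format: {output_format.strip()}")
    return "\n\n".join(sections)
```

The returned string would be sent as the `content` of a user message.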

API Integration

Base URL: https://llm.wavespeed.ai/v1
API Endpoint: chat/completions
Model ID: z-ai/glm-4.5-air

API Usage

Python SDK

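from openai import OpenAI

# Point the official OpenAI SDK at the OpenAI-compatible endpoint.
client = OpenAI(
    api_key="YOUR_API_KEY",  # replace with your WaveSpeedAI API key
    base_url="https://llm.wavespeed.ai/v1"
)

# Send a single-turn chat request to GLM 4.5 Air.
response = client.chat.completions.create(
    model="z-ai/glm-4.5-air",
    messages=[
        {"role": "user", "content": "Hello!"}
    ]
)

# Print the text of the first returned choice.
print(response.choices[0].message.content)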

cURL

curl https://llm.wavespeed.ai/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "model": "z-ai/glm-4.5-air",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
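For environments without the OpenAI SDK, the same request as the cURL command can be built with the Python standard library. This is a sketch under the assumption that the response follows the usual OpenAI chat completion shape; `build_request` is a hypothetical helper, and the network call is left commented out.

```python
import json
import urllib.request

BASE_URL = "https://llm.wavespeed.ai/v1"


def build_request(api_key: str, prompt: str) -> urllib.request.Request:
    """Build the POST request that mirrors the cURL example."""
    payload = {
        "model": "z-ai/glm-4.5-air",
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        url=f"{BASE_URL}/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


request = build_request("YOUR_API_KEY", "Hello!")

# Sending is commented out so the sketch runs without network access.
# ASSUMPTION: the reply uses the standard OpenAI chat completion layout.
# with urllib.request.urlopen(request) as reply:
#     body = json.load(reply)
#     print(body["choices"][0]["message"]["content"])
```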

Notes

Model: z-ai/glm-4.5-air
Provider: Z-Ai

Frequently Asked Questions about GLM 4.5 Air

How much does the GLM 4.5 Air API cost?

Pricing on WaveSpeedAI: $0.13 per million input tokens and $0.85 per million output tokens. Prompt caching and batch processing are billed separately and lower the effective cost of long, repetitive workloads.

What is the context window of GLM 4.5 Air?

GLM 4.5 Air supports up to 131K tokens of context and up to 98K output tokens per request.

Is GLM 4.5 Air compatible with OpenAI?

Yes. WaveSpeedAI exposes GLM 4.5 Air through an OpenAI-compatible endpoint at https://llm.wavespeed.ai/v1. Point the official OpenAI SDK at this base URL with your WaveSpeedAI API key; no other code changes are needed.

How do I get started with GLM 4.5 Air?

Sign in to WaveSpeedAI, create an API key under Access Keys, then send a request to https://llm.wavespeed.ai/v1/chat/completions with the model ID shown above. New accounts receive free credits to evaluate GLM 4.5 Air.
