z-ai/glm-5.1
202,752 context · $1.40/M input tokens · $4.40/M output tokens
GLM-5.1 delivers a major leap in coding capability, with particularly significant gains in handling long-horizon tasks. Unlike previous models built around minute-level interactions, GLM-5.1 can work independently and continuously on a single task for more than 8 hours, autonomously planning, executing, and improving itself throughout the process, ultimately delivering complete, engineering-grade results.
按量付费
无需预付费用,仅按实际使用量付费
使用以下代码示例接入我们的 API:
from openai import OpenAI
client = OpenAI(
api_key="YOUR_API_KEY",
base_url="https://llm.wavespeed.ai/v1"
)
response = client.chat.completions.create(
model="z-ai/glm-5.1",
messages=[
{"role": "user", "content": "Hello!"}
]
)
print(response.choices[0].message.content)z-ai/glm-5.1
GLM-5.1 delivers a major leap in coding capability, with particularly significant gains in handling long-horizon tasks. Unlike previous models built around minute-level interactions, GLM-5.1 can work independently and continuously on a single task for more than 8 hours, autonomously planning, executing, and improving itself throughout the process, ultimately delivering complete, engineering-grade results.
输入
$1.4 /M
输出
$4.4 /M
上下文
203K
工具调用
支持
WaveSpeedAI 定价:输入每百万 token $1.40,输出每百万 token $4.40。Prompt 缓存和批处理单独计费,可显著降低长上下文、高重复任务的实际成本。
GLM 5.1 单次请求最多支持 203K 上下文 token,输出最多 — token。
是的。WaveSpeedAI 通过 https://llm.wavespeed.ai/v1 的 OpenAI 兼容端点提供 GLM 5.1。把官方 OpenAI SDK 的 base URL 指向该地址,使用 WaveSpeedAI 的 API Key 即可,无需任何其他代码改动。
登录 WaveSpeedAI,在 Access Keys 中生成 API Key,使用上方显示的 model id 向 https://llm.wavespeed.ai/v1/chat/completions 发送请求。新账户可获得免费额度,用于试用 GLM 5.1。