openai/gpt-5.5
1,050,000 context · $5.00/M input tokens · $30.00/M output tokens
GPT-5.5 is OpenAI's frontier model released April 23, 2026, featuring a 1M+ token context window (922K input, 128K output) with text and image support. It scores 88.7% on SWE-bench Verified and 92.4% on MMLU with 60% fewer hallucinations than GPT-5.4, excelling at agentic coding, computer use, and deep research while matching GPT-5.4 per-token latency.
Pay-as-you-go
No upfront cost; pay only for what you use
Use the following code examples to integrate our API:
```python
from openai import OpenAI

client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://llm.wavespeed.ai/v1"
)

response = client.chat.completions.create(
    model="openai/gpt-5.5",
    messages=[
        {"role": "user", "content": "Hello!"}
    ]
)

print(response.choices[0].message.content)
```

GPT-5.5 is OpenAI's frontier model, released on April 23, 2026, and designed for complex professional workloads including agentic coding, computer use, and deep research. Building on GPT-5.4, it delivers stronger reasoning, higher reliability with 60% fewer hallucinations, and improved token efficiency, matching GPT-5.4's per-token latency while performing at a significantly higher level of intelligence. It features a 1M+ token context window (922K input, 128K output) with support for text and image inputs.
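A quick way to sanity-check whether a large prompt fits the 922K-token input limit is a character-count heuristic. The 4-characters-per-token ratio below is a rough rule of thumb, not an exact tokenizer; use a real tokenizer for billing-accurate counts.

```python
# Rough check that a prompt fits GPT-5.5's 922K-token input limit.
MAX_INPUT_TOKENS = 922_000

def estimate_tokens(text: str) -> int:
    """Crude token estimate: ~4 characters per token for English text."""
    return max(1, len(text) // 4)

def fits_input_window(text: str, reserve: int = 0) -> bool:
    """True if the text (plus a reserved token budget) fits the input window."""
    return estimate_tokens(text) + reserve <= MAX_INPUT_TOKENS

# A 1,000,000-character document is roughly 250K tokens, well within range.
doc = "x" * 1_000_000
print(estimate_tokens(doc), fits_input_window(doc))  # 250000 True
```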
| Benchmark | GPT-5.4 | GPT-5.5 | Claude Opus 4.7 | Gemini 3.1 Pro |
|---|---|---|---|---|
| SWE-bench Verified | ~74% | 88.7% | 87.6% | 80.6% |
| MMLU | 91.1% | 92.4% | — | — |
| Terminal-Bench 2.0 | — | 82.7% | — | — |
| Expert-SWE | — | 73.1% | — | — |
| GDPval (44 occupations) | — | 84.9% | — | — |
| OSWorld-Verified | — | 78.7% | — | — |
| Hallucination rate | baseline | −60% | — | — |
| Specification | Value |
|---|---|
| Provider | OpenAI |
| Model Type | Large Language Model (LLM) |
| Architecture | Transformer (Frontier) |
| Context Window | 1,050,000 tokens |
| Max Input | 922,000 tokens |
| Max Output | 128,000 tokens |
| Input | Text, Image |
| Output | Text |
| Vision | Supported |
| Function Calling | Supported |
| Thinking Mode | Supported |
| Release Date | April 23, 2026 |
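Since the table lists vision support, here is a sketch of a multimodal message. It follows the standard OpenAI Chat Completions `image_url` content-part shape; whether this gateway accepts remote URLs, base64 data URLs, or both is an assumption worth checking against its documentation.

```python
# Build an OpenAI-style multimodal message for GPT-5.5's image input.
def vision_message(prompt: str, image_url: str) -> dict:
    """One user message combining text and an image reference."""
    return {
        "role": "user",
        "content": [
            {"type": "text", "text": prompt},
            {"type": "image_url", "image_url": {"url": image_url}},
        ],
    }

msg = vision_message("What is in this image?", "https://example.com/photo.png")
# Pass as: client.chat.completions.create(model="openai/gpt-5.5", messages=[msg])
```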
Note: GPT-5.5 is priced at 2× GPT-5.4 per token, but it uses significantly fewer tokens to complete the same tasks. Independent testing puts the net cost increase at roughly 20% once token efficiency is factored in.
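The pricing note works out as simple arithmetic; the 2× per-token price and the ~40% token reduction both come from this page.

```python
# Sanity-check the "roughly 20% net increase" claim: GPT-5.5 costs
# 2x GPT-5.4 per token but uses ~40% fewer tokens on the same tasks.
PRICE_MULTIPLIER = 2.0   # GPT-5.5 vs GPT-5.4 per-token price
TOKEN_EFFICIENCY = 0.60  # tokens used relative to GPT-5.4 (~40% fewer)

net_cost_ratio = PRICE_MULTIPLIER * TOKEN_EFFICIENCY
print(f"Net cost vs GPT-5.4: {net_cost_ratio:.2f}x (+{net_cost_ratio - 1:.0%})")
# 2.0 * 0.60 = 1.20x, i.e. the ~20% net increase cited above.
```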
Base URL: https://llm.wavespeed.ai/v1
API Endpoint: chat/completions
Model ID: openai/gpt-5.5
```python
from openai import OpenAI

client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://llm.wavespeed.ai/v1"
)

response = client.chat.completions.create(
    model="openai/gpt-5.5",
    messages=[{"role": "user", "content": "Hello!"}]
)

print(response.choices[0].message.content)
```
```bash
curl https://llm.wavespeed.ai/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "model": "openai/gpt-5.5",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
```
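The same request can be built with only the Python standard library, which is useful where the `openai` package is not available. This is a minimal sketch: `build_request` constructs the POST without sending it, and `send` performs real network I/O and needs a valid API key.

```python
import json
import urllib.request

BASE_URL = "https://llm.wavespeed.ai/v1"

def build_request(api_key: str, model: str, messages: list) -> urllib.request.Request:
    """Construct the chat/completions POST request without sending it."""
    body = json.dumps({"model": model, "messages": messages}).encode("utf-8")
    return urllib.request.Request(
        f"{BASE_URL}/chat/completions",
        data=body,
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )

def send(req: urllib.request.Request) -> str:
    """Send the request and return the first choice's text."""
    with urllib.request.urlopen(req) as resp:
        data = json.loads(resp.read())
    return data["choices"][0]["message"]["content"]

req = build_request("YOUR_API_KEY", "openai/gpt-5.5",
                    [{"role": "user", "content": "Hello!"}])
# print(send(req))  # uncomment with a real API key
```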
| Aspect | GPT-5.4 | GPT-5.5 |
|---|---|---|
| SWE-bench Verified | ~74% | 88.7% |
| MMLU | 91.1% | 92.4% |
| Hallucination rate | baseline | −60% |
| Context window | 1.05M | 1.05M (922K input) |
| API input price | $2.50/M | $5.00/M |
| API output price | $15.00/M | $30.00/M |
| Computer use | Improving | Production-grade |
| Multi-step tool chains | Single-shot preferred | Full autonomous loops |
| Token efficiency | baseline | ~40% fewer tokens on same tasks |
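The "full autonomous loops" the table refers to can be sketched as a simple model/tool cycle: call the model, execute any tool call it emits, feed the result back, and repeat until it answers in plain text. Here `fake_model` is a stub standing in for `client.chat.completions.create`, and the `add` tool is purely illustrative; the native tool-calling message format has more fields than shown.

```python
# Minimal agentic loop: model -> tool -> model ... until a text answer.
def run_tool(name: str, args: dict) -> str:
    """Dispatch a tool call to a local implementation (illustrative)."""
    tools = {"add": lambda a: str(a["x"] + a["y"])}
    return tools[name](args)

def agent_loop(call_model, user_prompt: str, max_steps: int = 5) -> str:
    """Loop until the model replies with plain text instead of a tool call."""
    messages = [{"role": "user", "content": user_prompt}]
    for _ in range(max_steps):
        reply = call_model(messages)  # {"tool": ..., "args": ...} or {"text": ...}
        if "text" in reply:
            return reply["text"]
        result = run_tool(reply["tool"], reply["args"])
        messages.append({"role": "tool", "content": result})
    raise RuntimeError("agent did not converge")

# Stub model: first requests the add tool, then answers with its result.
def fake_model(messages):
    if messages[-1]["role"] == "user":
        return {"tool": "add", "args": {"x": 2, "y": 3}}
    return {"text": f"The sum is {messages[-1]['content']}."}

print(agent_loop(fake_model, "What is 2 + 3?"))  # The sum is 5.
```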
Input
$5 /M
Output
$30 /M
Context
1050K
Max output
128K
Vision
Supported
Tool use
Supported
Access GPT 5.5 through our unified API: OpenAI-compatible, no cold starts, transparent pricing.
Pricing on WaveSpeedAI: $5.00 per million input tokens and $30.00 per million output tokens. Prompt caching and batch processing are billed separately and reduce the effective cost of long, repetitive workloads.
GPT 5.5 supports up to 1050K tokens of context and up to 128K output tokens per request.
Yes. WaveSpeedAI exposes GPT 5.5 through an OpenAI-compatible endpoint at https://llm.wavespeed.ai/v1. Point the official OpenAI SDK at this base URL with your WaveSpeedAI API key; no other code changes are required.
Sign in to WaveSpeedAI, create an API key under Access Keys, then send a request to https://llm.wavespeed.ai/v1/chat/completions with the model ID shown above. New accounts receive free credits to evaluate GPT 5.5.