openai/gpt-5.5-pro
1,050,000 context · $30.00/M input tokens · $180.00/M output tokens
GPT-5.5 Pro is the highest-accuracy variant of OpenAI's GPT-5.5 frontier model, using parallel test-time compute for correctness-critical workloads. It shares the same 1M+ context window and multimodal capabilities as GPT-5.5 standard, with enhanced reliability for professional research, complex analysis, and production-grade agentic tasks.
Pay-per-use
No upfront costs, pay only for what you use
Use the following code examples to integrate with our API:
from openai import OpenAI
client = OpenAI(
api_key="YOUR_API_KEY",
base_url="https://llm.wavespeed.ai/v1"
)
response = client.chat.completions.create(
model="openai/gpt-5.5-pro",
messages=[
{"role": "user", "content": "Hello!"}
]
)
print(response.choices[0].message.content)GPT-5.5 Pro is the highest-accuracy variant of OpenAI's GPT-5.5 frontier model, released April 23, 2026. It applies parallel test-time compute on top of the same GPT-5.5 base model to maximize correctness on hard tasks — targeting professional research, complex multi-step analysis, and production-grade agentic workflows where getting the answer right matters more than speed or cost. Available to Pro, Business, and Enterprise users in ChatGPT, and via the Responses API for developers.
| Benchmark | GPT-5.4 Pro | GPT-5.5 | GPT-5.5 Pro | Claude Opus 4.7 |
|---|---|---|---|---|
| SWE-bench Verified | — | 88.7% | Higher | 87.6% |
| MMLU | — | 92.4% | Higher | — |
| Terminal-Bench 2.0 | — | 82.7% | Higher | — |
| GDPval (44 occupations) | — | 84.9% | Higher | — |
| Hallucination rate | — | −60% vs 5.4 | Best in class | — |
Note: OpenAI has not published separate benchmark numbers for GPT-5.5 Pro vs standard. The Pro variant uses additional compute at inference time to improve accuracy beyond the standard model's scores.
| Specification | Value |
|---|---|
| Provider | OpenAI |
| Model Type | Large Language Model (LLM) |
| Architecture | Transformer (Frontier, Parallel Test-Time Compute) |
| Context Window | 1050000 tokens |
| Max Input | 922000 tokens |
| Max Output | 128000 tokens |
| Input | Text, Image |
| Output | Text |
| Vision | Supported |
| Function Calling | Supported |
| Thinking Mode | Supported (parallel reasoning) |
| API | Responses API |
| Release Date | April 23, 2026 |
| Token Type | Cost per Million Tokens |
|---|---|
| Input | $30.00 |
| Output | $180.00 |
GPT-5.5 Pro is 6× the standard GPT-5.5 price. Use it for tasks where correctness justifies the cost — complex research synthesis, high-stakes code generation, and production decision support. For routine workloads, GPT-5.5 standard at $5/$30 is the better fit.
Base URL: https://llm.wavespeed.ai/v1 API Endpoint: responses Model ID: openai/gpt-5.5-pro
from openai import OpenAI
client = OpenAI(
api_key="YOUR_API_KEY",
base_url="https://llm.wavespeed.ai/v1"
)
response = client.chat.completions.create(
model="openai/gpt-5.5-pro",
messages=[{"role": "user", "content": "Hello!"}]
)
print(response.choices[0].message.content)
curl https://llm.wavespeed.ai/v1/chat/completions \
-H "Content-Type: application/json" \
-H "Authorization: Bearer YOUR_API_KEY" \
-d '{
"model": "openai/gpt-5.5-pro",
"messages": [{"role": "user", "content": "Hello!"}]
}'
| Aspect | GPT-5.5 | GPT-5.5 Pro |
|---|---|---|
| Target workload | General professional tasks | Correctness-critical tasks |
| Reasoning | Standard | Parallel test-time compute |
| API input price | $5.00/M | $30.00/M |
| API output price | $30.00/M | $180.00/M |
| API type | Chat Completions | Responses API |
| Context window | 1.05M | 1.05M |
| ChatGPT access | Plus, Pro, Business, Enterprise | Pro, Business, Enterprise only |
openai/gpt-5.5-pro
GPT-5.5 Pro is the highest-accuracy variant of OpenAI's GPT-5.5 frontier model, using parallel test-time compute for correctness-critical workloads. It shares the same 1M+ context window and multimodal capabilities as GPT-5.5 standard, with enhanced reliability for professional research, complex analysis, and production-grade agentic tasks.
Input
$30 /M
Output
$180 /M
Context
1050K
Max Output
128K
Vision
Supported
Tool Use
Supported
Access GPT 5.5 Pro through our unified API — OpenAI-compatible, no cold starts, transparent pricing.
Pricing on WaveSpeedAI: $30.00 per million input tokens and $180.00 per million output tokens. Prompt caching and batch processing are billed separately and reduce effective cost on long, repetitive workloads.
GPT 5.5 Pro supports up to 1050K tokens of context with up to 128K tokens of output per request.
Yes. WaveSpeedAI exposes GPT 5.5 Pro through an OpenAI-compatible endpoint at https://llm.wavespeed.ai/v1. Point the official OpenAI SDK at this base URL with your WaveSpeedAI API key — no other code changes required.
Sign in to WaveSpeedAI, create an API key in Access Keys, then send a request to https://llm.wavespeed.ai/v1/chat/completions with model id set to the value shown above. New accounts receive free credits to evaluate GPT 5.5 Pro before paying per token.