GPT 5.5 Pro API

API Usage

Use the following code examples to integrate with our API:

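from openai import OpenAI

# Point the official OpenAI SDK at the WaveSpeedAI endpoint.
client = OpenAI(
    api_key="YOUR_API_KEY",  # replace with your WaveSpeedAI API key
    base_url="https://llm.wavespeed.ai/v1"
)

# Send a single-turn chat request to GPT-5.5 Pro.
response = client.chat.completions.create(
    model="openai/gpt-5.5-pro",
    messages=[
        {"role": "user", "content": "Hello!"}
    ]
)

# Print the text of the first returned choice.
print(response.choices[0].message.content)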

Model Introduction

OpenAI GPT-5.5 Pro

GPT-5.5 Pro is the highest-accuracy variant of OpenAI's GPT-5.5 frontier model, released April 23, 2026. It applies parallel test-time compute on top of the same GPT-5.5 base model to maximize correctness on hard tasks. It targets professional research, complex multi-step analysis, and production-grade agentic workflows where getting the answer right matters more than speed or cost. It is available to Pro, Business, and Enterprise users in ChatGPT, and to developers via the Responses API.

Why It Looks Great

Highest-accuracy mode in the GPT-5.5 family, built for correctness-critical workloads
1.05M-token context window (922K input, 128K output) for large-scale document reasoning
Parallel test-time compute delivers stronger results on hard coding, research, and analysis tasks
Same 60% hallucination reduction (relative to GPT-5.4) and multimodal capabilities as standard GPT-5.5
Responses API support for structured agentic tool-use workflows

Key Features

Context Window: 1,050,000 tokens
Max Input: 922,000 tokens
Max Output: 128,000 tokens
Vision: Supported
Function Calling: Supported
Thinking Mode: Supported (extended parallel reasoning)
API: Responses API only
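
The token limits above can be checked before a request is sent. The following is a minimal sketch, assuming the token counts are already known (for example, from a tokenizer). The function name `fits_limits` is hypothetical and not part of any SDK.

```python
# Limits copied from the Key Features list above.
CONTEXT_WINDOW = 1_050_000
MAX_INPUT = 922_000
MAX_OUTPUT = 128_000


def fits_limits(input_tokens: int, max_output_tokens: int) -> bool:
    """Return True if a request stays within the documented token limits."""
    if input_tokens < 0 or max_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return (
        input_tokens <= MAX_INPUT
        and max_output_tokens <= MAX_OUTPUT
        and input_tokens + max_output_tokens <= CONTEXT_WINDOW
    )


print(fits_limits(900_000, 100_000))  # True
print(fits_limits(950_000, 50_000))   # False: input exceeds 922,000
```

Note that the input and output maximums add up to exactly the context window (922,000 + 128,000 = 1,050,000), so the third check is redundant with the listed numbers and is kept only as a safeguard.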

Benchmarks

Benchmark	GPT-5.4 Pro	GPT-5.5	GPT-5.5 Pro	Claude Opus 4.7
SWE-bench Verified	—	88.7%	Higher	87.6%
MMLU	—	92.4%	Higher	—
Terminal-Bench 2.0	—	82.7%	Higher	—
GDPval (44 occupations)	—	84.9%	Higher	—
Hallucination rate	—	−60% vs 5.4	Best in class	—

Note: OpenAI has not published separate benchmark numbers for GPT-5.5 Pro, so the "Higher" and "Best in class" entries indicate the expected direction rather than measured scores. The Pro variant uses additional compute at inference time to improve accuracy beyond the standard model's scores.

Specifications

Specification	Value
Provider	OpenAI
Model Type	Large Language Model (LLM)
Architecture	Transformer (Frontier, Parallel Test-Time Compute)
Context Window	1,050,000 tokens
Max Input	922,000 tokens
Max Output	128,000 tokens
Input	Text, Image
Output	Text
Vision	Supported
Function Calling	Supported
Thinking Mode	Supported (parallel reasoning)
API	Responses API
Release Date	April 23, 2026

Pricing

Token Type	Cost per Million Tokens
Input	$30.00
Output	$180.00

GPT-5.5 Pro costs 6× the standard GPT-5.5 price for both input and output tokens. Use it for tasks where correctness justifies the cost, such as complex research synthesis, high-stakes code generation, and production decision support. For routine workloads, standard GPT-5.5 at $5.00 input and $30.00 output per million tokens is the better fit.
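
To make the price difference concrete, here is a rough cost estimator built from the per-million-token prices quoted in this section. It is a sketch only: the `estimate_cost` helper and the `"pro"` and `"standard"` labels are hypothetical names local to this example, not API model IDs, and the estimate ignores any caching or batch discounts.

```python
from decimal import Decimal

# USD per million tokens (input, output), as quoted in this section.
PRICES = {
    "pro": (Decimal("30.00"), Decimal("180.00")),
    "standard": (Decimal("5.00"), Decimal("30.00")),
}


def estimate_cost(tier: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request for the given pricing tier."""
    input_price, output_price = PRICES[tier]
    total = input_tokens * input_price + output_tokens * output_price
    return total / Decimal(1_000_000)


# Example: a 200,000-token prompt that produces a 10,000-token answer.
pro_cost = estimate_cost("pro", 200_000, 10_000)
standard_cost = estimate_cost("standard", 200_000, 10_000)
print(f"Pro: ${pro_cost:.2f}, standard: ${standard_cost:.2f}")
print(f"Ratio: {pro_cost / standard_cost}")
```

For this example the Pro request comes to $7.80 and the standard request to $1.30, which matches the 6× ratio.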

How to Use

Write your prompt — describe the task, provide context, and specify desired output format.
Submit — the model applies extended parallel reasoning and returns the response.
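
The first step can be sketched as a small helper that combines the task, context, and output format into one user message. The `build_messages` name and the "Task / Context / Output format" layout are assumptions made for illustration. The model does not require this particular structure.

```python
def build_messages(task: str, context: str, output_format: str) -> list[dict[str, str]]:
    """Combine the three parts of a prompt into a single user message."""
    content = (
        f"Task: {task}\n\n"
        f"Context: {context}\n\n"
        f"Output format: {output_format}"
    )
    return [{"role": "user", "content": content}]


messages = build_messages(
    task="Summarize the key risks in the attached contract.",
    context="The contract is a two-year software licensing agreement.",
    output_format="A bulleted list with at most five items.",
)
print(messages[0]["content"])
```

The returned list has the same shape as the `messages` argument used in a chat completions call.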

API Integration

Base URL: https://llm.wavespeed.ai/v1
API Endpoint: responses
Model ID: openai/gpt-5.5-pro

cURL

curl https://llm.wavespeed.ai/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "model": "openai/gpt-5.5-pro",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

GPT-5.5 Pro vs GPT-5.5 Standard

Aspect	GPT-5.5	GPT-5.5 Pro
Target workload	General professional tasks	Correctness-critical tasks
Reasoning	Standard	Parallel test-time compute
API input price	$5.00/M	$30.00/M
API output price	$30.00/M	$180.00/M
API type	Chat Completions	Responses API
Context window	1.05M	1.05M
ChatGPT access	Plus, Pro, Business, Enterprise	Pro, Business, Enterprise only

GPT 5.5 Pro API

openai/gpt-5.5-pro

GPT-5.5 Pro is the highest-accuracy variant of OpenAI's GPT-5.5 frontier model, using parallel test-time compute for correctness-critical workloads. It shares the same 1M+ context window and multimodal capabilities as GPT-5.5 standard, with enhanced reliability for professional research, complex analysis, and production-grade agentic tasks.

Frequently Asked Questions about GPT 5.5 Pro

How much does GPT 5.5 Pro cost via the API?

Pricing on WaveSpeedAI: $30.00 per million input tokens and $180.00 per million output tokens. Prompt caching and batch processing are billed separately and reduce effective cost on long, repetitive workloads.

What is the context window of GPT 5.5 Pro?

GPT 5.5 Pro supports up to 1,050,000 tokens (1.05M) of context, with up to 128,000 tokens of output per request.

Is GPT 5.5 Pro OpenAI-compatible?

Yes. WaveSpeedAI exposes GPT 5.5 Pro through an OpenAI-compatible endpoint at https://llm.wavespeed.ai/v1. Point the official OpenAI SDK at this base URL with your WaveSpeedAI API key — no other code changes required.

How do I get started with GPT 5.5 Pro?

Sign in to WaveSpeedAI, create an API key in Access Keys, then send a request to https://llm.wavespeed.ai/v1/chat/completions with the model ID set to openai/gpt-5.5-pro. New accounts receive free credits to evaluate GPT 5.5 Pro before paying per token.
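
If you prefer not to install an SDK, the same request can be assembled with the Python standard library. This is a sketch under stated assumptions: `build_request` and `send` are hypothetical helper names, and the response is assumed to follow the OpenAI chat completions shape, as the SDK example on this page suggests. Building the request does not contact the network. Only `send` does.

```python
import json
import urllib.request

BASE_URL = "https://llm.wavespeed.ai/v1"
MODEL_ID = "openai/gpt-5.5-pro"


def build_request(prompt: str, api_key: str) -> urllib.request.Request:
    """Build (but do not send) a chat completions request."""
    payload = {
        "model": MODEL_ID,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        url=f"{BASE_URL}/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def send(request: urllib.request.Request) -> str:
    """Send the request and return the text of the first choice.

    Assumes an OpenAI-style response body (an assumption, see above).
    """
    with urllib.request.urlopen(request) as response:
        body = json.load(response)
    return body["choices"][0]["message"]["content"]


request = build_request("Hello!", api_key="YOUR_API_KEY")
print(request.get_method(), request.full_url)
```

With a real key in place of `YOUR_API_KEY`, calling `send(request)` performs the request.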
