Seedance 2.0 | Special Offer ✦ 10% OFF NOW
openai
openai/gpt-5.5

openai/gpt-5.5

1,050,000 context · $5.00/M input tokens · $30.00/M output tokens

GPT-5.5 is OpenAI's frontier model released April 23, 2026, featuring a 1M+ token context window (922K input, 128K output) with text and image support. It scores 88.7% on SWE-bench Verified and 92.4% on MMLU with 60% fewer hallucinations than GPT-5.4, excelling at agentic coding, computer use, and deep research while matching GPT-5.4 per-token latency.

Pricing

Pay-per-use

No upfront costs, pay only for what you use

Input
272K $5.00 / M Tokens
> 272K $10.00 / M Tokens
Output
272K $30.00 / M Tokens
> 272K $45.00 / M Tokens

API Usage

Use the following code examples to integrate with our API:

from openai import OpenAI

client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://llm.wavespeed.ai/v1"
)

response = client.chat.completions.create(
    model="openai/gpt-5.5",
    messages=[
        {"role": "user", "content": "Hello!"}
    ]
)

print(response.choices[0].message.content)

Model Introduction

OpenAI GPT-5.5

GPT-5.5 is OpenAI's frontier model released on April 23, 2026, designed for complex professional workloads including agentic coding, computer use, and deep research. Building on GPT-5.4, it delivers stronger reasoning, higher reliability with 60% fewer hallucinations, and improved token efficiency — matching GPT-5.4 per-token latency while performing at a significantly higher level of intelligence. It features a 1M+ token context window (922K input, 128K output) with support for text and image inputs.


Why It Looks Great

  • Frontier-class reasoning with 88.7% SWE-bench Verified and 92.4% MMLU scores
  • 1M token context window for large-scale reasoning, coding, and multimodal workflows
  • 60% fewer hallucinations compared to GPT-5.4 with improved token efficiency
  • Three variants: standard, Thinking (extended reasoning), and Pro (highest accuracy)
  • Same per-token latency as GPT-5.4 despite the intelligence leap

Key Features

  • Context Window: 1050000 tokens
  • Max Input: 922000 tokens
  • Max Output: 128000 tokens
  • Vision: Supported
  • Function Calling: Supported
  • Thinking Mode: Supported (low / medium / high / xhigh effort levels)
  • Variants: GPT-5.5, GPT-5.5 Thinking, GPT-5.5 Pro

Benchmarks

BenchmarkGPT-5.4GPT-5.5Claude Opus 4.7Gemini 3.1 Pro
SWE-bench Verified~74%88.7%87.6%80.6%
MMLU91.1%92.4%
Terminal-Bench 2.082.7%
Expert-SWE73.1%
GDPval (44 occupations)84.9%
OSWorld-Verified78.7%
Hallucination ratebaseline−60%

Specifications

SpecificationValue
ProviderOpenAI
Model TypeLarge Language Model (LLM)
ArchitectureTransformer (Frontier)
Context Window1050000 tokens
Max Input922000 tokens
Max Output128000 tokens
InputText, Image
OutputText
VisionSupported
Function CallingSupported
Thinking ModeSupported
Release DateApril 23, 2026

Note: GPT-5.5 is 2× GPT-5.4 at the token level, but uses significantly fewer tokens to complete the same tasks. Independent testing puts the net cost increase at roughly 20% once token efficiency is factored in.


How to Use

  1. Write your prompt — describe the task, provide context, and specify desired output format.
  2. Submit — the model processes your request and returns the response.

API Integration

Base URL: https://llm.wavespeed.ai/v1 API Endpoint: chat/completions Model ID: openai/gpt-5.5


API Usage

Python SDK

from openai import OpenAI

client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://llm.wavespeed.ai/v1"
)

response = client.chat.completions.create(
    model="openai/gpt-5.5",
    messages=[{"role": "user", "content": "Hello!"}]
)

print(response.choices[0].message.content)

cURL

curl https://llm.wavespeed.ai/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "model": "openai/gpt-5.5",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

What's New vs GPT-5.4

AspectGPT-5.4GPT-5.5
SWE-bench Verified~74%88.7%
MMLU91.1%92.4%
Hallucination ratebaseline−60%
Context window1.05M1.05M (922K input)
API input price$2.50/M$5.00/M
API output price$15.00/M$30.00/M
Computer useImprovingProduction-grade
Multi-step tool chainsSingle-shot preferredFull autonomous loops
Token efficiencybaseline~40% fewer tokens on same tasks

Info

Provideropenai
Typellm

Supported Functionality

Input
TextImage
Output
Text
Context1,050,000
Max Output128,000
Vision✓ Supported
Function Calling✓ Supported

API Access Guide

Base URLhttps://llm.wavespeed.ai/v1
API Endpointchat/completions
Model IDopenai/gpt-5.5

GPT 5.5 API

openai/gpt-5.5

GPT-5.5 is OpenAI's frontier model released April 23, 2026, featuring a 1M+ token context window (922K input, 128K output) with text and image support. It scores 88.7% on SWE-bench Verified and 92.4% on MMLU with 60% fewer hallucinations than GPT-5.4, excelling at agentic coding, computer use, and deep research while matching GPT-5.4 per-token latency.

Input

$5 /M

Output

$30 /M

Context

1050K

Max Output

128K

Vision

Supported

Tool Use

Supported

Try GPT 5.5 on WaveSpeedAI

Access GPT 5.5 through our unified API — OpenAI-compatible, no cold starts, transparent pricing.

Open Playground

Frequently Asked Questions about GPT 5.5

How much does GPT 5.5 cost via the API?+

Pricing on WaveSpeedAI: $5.00 per million input tokens and $30.00 per million output tokens. Prompt caching and batch processing are billed separately and reduce effective cost on long, repetitive workloads.

What is the context window of GPT 5.5?+

GPT 5.5 supports up to 1050K tokens of context with up to 128K tokens of output per request.

Is GPT 5.5 OpenAI-compatible?+

Yes. WaveSpeedAI exposes GPT 5.5 through an OpenAI-compatible endpoint at https://llm.wavespeed.ai/v1. Point the official OpenAI SDK at this base URL with your WaveSpeedAI API key — no other code changes required.

How do I get started with GPT 5.5?+

Sign in to WaveSpeedAI, create an API key in Access Keys, then send a request to https://llm.wavespeed.ai/v1/chat/completions with model id set to the value shown above. New accounts receive free credits to evaluate GPT 5.5 before paying per token.

Related LLM APIs