Seedance 2.0 | Special Offer ✦ 10% OFF NOW
deepseek
deepseek/deepseek-v4-pro

deepseek/deepseek-v4-pro

1,048,576 context · $1.84/M input tokens · $3.66/M output tokens

DeepSeek V4 Pro is DeepSeek's flagship open-source model released in April 2026, featuring a 1.6T-parameter Mixture-of-Experts architecture with 49B parameters active per token. It supports a 1M-token context window through a novel hybrid attention mechanism combining Compressed Sparse Attention and DeepSeek Sparse Attention, reducing inference FLOPs to 27% and KV cache to 10% compared to V3.2 at million-token scale. Pre-trained on 33T tokens with post-training via GRPO reinforcement learning and on-policy distillation, V4 Pro delivers frontier-level performance in coding (LiveCodeBench 93.5, Codeforces 3206), math (IMOAnswerBench 89.8), and agentic tasks (SWE-bench Verified 80.6) — competitive with GPT-5.4 and Claude Opus 4.6 at a fraction of the cost. It natively supports thinking and non-thinking modes with configurable reasoning effort, function calling, JSON output, and has been specifically optimized for mainstream agent frameworks including Claude Code, OpenClaw, and OpenCode.

Harga

Bayar sesuai pemakaian

Tanpa biaya di muka, bayar hanya sesuai penggunaan

Input$1.84 / M Tokens
Output$3.66 / M Tokens

Penggunaan API

Gunakan contoh kode berikut untuk integrasi dengan API kami:

from openai import OpenAI

client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://llm.wavespeed.ai/v1"
)

response = client.chat.completions.create(
    model="deepseek/deepseek-v4-pro",
    messages=[
        {"role": "user", "content": "Hello!"}
    ]
)

print(response.choices[0].message.content)

Pengenalan Model

Deepseek deepseek-v4-pro

DeepSeek-V4-Pro is DeepSeek's most powerful open-source model, released on April 24, 2026. It is a 1.6 trillion parameter Mixture-of-Experts (MoE) language model with 49B active parameters, pre-trained on 33T tokens, supporting a context length of one million tokens. V4-Pro achieves performance on par with top closed-source models like GPT-5.4 and Claude Opus 4.6 across coding, reasoning, and agentic benchmarks — at a fraction of the cost.


Why It Looks Great

  • Mixture-of-Experts architecture with 1.6T total parameters and only 49B active for efficient inference
  • 1000000 context window powered by Compressed Sparse Attention (CSA) and DeepSeek Sparse Attention (DSA)
  • World-class agentic capabilities — optimized for Claude Code, OpenClaw, OpenCode, and CodeBuddy

Key Features

  • Context Window: 1000000 tokens
  • Max Output: 384000 tokens
  • Vision: Not Supported
  • Function Calling: Supported
  • Thinking Mode: Supported (non-thinking / high / max)
  • JSON Output: Supported
  • FIM Completion: Supported (non-thinking mode only)

Benchmarks

BenchmarkV4-ProClaude Opus 4.6GPT-5.4Gemini 3.1 Pro
SWE-bench Verified80.680.880.6
LiveCodeBench93.588.891.791.7
Codeforces Rating320631683052
MMLU-Pro87.589.187.591.0
IMOAnswerBench89.875.391.481.0
Terminal Bench 2.067.965.475.168.5
Toolathlon51.847.254.648.8
BrowseComp83.483.785.9

Specifications

SpecificationValue
ProviderDeepseek
Model TypeLarge Language Model (LLM)
ArchitectureMixture-of-Experts (MoE)
Total Parameters1.6T (49B active)
Context Window1000000 tokens
Max Output384000 tokens
InputText
OutputText
VisionNot Supported
Function CallingSupported
Thinking ModeSupported (high / max)
Release DateApril 24, 2026

How to Use

  1. Write your prompt — describe the task, provide context, and specify desired output format.
  2. Submit — the model processes your request and returns the response.

API Integration

Base URL: https://llm.wavespeed.ai/v1 API Endpoint: chat/completions Model ID: deepseek/deepseek-v4-pro


API Usage

Python SDK

from openai import OpenAI

client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://llm.wavespeed.ai/v1"
)

response = client.chat.completions.create(
    model="deepseek/deepseek-v4-pro",
    messages=[{"role": "user", "content": "Hello!"}]
)

print(response.choices[0].message.content)

cURL

curl https://llm.wavespeed.ai/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "model": "deepseek/deepseek-v4-pro",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

Notes

  • Model: deepseek/deepseek-v4-pro
  • Provider: Deepseek
  • Open-source weights available on HuggingFace and ModelScope
  • Supports both OpenAI and Anthropic API formats
  • For complex Agent scenarios, use thinking mode with reasoning_effort set to max

Info

Penyediadeepseek
Tipellm

Fitur yang Didukung

Input
Teks
Output
Teks
Konteks1,048,576
Output Maks384,000
Vision-
Function Calling✓ Didukung

Panduan Akses API

Base URLhttps://llm.wavespeed.ai/v1
API Endpointchat/completions
Model IDdeepseek/deepseek-v4-pro

DeepSeek V4 Pro API

deepseek/deepseek-v4-pro

DeepSeek V4 Pro is DeepSeek's flagship open-source model released in April 2026, featuring a 1.6T-parameter Mixture-of-Experts architecture with 49B parameters active per token. It supports a 1M-token context window through a novel hybrid attention mechanism combining Compressed Sparse Attention and DeepSeek Sparse Attention, reducing inference FLOPs to 27% and KV cache to 10% compared to V3.2 at million-token scale. Pre-trained on 33T tokens with post-training via GRPO reinforcement learning and on-policy distillation, V4 Pro delivers frontier-level performance in coding (LiveCodeBench 93.5, Codeforces 3206), math (IMOAnswerBench 89.8), and agentic tasks (SWE-bench Verified 80.6) — competitive with GPT-5.4 and Claude Opus 4.6 at a fraction of the cost. It natively supports thinking and non-thinking modes with configurable reasoning effort, function calling, JSON output, and has been specifically optimized for mainstream agent frameworks including Claude Code, OpenClaw, and OpenCode.

Input

$1.84 /M

Output

$3.66 /M

Konteks

1049K

Output Maks.

384K

Penggunaan Tool

Didukung

Coba DeepSeek V4 Pro di WaveSpeedAI

Akses DeepSeek V4 Pro melalui API terpadu kami — kompatibel dengan OpenAI, tanpa cold start, harga transparan.

Buka Playground

Pertanyaan Umum tentang DeepSeek V4 Pro

Berapa biaya DeepSeek V4 Pro melalui API?+

Harga di WaveSpeedAI: $1.84 per juta token input dan $3.66 per juta token output. Prompt caching dan batch processing ditagih terpisah dan mengurangi biaya efektif pada beban kerja yang panjang dan berulang.

Berapa context window DeepSeek V4 Pro?+

DeepSeek V4 Pro mendukung hingga 1049K token konteks dengan hingga 384K token output per permintaan.

Apakah DeepSeek V4 Pro kompatibel dengan OpenAI?+

Ya. WaveSpeedAI menyediakan DeepSeek V4 Pro melalui endpoint yang kompatibel dengan OpenAI di https://llm.wavespeed.ai/v1. Arahkan OpenAI SDK resmi ke base URL ini dengan API key WaveSpeedAI Anda — tanpa perubahan kode lainnya.

Bagaimana memulai dengan DeepSeek V4 Pro?+

Masuk ke WaveSpeedAI, buat API key di Access Keys, lalu kirim permintaan ke https://llm.wavespeed.ai/v1/chat/completions dengan model id seperti ditampilkan di atas. Akun baru menerima kredit gratis untuk menguji DeepSeek V4 Pro.

API LLM terkait