DeepSeek V4 Pro | 1.6T MoE Flagship LLM API

Name: DeepSeek V4 Pro API
Brand: deepseek
Price: 1.84 USD
Availability: InStock

การใช้งาน API

ใช้ตัวอย่างโค้ดต่อไปนี้เพื่อผสานรวมกับ API ของเรา:

from openai import OpenAI

client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://llm.wavespeed.ai/v1"
)

response = client.chat.completions.create(
    model="deepseek/deepseek-v4-pro",
    messages=[
        {"role": "user", "content": "Hello!"}
    ]
)

print(response.choices[0].message.content)

แนะนำโมเดล

Deepseek deepseek-v4-pro

DeepSeek-V4-Pro is DeepSeek's most powerful open-source model, released on April 24, 2026. It is a 1.6 trillion parameter Mixture-of-Experts (MoE) language model with 49B active parameters, pre-trained on 33T tokens, supporting a context length of one million tokens. V4-Pro achieves performance on par with top closed-source models like GPT-5.4 and Claude Opus 4.6 across coding, reasoning, and agentic benchmarks — at a fraction of the cost.

Why It Looks Great

Mixture-of-Experts architecture with 1.6T total parameters and only 49B active for efficient inference
1000000 context window powered by Compressed Sparse Attention (CSA) and DeepSeek Sparse Attention (DSA)
World-class agentic capabilities — optimized for Claude Code, OpenClaw, OpenCode, and CodeBuddy

Key Features

Context Window: 1000000 tokens
Max Output: 384000 tokens
Vision: Not Supported
Function Calling: Supported
Thinking Mode: Supported (non-thinking / high / max)
JSON Output: Supported
FIM Completion: Supported (non-thinking mode only)

Benchmarks

Benchmark	V4-Pro	Claude Opus 4.6	GPT-5.4	Gemini 3.1 Pro
SWE-bench Verified	80.6	80.8	—	80.6
LiveCodeBench	93.5	88.8	91.7	91.7
Codeforces Rating	3206	—	3168	3052
MMLU-Pro	87.5	89.1	87.5	91.0
IMOAnswerBench	89.8	75.3	91.4	81.0
Terminal Bench 2.0	67.9	65.4	75.1	68.5
Toolathlon	51.8	47.2	54.6	48.8
BrowseComp	83.4	83.7	—	85.9

Specifications

Specification	Value
Provider	Deepseek
Model Type	Large Language Model (LLM)
Architecture	Mixture-of-Experts (MoE)
Total Parameters	1.6T (49B active)
Context Window	1000000 tokens
Max Output	384000 tokens
Input	Text
Output	Text
Vision	Not Supported
Function Calling	Supported
Thinking Mode	Supported (high / max)
Release Date	April 24, 2026

How to Use

Write your prompt — describe the task, provide context, and specify desired output format.
Submit — the model processes your request and returns the response.

API Integration

Base URL: https://llm.wavespeed.ai/v1 API Endpoint: chat/completions Model ID: deepseek/deepseek-v4-pro

API Usage

Python SDK

from openai import OpenAI

client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://llm.wavespeed.ai/v1"
)

response = client.chat.completions.create(
    model="deepseek/deepseek-v4-pro",
    messages=[{"role": "user", "content": "Hello!"}]
)

print(response.choices[0].message.content)

cURL

curl https://llm.wavespeed.ai/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "model": "deepseek/deepseek-v4-pro",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

Notes

Model: deepseek/deepseek-v4-pro
Provider: Deepseek
Open-source weights available on HuggingFace and ModelScope
Supports both OpenAI and Anthropic API formats
For complex Agent scenarios, use thinking mode with reasoning_effort set to max

DeepSeek V4 Pro API

deepseek/deepseek-v4-pro

DeepSeek V4 Pro is DeepSeek's flagship open-source model released in April 2026, featuring a 1.6T-parameter Mixture-of-Experts architecture with 49B parameters active per token. It supports a 1M-token context window through a novel hybrid attention mechanism combining Compressed Sparse Attention and DeepSeek Sparse Attention, reducing inference FLOPs to 27% and KV cache to 10% compared to V3.2 at million-token scale. Pre-trained on 33T tokens with post-training via GRPO reinforcement learning and on-policy distillation, V4 Pro delivers frontier-level performance in coding (LiveCodeBench 93.5, Codeforces 3206), math (IMOAnswerBench 89.8), and agentic tasks (SWE-bench Verified 80.6) — competitive with GPT-5.4 and Claude Opus 4.6 at a fraction of the cost. It natively supports thinking and non-thinking modes with configurable reasoning effort, function calling, JSON output, and has been specifically optimized for mainstream agent frameworks including Claude Code, OpenClaw, and OpenCode.

คำถามที่พบบ่อยเกี่ยวกับ DeepSeek V4 Pro

DeepSeek V4 Pro API ราคาเท่าไหร่?+

ราคาบน WaveSpeedAI: $1.84 ต่อล้านโทเคนอินพุต และ $3.66 ต่อล้านโทเคนเอาต์พุต Prompt caching และ batch processing คิดค่าบริการแยกและช่วยลดต้นทุนที่แท้จริงสำหรับภาระงานที่ยาวและทำซ้ำ

DeepSeek V4 Pro มี context window เท่าใด?+

DeepSeek V4 Pro รองรับสูงสุด 1049K โทเคนคอนเท็กซ์ และสูงสุด 384K โทเคนเอาต์พุตต่อคำขอ

DeepSeek V4 Pro เข้ากันได้กับ OpenAI หรือไม่?+

ใช่ WaveSpeedAI ให้บริการ DeepSeek V4 Pro ผ่าน endpoint ที่เข้ากันได้กับ OpenAI ที่ https://llm.wavespeed.ai/v1 ชี้ OpenAI SDK ทางการมาที่ base URL นี้ด้วย API key ของ WaveSpeedAI — ไม่ต้องแก้ไขโค้ดอื่น

จะเริ่มใช้ DeepSeek V4 Pro ได้อย่างไร?+

ลงชื่อเข้าใช้ WaveSpeedAI สร้าง API key ใน Access Keys จากนั้นส่งคำขอไปยัง https://llm.wavespeed.ai/v1/chat/completions พร้อม model id ที่แสดงด้านบน บัญชีใหม่จะได้รับเครดิตฟรีสำหรับทดลองใช้ DeepSeek V4 Pro

ราคา

ลองใช้โมเดล