Seedance 2.0 | Special Offer ✦ 10% OFF NOW
microsoft

microsoft/phi-4

microsoft/phi-4

16,384 context · $0.07/M input tokens · $0.14/M output tokens

Microsoft Research Phi-4 is designed to perform well in complex reasoning tasks and can operate efficiently in situations with limited memory or where quick responses are needed. At 14 billion...

Цены

Оплата по факту использования

Никаких авансовых платежей — платите только за то, чем пользуетесь

Ввод$0.07 / M Tokens
Вывод$0.14 / M Tokens

Использование API

Используйте следующие примеры кода для интеграции с нашим API:

from openai import OpenAI

client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://llm.wavespeed.ai/v1"
)

response = client.chat.completions.create(
    model="microsoft/phi-4",
    messages=[
        {"role": "user", "content": "Hello!"}
    ]
)

print(response.choices[0].message.content)

Описание модели

Microsoft phi-4

microsoft phi-4


Why It Looks Great

  • Large Language Model architecture for efficient processing
  • 16384 context window for long document handling
  • Competitive pricing at $0.1/$0.1 per million tokens

Key Features

  • Context Window: 16384 tokens
  • Max Output: 4096 tokens
  • Vision: Supported
  • Function Calling: Supported

Specifications

SpecificationValue
ProviderMicrosoft
Model TypeLarge Language Model (LLM)
ArchitectureN/A
Context Window16384 tokens
Max Output4096 tokens
InputText
OutputText
VisionSupported
Function CallingSupported

Pricing

Token TypeCost per Million Tokens
Input$0.1
Output$0.1

How to Use

  1. Write your prompt — describe the task, provide context, and specify desired output format.
  2. Submit — the model processes your request and returns the response.

API Integration

Base URL: https://llm.wavespeed.ai/v1 API Endpoint: chat/completions Model ID: microsoft/phi-4


API Usage

Python SDK

from openai import OpenAI

client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://llm.wavespeed.ai/v1"
)

response = client.chat.completions.create(
    model="microsoft/phi-4",
    messages=[
        {"role": "user", "content": "Hello!"}
    ]
)

print(response.choices[0].message.content)

cURL

curl https://llm.wavespeed.ai/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "model": "microsoft/phi-4",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

Notes

  • Model: microsoft/phi-4
  • Provider: Microsoft

Info

Providermicrosoft
Typellm

Поддерживаемые возможности

Ввод
Text
Вывод
Text
Контекст16,384
Макс. вывод16,384
Vision-
Function Calling-

Руководство по доступу к API

Base URLhttps://llm.wavespeed.ai/v1
API Endpointchat/completions
Model IDmicrosoft/phi-4

microsoft/phi-4

Microsoft Research Phi-4 is designed to perform well in complex reasoning tasks and can operate efficiently in situations with limited memory or where quick responses are needed. At 14 billion...

Input

$0.065 /M

Output

$0.14 /M

Context

16K

Max Output

16K

Try microsoft/phi-4 on WaveSpeedAI

Access microsoft/phi-4 through our unified API — OpenAI-compatible, no cold starts, transparent pricing.

Open Playground