microsoft/phi-4

16,384 context · $0.07/M input tokens · $0.14/M output tokens

Microsoft Research Phi-4 is designed to perform well in complex reasoning tasks and can operate efficiently in situations with limited memory or where quick responses are needed. At 14 billion...

Pricing

Pay as you go

No upfront cost; pay only for what you use.

Input: $0.07 / M tokens
Output: $0.14 / M tokens
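
To make the per-token rates concrete, here is a minimal sketch that estimates the charge for one request from its token counts. The helper name `estimate_cost_usd` is hypothetical, and the sketch assumes billing is strictly linear in tokens at the listed rates, with no minimums or rounding rules.

```python
from decimal import Decimal

# Listed rates in USD per million tokens (copied from the pricing lines above).
INPUT_USD_PER_M = Decimal("0.07")
OUTPUT_USD_PER_M = Decimal("0.14")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD charge for one request (hypothetical helper).

    Assumes cost scales linearly with token counts at the listed rates.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_USD_PER_M
        + Decimal(output_tokens) * OUTPUT_USD_PER_M
    )
    return total / TOKENS_PER_M


if __name__ == "__main__":
    # Example: a 12,000-token prompt with a 2,000-token reply.
    print(estimate_cost_usd(12_000, 2_000))
```

Decimal arithmetic is used so that very small per-request amounts are not distorted by binary floating point.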

API Usage

Use the code examples below to integrate with our API:

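# Requires the OpenAI Python SDK (pip install openai)
from openai import OpenAI

# Point the client at the OpenAI-compatible WaveSpeedAI endpoint
client = OpenAI(
    api_key="YOUR_API_KEY",  # replace with your own key
    base_url="https://llm.wavespeed.ai/v1"
)

# Send a single-turn chat request to Phi-4
response = client.chat.completions.create(
    model="microsoft/phi-4",
    messages=[
        {"role": "user", "content": "Hello!"}
    ]
)

# Print the text of the first returned choice
print(response.choices[0].message.content)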
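
For repeated calls, the request above can be wrapped in a small function. The sketch below is one possible shape, not part of the official SDK: `ask` and `make_stub_client` are hypothetical names, and passing `max_tokens` assumes the endpoint honors that standard chat completions parameter. The stub client only mimics the response shape so the helper can be exercised offline.

```python
from types import SimpleNamespace

MODEL_ID = "microsoft/phi-4"


def ask(client, prompt: str, max_tokens: int = 512) -> str:
    """Send one user message and return the reply text (hypothetical helper).

    `client` is any object exposing chat.completions.create, such as an
    OpenAI client configured with the base URL shown above.
    """
    response = client.chat.completions.create(
        model=MODEL_ID,
        messages=[{"role": "user", "content": prompt}],
        max_tokens=max_tokens,  # assumed to be honored by the endpoint
    )
    # content can be None in the OpenAI response schema; normalize to a string.
    return response.choices[0].message.content or ""


def make_stub_client(reply: str):
    """Build an offline stand-in that mimics the response shape (for tests)."""
    def create(**kwargs):
        message = SimpleNamespace(content=reply)
        return SimpleNamespace(choices=[SimpleNamespace(message=message)])

    completions = SimpleNamespace(create=create)
    return SimpleNamespace(chat=SimpleNamespace(completions=completions))


if __name__ == "__main__":
    print(ask(make_stub_client("Hi there!"), "Hello!"))
```

With a real client, the call would be `ask(client, "Hello!")` using the `client` object constructed in the example above.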

Model Introduction

Microsoft phi-4

Why It Looks Great

  • Built for complex reasoning while running efficiently under limited memory or tight latency requirements
  • 16,384-token context window
  • Competitive pricing at $0.07 input / $0.14 output per million tokens

Key Features

  • Context Window: 16,384 tokens
  • Max Output: 16,384 tokens
  • Vision: Not supported (text input only)
  • Function Calling: Not supported

Specifications

Specification      Value
Provider           Microsoft
Model Type         Large Language Model (LLM)
Architecture       N/A
Context Window     16,384 tokens
Max Output         16,384 tokens
Input              Text
Output             Text
Vision             Not supported
Function Calling   Not supported
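
The context window limits how much text fits in a single request. As a rough illustration, the sketch below computes how many tokens remain for the reply after the prompt, under the assumption (common for chat models, but not stated on this page) that prompt and reply share the same 16,384-token window. The function names are hypothetical, and the 4-characters-per-token estimate is a crude heuristic, not the model's tokenizer.

```python
CONTEXT_WINDOW = 16_384  # tokens, from the listing


def rough_token_count(text: str) -> int:
    """Crude size estimate at about 4 characters per token (heuristic only)."""
    return -(-len(text) // 4)  # ceiling division; 0 for empty text


def max_completion_budget(prompt_tokens: int, reserve: int = 0) -> int:
    """Tokens left for the reply, assuming prompt and reply share the window.

    `reserve` holds back extra tokens as a safety margin for estimate error.
    """
    if prompt_tokens < 0 or reserve < 0:
        raise ValueError("prompt_tokens and reserve must be non-negative")
    return max(0, CONTEXT_WINDOW - prompt_tokens - reserve)


if __name__ == "__main__":
    prompt = "Summarize the following report. " * 100
    used = rough_token_count(prompt)
    print(used, max_completion_budget(used, reserve=256))
```

Any separate per-request output cap still applies on top of this budget.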

Pricing

Token Type   Cost per Million Tokens
Input        $0.07
Output       $0.14

How to Use

  1. Write your prompt — describe the task, provide context, and specify desired output format.
  2. Submit — the model processes your request and returns the response.
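
The first step above (task, context, output format) can be sketched as a small prompt builder. `build_messages` is a hypothetical helper, and the "Task / Context / Output format" labels are just one possible layout, not a format the model requires.

```python
def build_messages(task: str, context: str = "", output_format: str = "") -> list:
    """Assemble a single user message from the three prompt ingredients.

    Hypothetical helper: empty context or output_format sections are omitted.
    """
    if not task.strip():
        raise ValueError("task must not be empty")
    parts = [f"Task: {task.strip()}"]
    if context.strip():
        parts.append(f"Context:\n{context.strip()}")
    if output_format.strip():
        parts.append(f"Output format: {output_format.strip()}")
    # The chat completions API expects a list of role/content dictionaries.
    return [{"role": "user", "content": "\n\n".join(parts)}]


if __name__ == "__main__":
    messages = build_messages(
        task="Summarize the report",
        context="Q3 revenue rose.",
        output_format="three bullet points",
    )
    print(messages[0]["content"])
```

The returned list can be passed directly as the `messages` argument of a chat completions request.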

API Integration

Base URL: https://llm.wavespeed.ai/v1
API Endpoint: chat/completions
Model ID: microsoft/phi-4


cURL

curl https://llm.wavespeed.ai/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "model": "microsoft/phi-4",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
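
For environments without the OpenAI SDK, the same request can be built with the Python standard library. This is a sketch mirroring the cURL call above; `build_request` and `send` are hypothetical helper names, and the response parsing assumes the endpoint returns the standard OpenAI chat completions JSON shape.

```python
import json
import urllib.request

BASE_URL = "https://llm.wavespeed.ai/v1"
ENDPOINT = "chat/completions"
MODEL_ID = "microsoft/phi-4"


def build_request(api_key: str, prompt: str) -> urllib.request.Request:
    """Build (but do not send) the POST request shown in the cURL example."""
    payload = {
        "model": MODEL_ID,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        url=f"{BASE_URL}/{ENDPOINT}",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def send(request: urllib.request.Request, timeout: float = 30.0) -> str:
    """Send the request and return the reply text.

    Assumes an OpenAI-style response body: choices[0].message.content.
    """
    with urllib.request.urlopen(request, timeout=timeout) as resp:
        body = json.load(resp)
    return body["choices"][0]["message"]["content"]


if __name__ == "__main__":
    req = build_request("YOUR_API_KEY", "Hello!")
    print(req.get_method(), req.full_url)
    # print(send(req))  # uncomment once a real API key is set
```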

Notes

  • Model: microsoft/phi-4
  • Provider: Microsoft

Info

Provider: microsoft
Type: llm

Supported Features

Input: Text
Output: Text
Context: 16,384
Max Output: 16,384
Vision: Not supported
Function Calling: Not supported

API Access Guide

Base URL: https://llm.wavespeed.ai/v1
API Endpoint: chat/completions
Model ID: microsoft/phi-4

microsoft/phi-4

Input: $0.065 /M · Output: $0.14 /M · Context: 16K · Max Output: 16K

Try microsoft/phi-4 on WaveSpeedAI

Access microsoft/phi-4 through our unified API — OpenAI-compatible, no cold starts, transparent pricing.