Seedance 2.0 | Special Offer ✦ 10% OFF NOW
microsoft

microsoft/phi-4

microsoft/phi-4

16,384 context · $0.07/M input tokens · $0.14/M output tokens

Microsoft Research Phi-4 is designed to perform well in complex reasoning tasks and can operate efficiently in situations with limited memory or where quick responses are needed. At 14 billion...

Precios

Pago por uso

Sin costos iniciales, paga solo por lo que uses

Entrada$0.07 / M Tokens
Salida$0.14 / M Tokens

Uso de API

Usa los siguientes ejemplos de código para integrar con nuestra API:

from openai import OpenAI

client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://llm.wavespeed.ai/v1"
)

response = client.chat.completions.create(
    model="microsoft/phi-4",
    messages=[
        {"role": "user", "content": "Hello!"}
    ]
)

print(response.choices[0].message.content)

Introducción del modelo

Microsoft phi-4

microsoft phi-4


Why It Looks Great

  • Large Language Model architecture for efficient processing
  • 16384 context window for long document handling
  • Competitive pricing at $0.1/$0.1 per million tokens

Key Features

  • Context Window: 16384 tokens
  • Max Output: 4096 tokens
  • Vision: Supported
  • Function Calling: Supported

Specifications

SpecificationValue
ProviderMicrosoft
Model TypeLarge Language Model (LLM)
ArchitectureN/A
Context Window16384 tokens
Max Output4096 tokens
InputText
OutputText
VisionSupported
Function CallingSupported

Pricing

Token TypeCost per Million Tokens
Input$0.1
Output$0.1

How to Use

  1. Write your prompt — describe the task, provide context, and specify desired output format.
  2. Submit — the model processes your request and returns the response.

API Integration

Base URL: https://llm.wavespeed.ai/v1 API Endpoint: chat/completions Model ID: microsoft/phi-4


API Usage

Python SDK

from openai import OpenAI

client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://llm.wavespeed.ai/v1"
)

response = client.chat.completions.create(
    model="microsoft/phi-4",
    messages=[
        {"role": "user", "content": "Hello!"}
    ]
)

print(response.choices[0].message.content)

cURL

curl https://llm.wavespeed.ai/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "model": "microsoft/phi-4",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

Notes

  • Model: microsoft/phi-4
  • Provider: Microsoft

Info

Providermicrosoft
Typellm

Funcionalidades compatibles

Entrada
Text
Salida
Text
Contexto16,384
Salida máxima16,384
Vision-
Function Calling-

Guía de acceso a la API

Base URLhttps://llm.wavespeed.ai/v1
API Endpointchat/completions
Model IDmicrosoft/phi-4

microsoft/phi-4

Microsoft Research Phi-4 is designed to perform well in complex reasoning tasks and can operate efficiently in situations with limited memory or where quick responses are needed. At 14 billion...

Input

$0.065 /M

Output

$0.14 /M

Context

16K

Max Output

16K

Try microsoft/phi-4 on WaveSpeedAI

Access microsoft/phi-4 through our unified API — OpenAI-compatible, no cold starts, transparent pricing.

Open Playground