microsoft/phi-4
16,384 context · $0.07/M input tokens · $0.14/M output tokens
Microsoft Research Phi-4 is designed to perform well in complex reasoning tasks and can operate efficiently in situations with limited memory or where quick responses are needed. At 14 billion...
Bayar sesuai pemakaian
Tanpa biaya di muka, bayar hanya sesuai penggunaan
Gunakan contoh kode berikut untuk integrasi dengan API kami:
from openai import OpenAI
client = OpenAI(
api_key="YOUR_API_KEY",
base_url="https://llm.wavespeed.ai/v1"
)
response = client.chat.completions.create(
model="microsoft/phi-4",
messages=[
{"role": "user", "content": "Hello!"}
]
)
print(response.choices[0].message.content)microsoft phi-4
| Specification | Value |
|---|---|
| Provider | Microsoft |
| Model Type | Large Language Model (LLM) |
| Architecture | N/A |
| Context Window | 16384 tokens |
| Max Output | 4096 tokens |
| Input | Text |
| Output | Text |
| Vision | Supported |
| Function Calling | Supported |
| Token Type | Cost per Million Tokens |
|---|---|
| Input | $0.1 |
| Output | $0.1 |
Base URL: https://llm.wavespeed.ai/v1 API Endpoint: chat/completions Model ID: microsoft/phi-4
from openai import OpenAI
client = OpenAI(
api_key="YOUR_API_KEY",
base_url="https://llm.wavespeed.ai/v1"
)
response = client.chat.completions.create(
model="microsoft/phi-4",
messages=[
{"role": "user", "content": "Hello!"}
]
)
print(response.choices[0].message.content)
curl https://llm.wavespeed.ai/v1/chat/completions \
-H "Content-Type: application/json" \
-H "Authorization: Bearer YOUR_API_KEY" \
-d '{
"model": "microsoft/phi-4",
"messages": [{"role": "user", "content": "Hello!"}]
}'
Microsoft Research Phi-4 is designed to perform well in complex reasoning tasks and can operate efficiently in situations with limited memory or where quick responses are needed. At 14 billion...
Input
$0.065 /M
Output
$0.14 /M
Context
16K
Max Output
16K
Access microsoft/phi-4 through our unified API — OpenAI-compatible, no cold starts, transparent pricing.
Open Playground