google/gemini-3.5-flash
1,048,576 context · $1.50/M input tokens · $9.00/M output tokens
Gemini 3.5 Flash is Google’s high-efficiency multimodal model, delivering near-Pro-level reasoning and coding capabilities with Flash-class speed and cost efficiency. It is purpose-built for advanced coding workflows and parallel agentic execution, while supporting a wide range of input modalities including text, images, video, audio, and PDFs.
The model defaults to a medium reasoning mode to balance latency, quality, and cost, while also offering configurable thinking levels — minimal, low, medium, and high — for more precise performance and efficiency tuning across different workloads.
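As a minimal sketch of how a thinking level might be selected per request, the helper below builds a chat-completions payload and validates the level against the four values listed above. The `reasoning_effort` field name is borrowed from the OpenAI Chat Completions API; whether this endpoint honors it is an assumption, and `build_request` is a hypothetical helper, not part of any SDK.

```python
# Thinking levels as listed in the model description.
THINKING_LEVELS = ("minimal", "low", "medium", "high")


def build_request(prompt, thinking_level=None):
    """Build a chat-completions payload, optionally setting a thinking level."""
    payload = {
        "model": "google/gemini-3.5-flash",
        "messages": [{"role": "user", "content": prompt}],
    }
    if thinking_level is not None:
        if thinking_level not in THINKING_LEVELS:
            raise ValueError(f"unknown thinking level: {thinking_level!r}")
        # Assumption: the endpoint reads the OpenAI-style `reasoning_effort` field.
        payload["reasoning_effort"] = thinking_level
    return payload
```

If the field is accepted, the payload could be passed to the SDK as `client.chat.completions.create(**build_request("Hello!", "low"))`. Omitting the level leaves the model on its default medium mode.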
Pay as you go
No upfront costs, pay only for what you use
Use the following code example to integrate with our API:
from openai import OpenAI

client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://llm.wavespeed.ai/v1"
)

response = client.chat.completions.create(
    model="google/gemini-3.5-flash",
    messages=[
        {"role": "user", "content": "Hello!"}
    ]
)
print(response.choices[0].message.content)
| Specification | Value |
|---|---|
| Provider | Google |
| Model Type | Chat Completions model |
| Architecture | text+image+file+audio+video->text |
| Context Window | 1,048,576 tokens |
| Max Input | 983,040 tokens |
| Max Output | 65,536 tokens |
| Input | Text, Image, Video, File, Audio |
| Output | Text |
| Vision | Supported |
| Function Calling | Supported |
| Structured Outputs | Supported |
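The limits in the table can be checked before sending a request. The sketch below uses only the three figures listed above; note that the maximum input and maximum output together equal the context window. `fits_limits` is a hypothetical helper, and the token counts are assumed to come from whatever tokenizer or estimate you already have.

```python
# Limits taken from the specification table.
CONTEXT_WINDOW = 1_048_576
MAX_INPUT = 983_040
MAX_OUTPUT = 65_536


def fits_limits(input_tokens, max_output_tokens):
    """Return True if a request stays within the input, output, and context limits."""
    return (
        0 <= input_tokens <= MAX_INPUT
        and 0 < max_output_tokens <= MAX_OUTPUT
        and input_tokens + max_output_tokens <= CONTEXT_WINDOW
    )
```

For example, `fits_limits(983_040, 65_536)` is the largest request that passes, while `fits_limits(1_000_000, 1_000)` fails on the input limit even though the total is under the context window.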
Base URL: https://llm.wavespeed.ai/v1
API Endpoint: chat/completions
Model ID: google/gemini-3.5-flash
from openai import OpenAI

client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://llm.wavespeed.ai/v1"
)

response = client.chat.completions.create(
    model="google/gemini-3.5-flash",
    messages=[{"role": "user", "content": "Hello!"}]
)
print(response.choices[0].message.content)
curl https://llm.wavespeed.ai/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "model": "google/gemini-3.5-flash",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
Input: $1.50 /M tokens
Output: $9.00 /M tokens
Context: 1049K tokens
Max Output: 66K tokens
Vision: Supported
Tool Use: Supported
Access Gemini 3.5 Flash through our unified API: OpenAI-compatible, no cold starts, transparent pricing.
Pricing on WaveSpeedAI: $1.50 per million input tokens and $9.00 per million output tokens. Prompt caching and batch processing are billed separately and reduce the effective cost of long, repetitive workloads.
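A quick worked estimate of per-request cost at the listed rates can be sketched as follows. `estimate_cost_usd` is a hypothetical helper; it deliberately ignores prompt caching and batch processing, which are billed separately, so it should be read as an upper-bound approximation rather than a billing formula.

```python
# Listed WaveSpeedAI rates in USD per million tokens.
INPUT_USD_PER_M = 1.50
OUTPUT_USD_PER_M = 9.00


def estimate_cost_usd(input_tokens, output_tokens):
    """Estimate the cost of one request, ignoring caching and batch pricing."""
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


# 200,000 input tokens and 4,000 output tokens:
# 0.30 USD for input plus 0.036 USD for output.
print(f"{estimate_cost_usd(200_000, 4_000):.3f}")  # prints 0.336
```

Because output tokens cost six times as much as input tokens, capping the output length tends to matter more for cost than trimming the prompt by the same number of tokens.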
Gemini 3.5 Flash supports up to 1049K tokens of context, with up to 66K output tokens per request.
Yes. WaveSpeedAI serves Gemini 3.5 Flash through an OpenAI-compatible endpoint at https://llm.wavespeed.ai/v1. Point the official OpenAI SDK at this base URL with your WaveSpeedAI API key; no other code changes are needed.
Sign in to WaveSpeedAI, create an API key under Access Keys, then send requests to https://llm.wavespeed.ai/v1/chat/completions with the model id shown above. New accounts receive free credits to test Gemini 3.5 Flash.