Seedance 2.0 | Special Offer ✦ 10% OFF NOW
google

google/gemma-3n-e4b-it

google/gemma-3n-e4b-it

32,768 context · $0.06/M input tokens · $0.12/M output tokens

Gemma 3n E4B-it is optimized for efficient execution on mobile and low-resource devices, such as phones, laptops, and tablets. It supports multimodal inputs—including text, visual data, and audio—enabling diverse tasks...

Cennik

Płać za użycie

Bez kosztów początkowych, płacisz tylko za to, czego używasz

Wejście$0.06 / M Tokens
Wyjście$0.12 / M Tokens

Użycie API

Użyj poniższych przykładów kodu, aby zintegrować się z naszym API:

from openai import OpenAI

client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://llm.wavespeed.ai/v1"
)

response = client.chat.completions.create(
    model="google/gemma-3n-e4b-it",
    messages=[
        {"role": "user", "content": "Hello!"}
    ]
)

print(response.choices[0].message.content)

Wprowadzenie do modelu

Google gemma-3n-e4b-it

google gemma-3n-e4b-it


Why It Looks Great

  • Large Language Model architecture for efficient processing
  • 32768 context window for long document handling
  • Competitive pricing at $0.0/$0.0 per million tokens

Key Features

  • Context Window: 32768 tokens
  • Max Output: 4096 tokens
  • Vision: Supported
  • Function Calling: Supported

Specifications

SpecificationValue
ProviderGoogle
Model TypeLarge Language Model (LLM)
ArchitectureN/A
Context Window32768 tokens
Max Output4096 tokens
InputText
OutputText
VisionSupported
Function CallingSupported

Pricing

Token TypeCost per Million Tokens
Input$0.0
Output$0.0

How to Use

  1. Write your prompt — describe the task, provide context, and specify desired output format.
  2. Submit — the model processes your request and returns the response.

API Integration

Base URL: https://llm.wavespeed.ai/v1 API Endpoint: chat/completions Model ID: google/gemma-3n-e4b-it


API Usage

Python SDK

from openai import OpenAI

client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://llm.wavespeed.ai/v1"
)

response = client.chat.completions.create(
    model="google/gemma-3n-e4b-it",
    messages=[
        {"role": "user", "content": "Hello!"}
    ]
)

print(response.choices[0].message.content)

cURL

curl https://llm.wavespeed.ai/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "model": "google/gemma-3n-e4b-it",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

Notes

  • Model: google/gemma-3n-e4b-it
  • Provider: Google

Info

Providergoogle
Typellm

Obsługiwane funkcje

Wejście
Text
Wyjście
Text
Kontekst32,768
Maks. wyjście4,096
Vision-
Function Calling-

Przewodnik dostępu do API

Base URLhttps://llm.wavespeed.ai/v1
API Endpointchat/completions
Model IDgoogle/gemma-3n-e4b-it

google/gemma-3n-e4b-it

Gemma 3n E4B-it is optimized for efficient execution on mobile and low-resource devices, such as phones, laptops, and tablets. It supports multimodal inputs—including text, visual data, and audio—enabling diverse tasks...

Input

$0.06 /M

Output

$0.12 /M

Context

33K

Max Output

4K

Try google/gemma-3n-e4b-it on WaveSpeedAI

Access google/gemma-3n-e4b-it through our unified API — OpenAI-compatible, no cold starts, transparent pricing.

Open Playground