google/gemma-3n-e4b-it

32,768 context · $0.06/M input tokens · $0.12/M output tokens

Gemma 3n E4B-it is optimized for efficient execution on mobile and low-resource devices, such as phones, laptops, and tablets. It supports multimodal inputs—including text, visual data, and audio—enabling diverse tasks...

Pricing

Pay as you go

No upfront cost; pay only for what you use.

Input: $0.06 / M tokens
Output: $0.12 / M tokens
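
To make the per-token pricing concrete, here is a minimal sketch that estimates the cost of a single request from the two listed rates. The helper name `estimate_cost` and the token counts in the example call are illustrative assumptions, not part of the WaveSpeedAI API; actual billing may round or count tokens differently.

```python
# Prices copied from the listing above (USD per million tokens).
INPUT_PRICE_PER_M = 0.06
OUTPUT_PRICE_PER_M = 0.12


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper)."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Example: a 20,000-token prompt that produces a 2,000-token reply.
cost = estimate_cost(input_tokens=20_000, output_tokens=2_000)
print(f"Estimated cost: ${cost:.6f}")
```

At these rates, output tokens cost twice as much as input tokens, so long replies tend to dominate the bill for short prompts.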

API Usage

Use the code examples below to integrate with our API:

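from openai import OpenAI

# Point the OpenAI SDK at the WaveSpeedAI OpenAI-compatible endpoint.
client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://llm.wavespeed.ai/v1"
)

# Send a single-turn chat request to the model.
response = client.chat.completions.create(
    model="google/gemma-3n-e4b-it",
    messages=[
        {"role": "user", "content": "Hello!"}
    ]
)

# Print the text of the first returned choice.
print(response.choices[0].message.content)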

Model Introduction

Google gemma-3n-e4b-it

Why It Looks Great

  • Large language model optimized for efficient execution on mobile and low-resource devices
  • 32,768-token context window for long document handling
  • Pay-as-you-go pricing at $0.06 (input) / $0.12 (output) per million tokens

Key Features

  • Context Window: 32768 tokens
  • Max Output: 4096 tokens
  • Vision: Supported
  • Function Calling: Supported
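
The two limits above interact when a completion length is chosen. The sketch below assumes, as is common for chat APIs but not confirmed by this page, that prompt tokens and output tokens share the 32,768-token context window. The helper `clamp_max_tokens` is hypothetical, and the prompt token count would need to come from a tokenizer of your choice.

```python
CONTEXT_WINDOW = 32_768  # tokens, from the listing above
MAX_OUTPUT = 4_096       # tokens, from the listing above


def clamp_max_tokens(prompt_tokens: int, requested: int) -> int:
    """Return an output budget that respects both listed limits.

    Assumption: prompt and output tokens share one context window.
    """
    remaining = CONTEXT_WINDOW - prompt_tokens
    if remaining <= 0:
        raise ValueError("prompt does not fit in the context window")
    return min(requested, MAX_OUTPUT, remaining)


print(clamp_max_tokens(prompt_tokens=1_000, requested=8_000))
print(clamp_max_tokens(prompt_tokens=30_000, requested=4_096))
```

The first call is capped by the maximum output size, the second by the space left in the context window.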

Specifications

Specification       Value
Provider            Google
Model Type          Large Language Model (LLM)
Architecture        N/A
Context Window      32768 tokens
Max Output          4096 tokens
Input               Text
Output              Text
Vision              Supported
Function Calling    Supported

Pricing

Token Type    Cost per Million Tokens
Input         $0.06
Output        $0.12

How to Use

  1. Write your prompt — describe the task, provide context, and specify desired output format.
  2. Submit — the model processes your request and returns the response.
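
As a rough illustration of step 1, the sketch below assembles a task description, optional context, and a desired output format into a single user message in the `messages` shape used by the API examples on this page. The function `build_messages` and its section labels are illustrative assumptions, not a format the model requires.

```python
def build_messages(task: str, context: str = "", output_format: str = "") -> list[dict]:
    """Combine task, context, and output format into one user message (hypothetical helper)."""
    parts = [task.strip()]
    if context:
        parts.append(f"Context:\n{context.strip()}")
    if output_format:
        parts.append(f"Output format: {output_format.strip()}")
    return [{"role": "user", "content": "\n\n".join(parts)}]


messages = build_messages(
    task="Summarize the release notes.",
    context="v1.2 adds retries and fixes a timeout bug.",
    output_format="three short bullet points",
)
print(messages[0]["content"])
```

The resulting list can be passed as the `messages` argument of a chat completion request.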

API Integration

Base URL: https://llm.wavespeed.ai/v1
API Endpoint: chat/completions
Model ID: google/gemma-3n-e4b-it
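
The three values above combine into one HTTP request. The following sketch builds that request with the Python standard library only and does not send it. The helper name `build_request` is hypothetical, and the bearer-token header is an assumption based on the usual convention for OpenAI-compatible APIs.

```python
import json
import urllib.request

BASE_URL = "https://llm.wavespeed.ai/v1"
ENDPOINT = "chat/completions"
MODEL_ID = "google/gemma-3n-e4b-it"


def build_request(api_key: str, prompt: str) -> urllib.request.Request:
    """Build, but do not send, a chat completion request (hypothetical helper)."""
    payload = {
        "model": MODEL_ID,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        url=f"{BASE_URL}/{ENDPOINT}",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",  # assumed bearer-token auth
        },
        method="POST",
    )


req = build_request("YOUR_API_KEY", "Hello!")
print(req.get_method(), req.full_url)
```

Passing `req` to `urllib.request.urlopen` would send it; a real API key is required for that call to succeed.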


API Usage

cURL

curl https://llm.wavespeed.ai/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "model": "google/gemma-3n-e4b-it",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
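
The raw JSON returned by the cURL call still has to be unpacked. The sketch below assumes the response body follows the OpenAI chat completions schema (`choices[0].message.content`, plus an optional `usage` object), which is plausible for an OpenAI-compatible endpoint but not documented on this page. `SAMPLE_BODY` is an illustrative stand-in, not real model output.

```python
import json

# Illustrative response body in the assumed OpenAI-compatible shape.
SAMPLE_BODY = """
{
  "choices": [
    {"message": {"role": "assistant", "content": "Hi there!"}}
  ],
  "usage": {"prompt_tokens": 9, "completion_tokens": 4}
}
"""


def extract_reply(body: str) -> tuple[str, dict]:
    """Return the reply text and the usage counts, if any (hypothetical helper)."""
    data = json.loads(body)
    text = data["choices"][0]["message"]["content"]
    usage = data.get("usage", {})  # usage may be absent; fall back to an empty dict
    return text, usage


text, usage = extract_reply(SAMPLE_BODY)
print(text)
print(usage)
```

If the `usage` field is present, its token counts can be multiplied by the per-million-token rates to track spending per request.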

Notes

  • Model: google/gemma-3n-e4b-it
  • Provider: Google

Info

Provider: google
Type: llm

Supported features

Input: Text
Output: Text
Context: 32,768
Max output: 4,096
Vision: -
Function Calling: -

API access guide

Base URL: https://llm.wavespeed.ai/v1
API Endpoint: chat/completions
Model ID: google/gemma-3n-e4b-it

Try google/gemma-3n-e4b-it on WaveSpeedAI

Access google/gemma-3n-e4b-it through our unified API — OpenAI-compatible, no cold starts, transparent pricing.
