
Any LLM


Any LLM is a versatile large language model for text generation, comprehension, and NLP tasks such as chat and summarization. It is available through a ready-to-use REST inference API with no cold starts and affordable pricing.

llm
Input
reasoning: Whether the model's reasoning should be part of the final answer.
enable_sync_mode: If set to true, the request waits for the result to be generated and uploaded before returning, so the result arrives directly in the response. This property is only available through the API.


$0.001 per run · ~1000 runs per $1


README

Any LLM

Any LLM is a unified large language model gateway that provides access to multiple state-of-the-art AI models through a single interface. Chat, reason, and generate text using models from Google, OpenAI, Anthropic, and more — all in one place.

Why It Stands Out

  • Multi-model access: Choose from a variety of leading AI models including Gemini, GPT, Claude, and more.
  • Unified interface: One consistent API and playground for all supported models.
  • System prompt support: Customize model behavior with custom instructions.
  • Reasoning mode: Enable step-by-step reasoning for complex problem-solving tasks.
  • Priority control: Choose between latency-optimized or quality-optimized responses.
  • Flexible parameters: Fine-tune temperature, max tokens, and other settings.
  • Prompt Enhancer: Built-in AI-powered prompt optimization for better results.

Parameters

Parameter         Required  Description
prompt            Yes       Your question or instruction to the model.
system_prompt     No        Custom instructions to guide model behavior.
reasoning         No        Include reasoning steps in the final answer.
priority          No        Optimize for latency or quality (default: latency).
temperature       No        Controls randomness (lower = focused, higher = creative).
max_tokens        No        Maximum length of the response.
model             No        Select which LLM to use (e.g., google/gemini-2.5-flash).
enable_sync_mode  No        Wait for result before returning response (API only).
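
As a rough illustration of how the parameters above fit together, the following Python sketch assembles a JSON-ready request body. The helper name `build_payload` and its validation rules are assumptions made for this example, not part of the WaveSpeedAI API; only the parameter names and the two priority values come from this page.

```python
from typing import Any, Dict

# Parameter names are taken from the table above.
# Everything else in this sketch is an assumption for illustration.
OPTIONAL_KEYS = {
    "system_prompt", "reasoning", "priority",
    "temperature", "max_tokens", "model", "enable_sync_mode",
}


def build_payload(prompt: str, **options: Any) -> Dict[str, Any]:
    """Return a request body, rejecting parameter names the table does not list."""
    if not isinstance(prompt, str) or not prompt.strip():
        raise ValueError("prompt is required and must be a non-empty string")
    unknown = set(options) - OPTIONAL_KEYS
    if unknown:
        raise ValueError(f"unknown parameters: {sorted(unknown)}")
    priority = options.get("priority")
    if priority is not None and priority not in ("latency", "quality"):
        raise ValueError("priority must be 'latency' or 'quality'")
    # Options passed as None are dropped so the service can apply its own defaults.
    payload: Dict[str, Any] = {"prompt": prompt}
    payload.update({k: v for k, v in options.items() if v is not None})
    return payload


payload = build_payload(
    "Summarize the plot of Hamlet in two sentences.",
    model="google/gemini-2.5-flash",
    priority="quality",
    temperature=0.2,
)
```

Catching a misspelled parameter name locally is cheaper than discovering it after a billed run, which is the main reason to validate before sending.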

Supported Models

  • google/gemini-2.5-flash
  • anthropic/claude-3.5-sonnet
  • openai/gpt-5-chat
  • And more...

How to Use

  1. Write your prompt — enter your question or instruction. Use the Prompt Enhancer for AI-assisted optimization.
  2. Add a system prompt (optional) — provide custom instructions to guide the model's behavior.
  3. Enable reasoning (optional) — turn on for step-by-step explanations.
  4. Select priority — choose "latency" for faster responses or "quality" for better outputs.
  5. Adjust parameters (optional) — set temperature and max_tokens as needed.
  6. Select a model — choose from available LLMs.
  7. Click Run and receive your response.

Best Use Cases

  • General Q&A — Get answers to questions across any topic.
  • Writing Assistance — Draft emails, articles, reports, and creative content.
  • Code Generation — Write, debug, and explain code in multiple languages.
  • Research & Analysis — Summarize documents, analyze data, and extract insights.
  • Reasoning Tasks — Solve math problems, logic puzzles, and complex reasoning challenges.
  • Brainstorming — Generate ideas, outlines, and creative concepts.

Pro Tips for Best Quality

  • Use system prompts to define the model's role, tone, and output format.
  • Enable reasoning for math, logic, and multi-step problems.
  • Lower temperature (0.1–0.3) for factual, consistent answers.
  • Higher temperature (0.7–1.0) for creative, varied responses.
  • Choose "latency" priority for quick interactions, "quality" for important tasks.
  • Experiment with different models to find the best fit for your use case.
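
The tips above can be captured as a small table of presets. This is only a sketch: the task labels (`factual`, `creative`, `math`) and the helper `settings_for` are hypothetical, the temperature values are picked from inside the ranges given above, and the pairing of each task with a priority is an assumption rather than a documented recommendation.

```python
from typing import Any, Dict, Optional

# Hypothetical task labels mapped to settings that follow the tips above.
PRESETS: Dict[str, Dict[str, Any]] = {
    "factual": {"temperature": 0.2, "reasoning": False, "priority": "latency"},
    "creative": {"temperature": 0.8, "reasoning": False, "priority": "quality"},
    "math": {"temperature": 0.2, "reasoning": True, "priority": "quality"},
}


def settings_for(task: str, system_prompt: Optional[str] = None) -> Dict[str, Any]:
    """Return a copy of the preset for a task, optionally adding a system prompt."""
    if task not in PRESETS:
        raise ValueError(f"unknown task {task!r}; choose from {sorted(PRESETS)}")
    settings = dict(PRESETS[task])  # copy so callers cannot mutate the preset
    if system_prompt is not None:
        settings["system_prompt"] = system_prompt
    return settings


math_settings = settings_for("math", system_prompt="Show each step, then the answer.")
```

The returned dictionary can be merged into a request body alongside `prompt` and `model`.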

Notes

  • Processing time varies based on model selection and prompt complexity.
  • Please ensure your prompts comply with usage guidelines.
Accessibility: The AI models used on this site are provided by third parties.

Any LLM API: Quick start

Get a WaveSpeedAI API key, then send a POST request to https://api.wavespeed.ai/api/v3/wavespeed-ai/any-llm with your input as JSON. The endpoint returns a prediction ID; poll the prediction endpoint until status becomes completed, then read the output from data.outputs[0]. Examples for Any LLM follow.

HTTP example

# Submit the prediction.
# Optional parameters such as system_prompt and max_tokens are omitted here.
curl -X POST "https://api.wavespeed.ai/api/v3/wavespeed-ai/any-llm" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY" \
  -d '{
    "prompt": "Summarize the benefits of unit testing in three bullet points.",
    "reasoning": false,
    "priority": "latency",
    "temperature": 0,
    "model": "google/gemini-2.5-flash",
    "enable_sync_mode": false
}'

# The response includes a prediction ID. Substitute it for {request_id} and poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY"

# When status is "completed", read the output from data.outputs[0].

Node.js example

// npm install wavespeed
const WaveSpeed = require('wavespeed');

const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env

// CommonJS modules do not allow top-level await, so the call runs inside an async function.
async function main() {
  const result = await client.run("wavespeed-ai/any-llm", {
    prompt: "Summarize the benefits of unit testing in three bullet points.",
    reasoning: false,
    priority: "latency",
    temperature: 0,
    model: "google/gemini-2.5-flash",
    enable_sync_mode: false,
  });

  console.log(result.outputs[0]); // the model's output
}

main().catch((err) => {
  console.error(err);
  process.exit(1);
});

Python example

# pip install wavespeed
import wavespeed

# Python booleans are written False/True, unlike JSON's false/true.
output = wavespeed.run(
    "wavespeed-ai/any-llm",
    {
        "prompt": "Summarize the benefits of unit testing in three bullet points.",
        "reasoning": False,
        "priority": "latency",
        "temperature": 0,
        "model": "google/gemini-2.5-flash",
        "enable_sync_mode": False,
    },
)

print(output["outputs"][0])  # the model's output
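
The submit-then-poll flow described in the quick start can also be sketched with only the Python standard library. The result URL and the `data.status` / `data.outputs` fields come from this page; the `"failed"` status name, the `error` field, the default interval and timeout, and the helper names `fetch_result` and `wait_for_output` are assumptions made for illustration.

```python
import json
import time
import urllib.request
from typing import Any, Callable, Dict

RESULT_URL = "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result"


def fetch_result(request_id: str, api_key: str) -> Dict[str, Any]:
    """GET the prediction record once and return the decoded JSON body."""
    req = urllib.request.Request(
        RESULT_URL.format(request_id=request_id),
        headers={"Authorization": f"Bearer {api_key}"},
    )
    with urllib.request.urlopen(req, timeout=30) as resp:
        return json.load(resp)


def wait_for_output(
    fetch: Callable[[], Dict[str, Any]],
    interval: float = 1.0,
    timeout: float = 60.0,
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> Any:
    """Call fetch() until status is "completed", then return data.outputs[0]."""
    deadline = clock() + timeout
    while True:
        data = fetch().get("data", {})
        status = data.get("status")
        if status == "completed":
            return data["outputs"][0]
        if status == "failed":  # assumed name for the failure status
            raise RuntimeError(f"prediction failed: {data.get('error')}")
        if clock() >= deadline:
            raise TimeoutError(f"no result after {timeout} seconds")
        sleep(interval)
```

A call might look like `wait_for_output(lambda: fetch_result(request_id, api_key))`, where `request_id` is the prediction ID returned by the POST request. Passing `fetch` as a callable keeps the loop testable without network access.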

Any LLM API: Frequently asked questions

What is the Any LLM API?

Any LLM is a WaveSpeedAI model exposed as a REST API for text generation, comprehension, and NLP tasks such as chat and summarization. You can call it programmatically or try it from the playground above.

How do I call the Any LLM API?

POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status becomes "completed", then read the output from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. The full request/response shape is documented at https://wavespeed.ai/docs/docs-api/wavespeed-ai/any-llm.

How much does Any LLM cost per run?

Any LLM starts at $0.001 per run. That figure is the base price; the final charge scales with the parameters you set in the form, so a larger or higher-quality output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.
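
For budgeting, the base price gives a simple bound: a budget covers at most budget / 0.001 runs, and fewer if a request is charged above the base price. A minimal sketch of that arithmetic follows; the helper names `max_runs` and `min_cost` are hypothetical, and `Decimal` is used to avoid floating-point rounding on currency values.

```python
from decimal import Decimal

BASE_PRICE_USD = Decimal("0.001")  # starting price per run, as listed on this page


def max_runs(budget_usd: str, price_per_run: Decimal = BASE_PRICE_USD) -> int:
    """Upper bound on the number of runs a budget covers at the given price."""
    return int(Decimal(budget_usd) // price_per_run)


def min_cost(runs: int, price_per_run: Decimal = BASE_PRICE_USD) -> Decimal:
    """Lower bound on total cost; actual charges may exceed the base price."""
    return price_per_run * runs


print(max_runs("1"))   # 1000
print(min_cost(2500))  # 2.500
```

Treat both figures as estimates only, since the per-call charge recorded on each prediction is the authoritative number.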

What inputs does Any LLM accept?

Key inputs: `prompt`, `enable_sync_mode`, `max_tokens`, `model`, `priority`, `reasoning`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/wavespeed-ai/any-llm.

How long does Any LLM take to generate?

Average end-to-end generation time on WaveSpeedAI is around 15 seconds per request, measured across recent runs. Queue time scales with global demand; live status is visible in the prediction record.

Can I use Any LLM outputs commercially?

Commercial usage rights depend on the model's license, set by its provider (WaveSpeedAI). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.