50% di sconto sui modelli Vidu Q3 e Q3 Pro · Solo su WaveSpeedAI | 20 maggio – 2 giugno

Moondream3 Preview Query

wavespeed-ai /

Moondream3 Query answers natural language questions on images with visual Q&A and optional chain of thought for detailed explanations. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-text
Input

Trascina e rilascia o clicca per caricare

preview
Enable chain-of-thought reasoning to get more detailed explanations.
If set to true, the function will wait for the result before returning the response. This property is only available through the API.

Inattivo

{
  "answer": "The image shows a woman dressed in a princess costume, wearing a tiara and a necklace. She is standing in front of a building with cherry blossoms in the background. The woman is posing for the picture, looking directly at the camera. The scene evokes a sense of royalty and elegance, with the woman's attire and accessories suggesting a fairytale or fantasy setting."
}

$0.005per esecuzione·~200 / $1

Successivo:

EsempiVedi tutto

What the characters are doing?

What can you see from this image?

What emotions are visible in this scene?

What emotions are visible in this scene?

What is the person in the image doing?

What is the form of the character?

What clothes are the characters wearing?

What emotions are visible in this scene?

What is the person in the image doing?

What emotions are visible in this scene?

Modelli correlati

README

Moondream 3 — Visual Question Answering (VQA)

Moondream 3 Query is an advanced vision-language model designed to understand images and answer natural-language questions about them. It combines fast inference, accurate scene understanding, and optional reasoning for visual explanation — ideal for analysis, education, and creative applications.

✨ Key Features

  • Visual Q&A Ask questions about any image — people, objects, actions, or scenes — and receive natural language answers.

  • Chain-of-Thought Reasoning Enable reasoning mode to let the model explain how it reached its conclusion, useful for analysis and debugging.

  • Accurate Visual Understanding Trained on diverse, high-quality image-text datasets for reliable recognition of complex visual contexts.

  • Fast and Lightweight Optimized for low latency and efficient inference while maintaining strong reasoning performance.

⚙️ Example Usage

🔹 Basic Query

{
 "image": "https://example.com/photo.jpg",
 "prompt": "What is the person in the image doing?"
}

🔹 Query with Reasoning

{
 "image": "https://example.com/photo.jpg",
 "prompt": "What emotions are visible in this scene?",
 "reasoning": true
}

💡 Best Practices

  • Ask clear and specific questions for higher accuracy.
  • Enable reasoning mode for tasks that require multi-step or contextual analysis.
  • Supported image formats: JPEG, PNG, WebP
  • Maximum image size: 10 MB

💰 Pricing

  • $0.005 per request
  • Volume discounts available — please contact WaveSpeedAI for enterprise or batch pricing.
Accessibilità:Questo sito web utilizza modelli di intelligenza artificiale forniti da terze parti.

Moondream3 Preview Query API — Quick start

Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/wavespeed-ai/moondream3-preview/query with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status flips to completed, then read the output URL from data.outputs[0]. Examples for Moondream3 Preview Query below.

HTTP example
# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/wavespeed-ai/moondream3-preview/query" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY" \
  -d '{
    "prompt": "A cinematic shot of a city at sunset, soft golden light",
    "image": "https://example.com/your-input.jpg",
    "reasoning": false,
    "enable_sync_mode": false
}'

# Response includes a prediction id. Poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY"

# When status is "completed", read the output from data.outputs[0].
Node.js example
// npm install wavespeed
const WaveSpeed = require('wavespeed');

const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env

const result = await client.run("wavespeed-ai/moondream3-preview/query", {
        "prompt": "A cinematic shot of a city at sunset, soft golden light",
        "image": "https://example.com/your-input.jpg",
        "reasoning": false,
        "enable_sync_mode": false
});

console.log(result.outputs[0]); // → URL of the generated output
Python example
# pip install wavespeed
import wavespeed

output = wavespeed.run(
    "wavespeed-ai/moondream3-preview/query",
    {
    "prompt": "A cinematic shot of a city at sunset, soft golden light",
    "image": "https://example.com/your-input.jpg",
    "reasoning": false,
    "enable_sync_mode": false
}
)

print(output["outputs"][0])  # → URL of the generated output

Moondream3 Preview Query API — Frequently asked questions

What is the Moondream3 Preview Query API?

Moondream3 Preview Query is a WaveSpeedAI model for AI inference, exposed as a REST API on WaveSpeedAI. Moondream3 Query answers natural language questions on images with visual Q&A and optional chain of thought for detailed explanations. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing. You can call it programmatically or try it from the playground above.

How do I call the Moondream3 Preview Query API?

POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/wavespeed-ai/moondream3-preview-query.

How much does Moondream3 Preview Query cost per run?

Moondream3 Preview Query starts at $0.005 per run. That figure is the base price — the final charge scales with the parameters you set in the form (output size, length, count, references, or whatever knobs this model exposes), so a higher-quality or larger output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.

What inputs does Moondream3 Preview Query accept?

Key inputs: `prompt`, `image`, `enable_sync_mode`, `reasoning`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/wavespeed-ai/moondream3-preview-query.

How long does Moondream3 Preview Query take to generate?

Average end-to-end generation time on WaveSpeedAI is around 15 seconds per request — measured across recent runs. Queue time scales with global demand; live status is visible in the prediction record.

Can I use Moondream3 Preview Query outputs commercially?

Commercial usage rights depend on the model's license, set by its provider (WaveSpeedAI). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.