Vidu Q3 और Q3 Pro मॉडल पर 50% छूट · केवल WaveSpeedAI | 20 मई – 2 जून

SAM 3D Objects

wavespeed-ai /

Advanced SAM 3D objects generation model for creating detailed 3D object models from images with text prompts and optional mask-based segmentation. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-3d
Input

Drag & drop करें या upload के लिए click करें

preview

Idle

$0.02per run·~50 / $1

ExamplesView all

front-view bright pink wired on-ear headphones on a clean white background.

front-view bright pink wired on-ear headphones on a clean white background.

the entire white retro handheld game console, not the background

the entire white retro handheld game console, not the background

the entire mint green rotary telephone in the image, including the handset, base, and coiled cord, as one complete object for 3d reconstruction, ignore the plain white background

the entire mint green rotary telephone in the image, including the handset, base, and coiled cord, as one complete object for 3d reconstruction, ignore the plain white background

A sitting teddy bear

A sitting teddy bear

a single bright orange traffic cone with two white reflective stripes, centered in the image, isolated object, clear edges, clean silhouette

a single bright orange traffic cone with two white reflective stripes, centered in the image, isolated object, clear edges, clean silhouette

Related Models

README

WaveSpeedAI SAM 3D Objects Image-to-3D

SAM 3D Objects (wavespeed-ai/sam-3d-objects) is a SAM-powered image-to-3D model that converts a single image into a 3D object asset. It’s designed for practical 3D pipelines where you want a fast way to turn product photos, props, or simple objects into editable 3D geometry.

For better isolation in complex scenes, you can optionally provide mask images to guide segmentation and get cleaner, more stable reconstructions.

Key capabilities

  • Image-to-3D object reconstruction from a single input image
  • Optional mask-guided segmentation to isolate a target object from cluttered backgrounds
  • Prompt-guided generation (optional) to nudge style/interpretation when the image is ambiguous
  • Stable, iterative workflow (generate → tweak mask/prompt → re-run) for asset refinement
  • Pipeline-friendly outputs returned as downloadable URLs (exact 3D format depends on endpoint configuration)
  • Best results on single, clearly visible objects with minimal occlusion

Parameters and how to use

  • image: (required) Input image URL for 3D object generation.
  • prompt: Text prompt to guide 3D object generation.
  • mask_images: Optional array of mask image URLs for specific region processing.

Prompt

Use prompt when the image alone leaves too much ambiguity (shape, material, or category). Keep it short and concrete:

  • Name the object and any critical parts: “leather handbag with a gold clasp”
  • If needed, add material/finish hints: “matte ceramic”, “glossy plastic”
  • Avoid long, story-like prompts—this model is primarily driven by the input image.

Media (Images)

Input image (image)

  • Provide a direct, accessible URL to the image.
  • If your image is local, upload it first.

Mask images (mask_images, optional)

  • Provide one or more mask image URLs (uploaded the same way as image).
  • Critical: the mask must match the input image size and framing exactly.
  • Keep masks tight to the object silhouette for cleaner geometry.
  • If you don’t need multi-region control, start with a single mask.

Note: The docs don’t mandate specific mask color conventions (white/black). If you see unexpected results, try a clean binary mask and iterate.

Other parameters

  • mask_images

  • Type: array of URLs

  • What it does: constrains processing to the masked region(s), helping isolate the target object.

  • Recommended: provide only what you need; start with one well-aligned mask.

  • prompt

  • Type: string

  • What it does: nudges generation when the image is ambiguous.

  • Recommended: short, object-focused hints (object name + material/finish if helpful).

After you finish configuring the parameters, click Run, preview the result, and iterate if needed.

Pricing

$0.02 per run

Notes

  • Avoid heavy occlusions (hands covering the object, overlapping items). Occlusion often causes broken or unstable geometry.
  • Mask alignment matters more than prompt quality. If results look “melted” or include background artifacts, fix the mask first.
  • The API is asynchronous: submit a task, then fetch the final asset URLs from the result endpoint when the status is completed.

Related Models

  • SAM 3D Body – Similar workflow, tuned for reconstructing human bodies from images with optional mask guidance.
  • Hyper3D Rodin v2 (Image-to-3D) – Image-to-3D model focused on production-ready assets (clean meshes with UVs/textures).
  • Tripo3D v2.5 (Image-to-3D) – Strong single-image reconstruction with controls geared toward edit-friendly outputs.
  • Tencent Hunyuan3D v2.1 – A higher-level 3D asset creation system for broader 3D workflows.
Accessibility:This website uses AI models provided by third parties.

Sam 3d Objects API — Quick start

Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/wavespeed-ai/sam-3d-objects with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status flips to completed, then read the output URL from data.outputs[0]. Examples for Sam 3d Objects below.

HTTP example
# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/wavespeed-ai/sam-3d-objects" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY" \
  -d '{
    "image": "https://example.com/your-input.jpg",
    "prompt": "A cinematic shot of a city at sunset, soft golden light"
}'

# Response includes a prediction id. Poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY"

# When status is "completed", read the output from data.outputs[0].
Node.js example
// npm install wavespeed
const WaveSpeed = require('wavespeed');

const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env

const result = await client.run("wavespeed-ai/sam-3d-objects", {
        "image": "https://example.com/your-input.jpg",
        "prompt": "A cinematic shot of a city at sunset, soft golden light"
});

console.log(result.outputs[0]); // → URL of the generated output
Python example
# pip install wavespeed
import wavespeed

output = wavespeed.run(
    "wavespeed-ai/sam-3d-objects",
    {
    "image": "https://example.com/your-input.jpg",
    "prompt": "A cinematic shot of a city at sunset, soft golden light"
}
)

print(output["outputs"][0])  # → URL of the generated output

Sam 3d Objects API — Frequently asked questions

What is the Sam 3d Objects API?

Sam 3d Objects is a WaveSpeedAI model for 3D asset generation from images, exposed as a REST API on WaveSpeedAI. Advanced SAM 3D objects generation model for creating detailed 3D object models from images with text prompts and optional mask-based segmentation. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing. You can call it programmatically or try it from the playground above.

How do I call the Sam 3d Objects API?

POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/wavespeed-ai/sam-3d-objects.

How much does Sam 3d Objects cost per run?

Sam 3d Objects starts at $0.020 per run. That figure is the base price — the final charge scales with the parameters you set in the form (output size, length, count, references, or whatever knobs this model exposes), so a higher-quality or larger output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.

What inputs does Sam 3d Objects accept?

Key inputs: `prompt`, `image`, `mask_images`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/wavespeed-ai/sam-3d-objects.

How long does Sam 3d Objects take to generate?

Average end-to-end generation time on WaveSpeedAI is around 33 seconds per request — measured across recent runs. Queue time scales with global demand; live status is visible in the prediction record.

Can I use Sam 3d Objects outputs commercially?

Commercial usage rights depend on the model's license, set by its provider (WaveSpeedAI). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.