Hyper3d Rodin V2 Image To 3d

Hyper3D Rodin v2 turns a single image into production-ready 3D assets with clean topology, UVs and textures. It is served through a ready-to-use REST inference API with no cold starts and per-generation pricing.

Features

Hyper3D-Rodin-Gen-2 — Text / Image to 3D

Hyper3D-Rodin-Gen-2 is Hyper3D’s commercial 3D generation system that turns text prompts or 2D images into production-ready 3D assets with UVs and textures. It targets game art, film/TV, XR, product visualisation and even 3D printing workflows.


🔧 What it does

  • Text-to-3D & Image-to-3D in one model: Enter a prompt, upload one or more images, or combine both. Multi-view images help reconstruct more accurate shapes.

  • Geometry + textures, ready for DCC / engines: Outputs UV-unwrapped meshes plus PBR or shaded textures for use in Unity, Unreal, Blender, Maya, 3D printing pipelines, etc.

  • Topology & resolution control: Choose between quad meshes (good for sculpting / rigging) or triangle meshes (game-ready), and pick an approximate polycount tier.

  • Geometric & pose control: Optional bounding-box constraints and T/A-pose enforcement help keep characters and props within expected proportions and ready for rigging.


🧩 Parameters

1. Core inputs

  • prompt: Natural-language description of the object (shape, style, material, usage).

  • images (required): One to five reference images (front / side / 3-view / concept art).

    • With only text → Text-to-3D (not available on this image-to-3D endpoint, where images is required)
    • With images (and optional text) → Image-to-3D / guided Text-to-3D
  • material: Rendering/material mode for textures:

    • PBR – Physically-based maps (albedo, normal, roughness, metallic, etc.).
    • Shaded – Baked / stylised look.
    • All – Export both PBR and shaded variants.
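
Since the request parameter table notes that the first image is the one used for material generation, it can help to make the ordering explicit when assembling the core inputs. The following is a minimal sketch, assuming images are passed as strings (for example URLs); `build_core_inputs` is a hypothetical helper and not part of any WaveSpeedAI SDK.

```python
from typing import List, Optional

MAX_IMAGES = 5  # request parameter table: 1 ~ 5 items


def build_core_inputs(primary_image: str,
                      extra_views: Optional[List[str]] = None,
                      prompt: Optional[str] = None,
                      material: Optional[str] = None) -> dict:
    """Assemble prompt / images / material, keeping the primary image first.

    Hypothetical helper: the image that should drive material generation
    is always placed at index 0, followed by any additional views.
    """
    images = [primary_image] + list(extra_views or [])
    if len(images) > MAX_IMAGES:
        raise ValueError(f"at most {MAX_IMAGES} images are accepted, got {len(images)}")
    payload = {"images": images}
    if prompt:
        payload["prompt"] = prompt
    if material is not None:
        payload["material"] = material  # PBR, Shaded or All
    return payload
```

Optional fields are left out of the payload when they are not set, so the service defaults apply.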

2. Quality & mesh settings

  • quality_and_mesh: Controls mesh type and target polycount:

    • 4k_Quad, 8k_Quad, 18k_Quad, 50k_Quad → Quad-dominant topology at roughly 4k / 8k / 18k / 50k faces. Best for character work, sculpting, retopology and rigging.

    • 2K_Triangle, 20K_Triangle, 250K_Triangle, 500K_Triangle → Triangle meshes at increasing density. Good for game engines, previs, or high-detail props.

    Higher tiers give more detail but larger file sizes and longer generation time.

  • addons: Optional enhancement packs. Currently:

    • HighPack – Increases mesh and texture fidelity (higher polycount / resolution) for final-quality assets.
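
The tier names encode both topology and a nominal face count, so a rough face budget can be worked out before submitting. The sketch below assumes the number in each tier name is the nominal face count in thousands, and applies the "~16 times" Quad-mode factor that the request parameter table gives for HighPack. The function names are hypothetical, and the results are estimates rather than guaranteed output sizes.

```python
import re

# Tier names copied from the quality_and_mesh list above.
TIERS = ("4k_Quad", "8k_Quad", "18k_Quad", "50k_Quad",
         "2K_Triangle", "20K_Triangle", "250K_Triangle", "500K_Triangle")
_TIER_PATTERN = re.compile(r"^(\d+)[kK]_(Quad|Triangle)$")
HIGHPACK_QUAD_FACTOR = 16  # "~16 times" per the request parameter table


def parse_tier(tier: str) -> tuple:
    """Split a tier name into (topology, nominal face count)."""
    if tier not in TIERS:
        raise ValueError(f"unknown quality_and_mesh tier: {tier!r}")
    match = _TIER_PATTERN.match(tier)
    return match.group(2), int(match.group(1)) * 1000


def estimated_faces(tier: str, high_pack: bool = False) -> int:
    """Rough face estimate; the HighPack factor is only documented for Quad mode."""
    topology, faces = parse_tier(tier)
    if high_pack and topology == "Quad":
        faces *= HIGHPACK_QUAD_FACTOR
    return faces
```

For triangle tiers the page does not quantify the effect of HighPack on face count, so the sketch returns the nominal value unchanged.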

3. Output format

  • geometry_file_format: Choose which 3D file you want back:

    • glb – Compact, modern, web-friendly (recommended default).
    • fbx – Widely used for DCC and game engines.
    • obj – Simple geometry + MTL, highly compatible.
    • stl – For 3D printing workflows.
    • usdz – Apple-friendly AR format.

4. Advanced geometric control

  • bbox_condition: A ControlNet-style bounding box that limits the maximum size of the generated model (width / height / depth). Useful when you need consistent scaling across a whole asset library.

  • ta_pose: When enabled, forces humanoid characters into a T-pose / A-pose for easier rigging and animation downstream.

  • use_original_alpha: If your input image has transparency, this option lets the model respect the original alpha silhouette during generation (handy for cut-out product shots or stylised characters).

  • preview_render: Adds a quick preview render (e.g., turntable / shaded view) to the download bundle so you can inspect the result without opening a DCC tool.
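
The advanced controls can be gathered into one options dictionary and merged into the request body. This is a sketch under assumptions: the page only says that bbox_condition is an array covering width / height / depth, so the element order used here and the absence of units are guesses to check against the upstream documentation, and `advanced_options` is a hypothetical helper.

```python
from typing import Optional, Sequence


def advanced_options(bbox: Optional[Sequence[float]] = None,
                     ta_pose: bool = False,
                     use_original_alpha: bool = False,
                     preview_render: bool = False) -> dict:
    """Collect the optional geometric controls into request fields.

    ASSUMPTION: bbox is sent as [width, height, depth]; the page does not
    state the element order or the units.
    """
    options = {}
    if bbox is not None:
        if len(bbox) != 3 or any(side <= 0 for side in bbox):
            raise ValueError("bbox needs three positive values")
        options["bbox_condition"] = list(bbox)
    flags = (("ta_pose", ta_pose),
             ("use_original_alpha", use_original_alpha),
             ("preview_render", preview_render))
    for name, enabled in flags:
        if enabled:  # omit unset flags so service defaults apply
            options[name] = True
    return options
```

Defining one shared `bbox` tuple and passing it to every call is one way to keep scale consistent across related assets.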


5. Randomness & reproducibility

  • seed: Random seed for generation:

    • Leave empty / default → random each time.
    • Set to a fixed integer → reproduce the same model configuration (useful for iteration with small prompt tweaks).
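
The response fields listed on this page do not include the seed that was used, so one option is to choose the seed on the client and record it alongside the result. A minimal sketch, assuming any integer from 1 up to the documented maximum is accepted as a fixed seed; `resolve_seed` is a hypothetical helper.

```python
import random
from typing import Optional

SEED_MAX = 2147483647  # upper bound from the request parameter table


def resolve_seed(seed: Optional[int] = None,
                 rng: Optional[random.Random] = None) -> int:
    """Return a concrete seed so a run can be repeated later.

    With no seed given, one is drawn locally instead of letting the
    service pick, so the value is known and can be logged.
    """
    if seed is None:
        rng = rng or random.Random()
        return rng.randint(1, SEED_MAX)
    if not 1 <= seed <= SEED_MAX:
        raise ValueError(f"fixed seed must be in 1..{SEED_MAX}, got {seed}")
    return seed
```

Values that request a service-side random seed are rejected here on purpose, because they would defeat the goal of reproducing a run.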

🚀 Typical workflow

  1. Decide on input mode

    • For concepting: start with prompt only.
    • For fidelity: upload one or more reference images and optionally add a short prompt.
  2. Pick material & mesh quality

    • PBR + 8k_Quad or 18k_Quad for game/film characters.
    • PBR + 20K_Triangle for background props.
    • Add HighPack when you’re close to final asset quality.
  3. Set geometry_file_format to match your pipeline (e.g., glb for web, fbx for DCC, stl for printing).

  4. (Optional) Add bbox_condition, enable ta_pose for characters, and toggle use_original_alpha if your reference image uses transparency.

  5. (Optional) Turn on preview_render to get a ready-to-view render in the output zip.

  6. Set a seed if you want to be able to regenerate or slightly tweak the same base model.

  7. Click Run — once the job finishes, download the mesh + textures bundle and import into your DCC, engine, or 3D-printing tool.
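
The workflow above can also be expressed as a small payload builder for API use. The presets mirror the suggestions in steps 2 and 3; the preset names, the pipeline keys and the `workflow_payload` function are hypothetical, and the choice of 18k_Quad over 8k_Quad for characters is arbitrary.

```python
from typing import List, Optional

# Suggested combinations from the workflow steps above.
PRESETS = {
    "character": {"material": "PBR", "quality_and_mesh": "18k_Quad", "ta_pose": True},
    "background_prop": {"material": "PBR", "quality_and_mesh": "20K_Triangle"},
}
FORMAT_BY_PIPELINE = {"web": "glb", "dcc": "fbx", "printing": "stl"}


def workflow_payload(images: List[str], asset_kind: str, pipeline: str,
                     final: bool = False, seed: Optional[int] = None) -> dict:
    """Build a request body from an asset kind and a target pipeline."""
    payload = {"images": list(images)}
    payload.update(PRESETS[asset_kind])
    payload["geometry_file_format"] = FORMAT_BY_PIPELINE[pipeline]
    if final:
        payload["addons"] = "HighPack"  # only when close to final quality
    if seed is not None:
        payload["seed"] = seed
    return payload
```

Calling it with `final=False` during iteration and `final=True` for the last export keeps the rest of the settings identical between runs.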


Price

  • Cost per generation: $0.30.
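
For budgeting a batch, the arithmetic is a single multiplication. The sketch below assumes every generation is billed at the flat rate listed above regardless of settings, which the page does not state explicitly; `estimate_cost` is a hypothetical helper.

```python
from decimal import Decimal

PRICE_PER_GENERATION = Decimal("0.30")  # USD, from the price listed above


def estimate_cost(generations: int) -> Decimal:
    """Estimated spend in USD, assuming a flat per-generation price."""
    if generations < 0:
        raise ValueError("generations must be non-negative")
    return PRICE_PER_GENERATION * generations
```

Under that assumption, twelve draft iterations would come to 12 × 0.30 = 3.60 USD.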

💡 Tips

  • Use clean, centered references with good lighting for image-to-3D. Multi-view images greatly improve shape accuracy.
  • Start with medium polycount tiers (8k_Quad, 20K_Triangle) for fast iteration, then switch to higher tiers + HighPack for final export.
  • For rigged characters, combine ta_pose + quad meshes and export as fbx or glb.
  • If scale consistency matters across a project, define a shared bbox_condition and reuse it for all related assets.

More 3D Models

  • tripo3d/v2.5/image-to-3d: Tripo3D’s v2.5 image-to-3D model turns a single product or concept image into a textured, game-ready 3D asset for e-commerce, AR/VR and real-time engines.

  • tripo3d/v2.5/multiview-to-3d: Tripo3D’s multi-view 3D reconstruction model uses several photos of the same object to generate higher-fidelity meshes and textures for digital twins and 3D catalogs.

  • hunyuan3d/v2.1: Tencent Hunyuan3D v2.1 (hosted by WaveSpeedAI) converts text prompts into detailed 3D models, ideal for stylised characters, props and environment assets in games and animation.

  • hunyuan3d-v2-multi-view: Tencent Hunyuan3D v2 multi-view leverages multiple reference images to create accurate, textured 3D assets for digital humans, product visualization and virtual production workflows.

Authentication

For authentication details, please refer to the Authentication Guide.

API Endpoints

Submit Task & Query Result


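# Submit the task. "images" is required (1 to 5 items); the value below is a
# placeholder and assumes images are passed as URLs.
curl --location --request POST "https://api.wavespeed.ai/api/v3/hyper3d/rodin-v2/image-to-3d" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{"images": ["https://example.com/reference.png"], "material": "PBR", "geometry_file_format": "glb"}'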

# Get the result (set requestId to the data.id value returned by the submit call)
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"
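
The same two calls can be sketched in Python using only the standard library. The URLs and headers are taken from the curl commands above; the function names are hypothetical, and the request is built separately from sending so it can be inspected without touching the network.

```python
import json
import urllib.request

API_BASE = "https://api.wavespeed.ai/api/v3"


def submit_request(payload: dict, api_key: str) -> urllib.request.Request:
    """Build the POST request that submits an image-to-3D task."""
    return urllib.request.Request(
        f"{API_BASE}/hyper3d/rodin-v2/image-to-3d",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json",
                 "Authorization": f"Bearer {api_key}"},
        method="POST",
    )


def result_request(request_id: str, api_key: str) -> urllib.request.Request:
    """Build the GET request that fetches a task's result."""
    return urllib.request.Request(
        f"{API_BASE}/predictions/{request_id}/result",
        headers={"Authorization": f"Bearer {api_key}"},
        method="GET",
    )


def send(request: urllib.request.Request, timeout: float = 30.0) -> dict:
    """Send a prepared request and decode the JSON response body."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

A typical call would be `send(submit_request(payload, api_key))`, with the key read from the WAVESPEED_API_KEY environment variable as in the curl example.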

Parameters

Task Submission Parameters

Request Parameters

| Parameter | Type | Required | Default | Range | Description |
|---|---|---|---|---|---|
| prompt | string | No | - | - | A textual prompt to guide the model generation. |
| images | array | Yes | [] | 1 ~ 5 items | Images to use for generation, up to 5. The request preserves image order, and the first image is used for material generation. |
| material | string | No | - | PBR, All, Shaded | The material type. |
| quality_and_mesh | string | No | - | 4k_Quad, 8k_Quad, 18k_Quad, 50k_Quad, 2K_Triangle, 20K_Triangle, 250K_Triangle, 500K_Triangle | The generation quality and mesh mode. |
| geometry_file_format | string | No | - | glb, fbx, obj, stl, usdz | The format of the output geometry file. |
| addons | string | No | - | HighPack | Generates 4K-resolution textures instead of the default 2K. In Quad mode, the face count is about 16 times the count selected in quality_and_mesh. |
| bbox_condition | array | No | - | - | A ControlNet-style condition that limits the maximum size of the generated model. |
| ta_pose | boolean | No | - | - | Constrains the generated result to a T/A pose. |
| use_original_alpha | boolean | No | - | - | Respects the original alpha (transparency) of the input image when processing it. |
| preview_render | boolean | No | - | - | Adds a preview render to the download list. |
| seed | integer | No | - | -1 ~ 2147483647 | Seed for the random number generator. Set to 0 to use a random seed. |
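
The constraints in the request parameter table can be checked on the client before a task is submitted. The sketch below is a hypothetical validator that encodes only the types and ranges listed on this page; the service may enforce rules that are not documented here.

```python
# Allowed values copied from the Range column of the request parameter table.
ENUMS = {
    "material": ("PBR", "All", "Shaded"),
    "quality_and_mesh": ("4k_Quad", "8k_Quad", "18k_Quad", "50k_Quad",
                         "2K_Triangle", "20K_Triangle", "250K_Triangle",
                         "500K_Triangle"),
    "geometry_file_format": ("glb", "fbx", "obj", "stl", "usdz"),
    "addons": ("HighPack",),
}
BOOLEANS = ("ta_pose", "use_original_alpha", "preview_render")
KNOWN = set(ENUMS) | set(BOOLEANS) | {"prompt", "images", "bbox_condition", "seed"}


def validate_request(payload: dict) -> list:
    """Return a list of human-readable problems; empty means no issue found."""
    errors = []
    images = payload.get("images")
    if not isinstance(images, list) or not 1 <= len(images) <= 5:
        errors.append("images: required array with 1 to 5 items")
    for name, allowed in ENUMS.items():
        if name in payload and payload[name] not in allowed:
            errors.append(f"{name}: must be one of {', '.join(allowed)}")
    for name in BOOLEANS:
        if name in payload and not isinstance(payload[name], bool):
            errors.append(f"{name}: must be a boolean")
    seed = payload.get("seed")
    if seed is not None:
        is_int = isinstance(seed, int) and not isinstance(seed, bool)
        if not is_int or not -1 <= seed <= 2147483647:
            errors.append("seed: must be an integer in -1 ~ 2147483647")
    for name in sorted(set(payload) - KNOWN):
        errors.append(f"{name}: not a documented parameter")
    return errors
```

Returning every problem at once, rather than raising on the first, makes it easier to fix a request in one pass.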

Response Parameters

| Parameter | Type | Description |
|---|---|---|
| code | integer | HTTP status code (e.g., 200 for success) |
| message | string | Status message (e.g., “success”) |
| data.id | string | Unique identifier for the prediction (task ID) |
| data.model | string | Model ID used for the prediction |
| data.outputs | array | Array of URLs to the generated content (empty when status is not completed) |
| data.urls | object | Object containing related API endpoints |
| data.urls.get | string | URL to retrieve the prediction result |
| data.has_nsfw_contents | array | Array of boolean values indicating NSFW detection for each output |
| data.status | string | Status of the task: created, processing, completed, or failed |
| data.created_at | string | ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”) |
| data.error | string | Error message (empty if no error occurred) |
| data.timings | object | Object containing timing details |
| data.timings.inference | integer | Inference time in milliseconds |
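
After submission, the fields that matter for follow-up calls are data.id, data.status and data.urls.get. A minimal sketch of extracting them from a decoded response body; the `Submission` class and `parse_submission` function are hypothetical, and treating any code other than 200 as an error is an assumption.

```python
from dataclasses import dataclass


@dataclass
class Submission:
    task_id: str      # data.id, used as the request ID when fetching results
    status: str       # created, processing, completed, or failed
    result_url: str   # data.urls.get


def parse_submission(body: dict) -> Submission:
    """Pick the follow-up fields out of a submit response."""
    if body.get("code") != 200:
        raise RuntimeError(f"submission rejected: {body.get('message')}")
    data = body["data"]
    return Submission(task_id=data["id"],
                      status=data["status"],
                      result_url=data["urls"]["get"])
```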

Result Request Parameters

| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
| id | string | Yes | - | Task ID |

Result Response Parameters

| Parameter | Type | Description |
|---|---|---|
| code | integer | HTTP status code (e.g., 200 for success) |
| message | string | Status message (e.g., “success”) |
| data | object | The prediction data object containing all details |
| data.id | string | Unique identifier for the prediction (the ID of the prediction to get) |
| data.model | string | Model ID used for the prediction |
| data.outputs | array | Array of URLs to the generated content (empty when status is not completed) |
| data.urls | object | Object containing related API endpoints |
| data.urls.get | string | URL to retrieve the prediction result |
| data.status | string | Status of the task: created, processing, completed, or failed |
| data.created_at | string | ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”) |
| data.error | string | Error message (empty if no error occurred) |
| data.timings | object | Object containing timing details |
| data.timings.inference | integer | Inference time in milliseconds |
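
Because data.outputs stays empty until the status is completed, a client usually polls the result endpoint. The loop below is a sketch built on the four documented status values; the polling interval and attempt limit are arbitrary assumptions, and `fetch` stands for any caller-supplied function that returns the decoded result response.

```python
import time
from typing import Callable


def wait_for_outputs(fetch: Callable[[], dict],
                     interval: float = 2.0,
                     max_attempts: int = 150,
                     sleep: Callable[[float], None] = time.sleep) -> list:
    """Poll until the task completes, then return the output URLs.

    Raises RuntimeError if the task reports failed, and TimeoutError if
    it is still created / processing after max_attempts polls.
    """
    for _ in range(max_attempts):
        data = fetch()["data"]
        status = data["status"]
        if status == "completed":
            return list(data["outputs"])
        if status == "failed":
            raise RuntimeError(data.get("error") or "task failed")
        sleep(interval)  # still created or processing
    raise TimeoutError(f"task not finished after {max_attempts} polls")
```

Passing `sleep` as a parameter keeps the loop testable without real delays.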
© 2025 WaveSpeedAI. All rights reserved.