Browse ModelsNvidiaNvidia Cosmos 3 Super Image To Video

Nvidia Cosmos 3 Super Image To Video

Nvidia Cosmos 3 Super Image To Video

Playground

Try it on WavespeedAI!

NVIDIA Cosmos 3 Super Image to Video is a fast AI image-to-video generation model that creates high-quality videos from a first-frame image and a motion prompt. Ready-to-use REST inference API for animating images, product videos, cinematic clips, social media content, advertising creatives, concept videos, and professional image-to-video workflows with simple integration, no coldstarts, and affordable pricing.

Features

NVIDIA Cosmos 3 Super Image-to-Video

NVIDIA Cosmos 3 Super Image-to-Video generates short videos from a reference image and a natural-language prompt. It supports motion prompting, negative prompting, size presets, duration selection, inference-step tuning, and guidance scaling for high-quality image-driven video generation.


Why Choose This?

  • Image-guided video generation Start from a single reference image and animate it into a video clip.

  • Prompt-based motion control Describe motion, camera movement, atmosphere, and scene behavior using natural language.

  • Negative prompt support Use negative_prompt to steer the model away from unwanted content or artifacts.

  • Flexible size presets Generate videos in common aspect ratios such as 16:9, 1:1, and 9:16.

  • Simple duration control Choose a fixed output duration from 1 to 7 seconds.

  • Production-ready API Suitable for concept visualization, animated keyframes, creator content, marketing clips, and short cinematic sequences.


Parameters

ParameterRequiredDescription
promptYesText prompt describing the motion and scene of the video to generate.
imageYesFirst-frame image for the generated video.
negative_promptNoContent to steer the generation away from.
sizeNoOutput video size preset. Supported values: 16:9, 4:3, 1:1, 3:4, 9:16. Default: 16:9.
durationNoOutput video duration in seconds. Supported values: 1, 2, 3, 4, 5, 6, 7. Default: 7.
num_inference_stepsNoNumber of denoising steps.
guidance_scaleNoClassifier-free guidance scale.

How to Use

  1. Upload the reference image — provide the image you want to animate.
  2. Write the prompt — describe the desired motion, camera action, and scene evolution.
  3. Add a negative prompt (optional) — specify things you want to avoid.
  4. Choose the output size — select the aspect ratio that matches your intended output.
  5. Set duration (optional) — choose a fixed output length between 1 and 7 seconds.
  6. Tune generation settings (optional) — adjust num_inference_steps and guidance_scale if needed.
  7. Submit — run the model and download the generated video.

Example Prompt

A cinematic slow push-in as the subject turns slightly toward the camera, soft wind movement in the hair, subtle background motion, realistic lighting, polished commercial look


Pricing

Pricing is based on the selected duration.

DurationCost
1 second$0.05
2 seconds$0.10
3 seconds$0.15
4 seconds$0.20
5 seconds$0.25
6 seconds$0.30
7 seconds$0.35

Billing Rules

  • Pricing is $0.05 per second
  • Billing follows the selected duration
  • Minimum billed duration is 1 second
  • Maximum billed duration is 7 seconds
  • size, negative_prompt, num_inference_steps, and guidance_scale do not directly affect pricing

Best Use Cases

  • Image-to-video animation — Turn a still image into a dynamic short clip.
  • Concept visualization — Explore motion ideas from a single frame.
  • Social media content — Create short animated content from static artwork or portraits.
  • Marketing creatives — Animate posters, product shots, or character stills.
  • Cinematic prototyping — Test prompt-driven motion, framing, and visual storytelling.

Pro Tips

  • Use a clear, high-quality reference image for better motion stability.
  • Keep prompts focused on motion, camera behavior, and scene change rather than restating static details already visible in the image.
  • Match the input image aspect ratio to the selected size whenever possible.
  • If the input image ratio does not match the selected size, the result may appear stretched or distorted.
  • Start with the default settings first, then tune guidance_scale or num_inference_steps only if needed.
  • Use negative_prompt to reduce unwanted artifacts or style drift.

Notes

  • prompt and image are required.
  • duration is selected directly from 1 to 7 seconds.
  • size defaults to 16:9.
  • Pricing depends only on the selected duration.
  • For best results, the input image ratio should match the selected size ratio to avoid distortion.

Authentication

For authentication details, please refer to the Authentication Guide.

API Endpoints

Submit Task & Query Result


# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/nvidia/cosmos-3-super/image-to-video" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
    "size": "16:9",
    "duration": "7",
    "num_inference_steps": 28,
    "guidance_scale": 6
}'

# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"

Parameters

Task Submission Parameters

Request Parameters

ParameterTypeRequiredDefaultRangeDescription
promptstringYes-Text prompt describing the motion and scene of the video to generate.
imagestringYes-First-frame image for the generated video.
negative_promptstringNo-Content to steer the generation away from.
sizestringNo16:916:9, 4:3, 1:1, 3:4, 9:16Output video size preset.
durationstringNo71, 2, 3, 4, 5, 6, 7Output video duration in seconds.
num_inference_stepsintegerNo281 ~ 50Number of denoising steps.
guidance_scalenumberNo60.0 ~ 20.0Classifier-free guidance scale.

Response Parameters

ParameterTypeDescription
codeintegerHTTP status code (e.g., 200 for success)
messagestringStatus message (e.g., “success”)
data.idstringUnique identifier for the prediction, Task Id
data.modelstringModel ID used for the prediction
data.outputsarrayArray of URLs to the generated content (empty when status is not completed)
data.urlsobjectObject containing related API endpoints
data.urls.getstringURL to retrieve the prediction result
data.statusstringStatus of the task: created, processing, completed, or failed
data.created_atstringISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.errorstringError message (empty if no error occurred)
data.timingsobjectObject containing timing details
data.timings.inferenceintegerInference time in milliseconds

Result Request Parameters

ParameterTypeRequiredDefaultDescription
idstringYes-Task ID

Result Response Parameters

ParameterTypeDescription
codeintegerHTTP status code (e.g., 200 for success)
messagestringStatus message (e.g., “success”)
dataobjectThe prediction data object containing all details
data.idstringUnique identifier for the prediction, the ID of the prediction to get
data.modelstringModel ID used for the prediction
data.outputsstringArray of URLs to the generated content.
data.urlsobjectObject containing related API endpoints
data.urls.getstringURL to retrieve the prediction result
data.statusstringStatus of the task: created, processing, completed, or failed
data.created_atstringISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.errorstringError message (empty if no error occurred)
data.timingsobjectObject containing timing details
data.timings.inferenceintegerInference time in milliseconds
© 2025 WaveSpeedAI. All rights reserved.