Hunyuan Video 1.5 Image to Video | Fast Image-to-Video API

Seedream 5.0 Pro jest już LIVE | Wypróbuj w Generator obrazów →

Panel Odkrywaj Generator AIGorące Aplikacja desktopowa

LLM

Ustawienia

Strona główna/Odkrywaj/WaveSpeed/Hunyuan Video 1.5/Image To Video

wavespeed-ai /

HunyuanVideo-1.5 (i2v) is a lightweight 8.3B parameter image-to-video model that generates high-quality videos from images with top-tier visual quality and motion coherence. Optimized for fast inference on consumer-grade GPUs. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

image-to-video

Wejście

Enable Safety Checker

Bezczynny

$0.1za uruchomienie·~10 / $1

Dalej:

PrzykładyZobacz wszystkie

A cinematic anime-style image-to-video sequence. Start from this frame: a high-school girl with long dark hair stands in the rain at night, holding a clear umbrella in a neon-lit Japanese backstreet, her uniform shirt slightly wet, looking back with a faintly sad expression. Continue the story at normal speed, no slow motion: the camera slowly tracks forward toward her as raindrops tap on the umbrella and colorful signs reflect on the wet pavement. Her phone vibrates in her pocket, she glances down to read a short message, her eyes soften with mixed relief and disappointment. She looks once more down the empty street as if hoping to see someone arrive, then takes a quiet breath, tightens her grip on the umbrella, and starts walking toward the brighter end of the alley, passing steaming food stalls and blurred pedestrians under umbrellas. Neon reflections ripple around her footsteps, city sounds grow louder, anime aesthetic, detailed rain effects, gentle handheld-style motion, melancholic yet hopeful mood.

A cinematic noir-style image-to-video shot. Start from the given frame: a middle-aged detective in a beige trench coat and fedora stands in the rain on a 1940s city street under the glowing “BLUE MOON TAVERN” neon sign, holding a slightly crumpled black-and-white photograph. Continue the story: light rain falls on the wet cobblestones, car headlights pass slowly behind him, reflections shimmer on the street. The camera moves forward at a natural pace, no slow motion, as he studies the photo, turns it over to reveal a handwritten message, then frowns. The tavern sign flickers, a distant car door slams, and a shadowy figure briefly appears in the tavern doorway before disappearing inside. The detective pockets the photograph, looks up with quiet determination, and walks toward the tavern entrance through the rain. Realistic lighting, detailed textures, classic film-noir atmosphere, normal speed, cinematic framing.

A cinematic cyberpunk image-to-video sequence. Start from this frame: a lone armored figure with a glowing visor walks down the center of a neon-soaked street at night, skyscrapers and holographic billboards towering on both sides, rain falling and reflecting the colors on the wet asphalt. Continue the story at normal speed (no slow motion): the camera tracks backward as the figure keeps walking with steady, confident steps, drones and flying cars crossing the sky above, their lights sweeping past. A message flashes briefly across one of the giant screens with a warning about a citywide lockdown; the character glances up, the visor UI flickers, then they receive a holographic mission briefing projected from the helmet. Police sirens echo in the distance, a car screeches to a stop behind them, and a drone turns its camera toward the character, scanning. The figure clenches their fists, the visor shifts to a combat color, and they stride forward toward the end of the street, disappearing into a haze of neon fog. High detail, rich reflections, dynamic city atmosphere, smooth camera motion, real-time pacing.

A romantic vintage-style image-to-video sequence. Start from this frame: a couple in classic coats stand close together on a wet cobblestone street at dusk, sharing a single umbrella, laughing and looking into each other’s eyes, warm streetlamps glowing behind them and a small cinema sign in the background. The camera gently tracks around them at normal speed (no slow motion) as raindrops fall and reflections shimmer on the stones. They calm their laughter, he brushes a raindrop from her hair, she playfully nudges his shoulder, then they decide to walk. Still under the umbrella, they turn and stroll down the street toward the distant lights, occasionally bumping shoulders and exchanging soft smiles, passing the “Cinéma du Cœur” entrance as its marquee flickers on. The city feels quiet and intimate, warm golden light, soft bokeh, natural real-time motion, cozy romantic mood.

Image-to-video horror sequence. Start from this exact frame: an old, decaying house labeled “ST. JUDE’S ORPHANAGE – EST. 1888,” shattered windows, overgrown yard, and a long-haired figure in a torn white gown standing motionless at the top of the steps. A single distant flash of lightning briefly lights the sky, then fades, leaving only the dim, overcast gloom. The camera makes a very slow, steady push toward the porch at normal speed (no slow motion), as light rain begins to fall and the old wood creaks quietly. Keep the figure almost completely still, with only tiny natural motions: her dress and hair moving slightly in the wind, and the faint rise and fall of breathing. Once, a weak glow appears behind an upstairs window, suggesting vague child-shaped shadows before it goes dark again. As the camera reaches the bottom of the steps, the orphanage sign swings gently with a soft squeak, and the figure’s head slowly lifts a few degrees toward the camera in one continuous, smooth movement. No jump cuts, no teleporting, just subtle, realistic motion, gritty cinematic look, heavy atmosphere, detailed rain and shadows, unsettling supernatural mood, real-time pacing.

Powiązane modele

hunyuan-image-3

text-to-image

hunyuan-3d-v3.1/image-to-3d-rapid

image-to-3d

hunyuan-3d-v3.1/text-to-3d-rapid

text-to-3d

hunyuan3d-v3/text-to-3d

text-to-3d

hunyuan3d-v3/image-to-3d

image-to-3d

hunyuan3d-v3/sketch-to-3d

image-to-3d

README

HunyuanVideo-1.5 Image-to-Video

HunyuanVideo-1.5 is Tencent’s lightweight, state-of-the-art video generation model. The image-to-video variant on WaveSpeedAI lets you animate a single still image into a smooth, cinematic clip guided by your text prompt, while keeping the original visual style and character identity stable.

Key features

High-quality image-to-video generation with strong motion coherence
Lightweight 8.3B-parameter design for fast inference
Multiple resolutions: 480p, 720p
Video durations: 5 s, 8 s, and 10 s

Limits and performance

Input: single image (any reasonable resolution; automatically resized/preprocessed)
Output: short video clip at selected resolution, duration, and aspect ratio
Recommended duration: up to 10 seconds per clip
Best performance with clear, well-lit images and a prompt that specifies motion, camera behavior, and mood

Pricing

Resolution	Price per second
480p	$0.02 / s
720p	$0.04 / s

How to use

Upload your input image (this becomes the starting frame of the video).
Enter a prompt describing the motion, camera movement, environment changes, and overall mood.
Choose the resolution: 480p, 720p.
Select the aspect ratio (16:9 for landscape or 9:16 for vertical/mobile).
Choose the duration: 5, 8, or 10 seconds.
Optionally set the seed for reproducibility.
Run the job and wait for processing.
Preview the generated video and download it from the WaveSpeedAI dashboard.

Tips for best results

Use a clean, high-resolution input image; avoid heavy compression and motion blur.
In the prompt, specify both what moves (hair, clothes, camera, background elements) and what stays stable (character pose, framing).
Mention camera behavior explicitly (e.g., “slow push-in,” “handheld shake,” “static camera with subtle parallax”).
Shorter durations (5–8 s) tend to produce the most coherent motion for complex scenes.
For a series of related clips, reuse the same seed and similar prompts to keep style and identity consistent.

Notes

HunyuanVideo-1.5 I2V is ideal for creators who want fast, controllable animation from still images without heavyweight hardware. It can be combined with high-end image models on WaveSpeedAI (such as Nano Banana Pro or Seedream v4) for a full pipeline: generate a keyframe with an image model, then bring it to life with HunyuanVideo.

Uwaga:Ta strona korzysta z modeli AI udostępnianych przez podmioty trzecie.

Hunyuan Video 1.5 Image To Video API — Quick start

Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/wavespeed-ai/hunyuan-video-1.5/image-to-video with your input as JSON. The endpoint returns a prediction id. Start polling the result endpoint around every 2 seconds, increase the interval for long-running tasks, and stop on any terminal status. On completed, read output values from data.outputs. Examples for Hunyuan Video 1.5 Image To Video below.

HTTP example

set -euo pipefail

: "${WAVESPEED_API_KEY:?Set WAVESPEED_API_KEY}"

REQUEST_BODY=$(cat <<'JSON'
{
    "image": "https://interactive-examples.mdn.mozilla.net/media/cc0-images/painted-hand-298-332.jpg",
    "resolution": "720p",
    "duration": 5,
    "seed": -1
}
JSON
)

# 1. Submit the prediction.
SUBMIT_RESPONSE=$(curl --silent --show-error --fail-with-body \
  -X POST "https://api.wavespeed.ai/api/v3/wavespeed-ai/hunyuan-video-1.5/image-to-video" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY" \
  -d "$REQUEST_BODY")

TASK=$(printf '%s' "$SUBMIT_RESPONSE" | jq 'if has("data") then .data else . end')
PREDICTION_ID=$(printf '%s' "$TASK" | jq -r '.id')
if [ -z "$PREDICTION_ID" ] || [ "$PREDICTION_ID" = "null" ]; then
  printf 'Submission response did not contain a prediction id
' >&2
  exit 1
fi
RESULT_URL=$(printf '%s' "$TASK" | jq -r '.urls.get // empty')
if [ -z "$RESULT_URL" ]; then
  RESULT_URL="https://api.wavespeed.ai/api/v3/predictions/$PREDICTION_ID/result"
fi

# 2. Poll until the prediction finishes.
while true; do
  RESPONSE=$(curl --silent --show-error --fail-with-body "$RESULT_URL" \
    -H "Authorization: Bearer $WAVESPEED_API_KEY")
  RESULT=$(printf '%s' "$RESPONSE" | jq 'if has("data") then .data else . end')
  STATUS=$(printf '%s' "$RESULT" | jq -r '.status')
  case "$STATUS" in
    completed) printf '%s\n' "$RESULT" | jq '.outputs'; break ;;
    failed|cancelled|timeout) printf '%s\n' "$RESULT" | jq . >&2; exit 1 ;;
    created|processing) sleep 2 ;;
    *) printf 'Unexpected status: %s
' "$STATUS" >&2; exit 1 ;;
  esac
done

Node.js example

const submitUrl = "https://api.wavespeed.ai/api/v3/wavespeed-ai/hunyuan-video-1.5/image-to-video";
const apiKey = process.env.WAVESPEED_API_KEY;
if (!apiKey) throw new Error('Set WAVESPEED_API_KEY');

async function requestJson(url, options = {}) {
  const response = await fetch(url, options);
  if (!response.ok) throw new Error(await response.text());
  return response.json();
}

// 1. Submit the prediction.
const body = await requestJson(submitUrl, {
  method: "POST",
  headers: {
    "Authorization": `Bearer ${apiKey}`,
    "Content-Type": "application/json",
  },
  body: JSON.stringify({
        "image": "https://interactive-examples.mdn.mozilla.net/media/cc0-images/painted-hand-298-332.jpg",
        "resolution": "720p",
        "duration": 5,
        "seed": -1
}),
});
const task = body.data ?? body;
if (!task.id) throw new Error("Submission response did not contain a prediction id");
const resultUrl = task.urls?.get ||
  `https://api.wavespeed.ai/api/v3/predictions/${task.id}/result`;

// 2. Poll until the prediction finishes.
while (true) {
  const resultBody = await requestJson(resultUrl, {
    headers: { "Authorization": `Bearer ${apiKey}` },
  });
  const result = resultBody.data ?? resultBody;
  if (result.status === "completed") {
    console.log(result.outputs);
    break;
  }
  if (["failed", "cancelled", "timeout"].includes(result.status)) throw new Error(JSON.stringify(result));
  if (!["created", "processing"].includes(result.status)) throw new Error("Unexpected status: " + result.status);
  await new Promise(resolve => setTimeout(resolve, 2000));
}

Python example

import json
import os
import time
from urllib.request import Request, urlopen

api_key = os.environ["WAVESPEED_API_KEY"]
headers = {"Authorization": f"Bearer {api_key}", "Content-Type": "application/json"}
payload = {
    "image": "https://interactive-examples.mdn.mozilla.net/media/cc0-images/painted-hand-298-332.jpg",
    "resolution": "720p",
    "duration": 5,
    "seed": -1
}

def request_json(url, data=None):
    request = Request(url, data=data, headers=headers, method="POST" if data else "GET")
    with urlopen(request) as response:
        return json.load(response)

# 1. Submit the prediction.
body = request_json("https://api.wavespeed.ai/api/v3/wavespeed-ai/hunyuan-video-1.5/image-to-video", json.dumps(payload).encode())
task = body.get("data", body)
if not task.get("id"):
    raise RuntimeError("Submission response did not contain a prediction id")
result_url = task.get("urls", {}).get("get") or f"https://api.wavespeed.ai/api/v3/predictions/{task['id']}/result"

# 2. Poll until the prediction finishes.
while True:
    result_body = request_json(result_url)
    result = result_body.get("data", result_body)
    status = result.get("status")
    if status == "completed":
        print(result.get("outputs", []))
        break
    if status in {"failed", "cancelled", "timeout"}:
        raise RuntimeError(result)
    if status not in {"created", "processing"}:
        raise RuntimeError(f"Unexpected status: {status}")
    time.sleep(2)

Hunyuan Video 1.5 Image To Video API — Frequently asked questions

What is the Hunyuan Video 1.5 Image To Video API?

Hunyuan Video 1.5 Image To Video is a WaveSpeedAI model for video generation from images, exposed as a REST API on WaveSpeedAI. HunyuanVideo-1.5 (i2v) is a lightweight 8.3B parameter image-to-video model that generates high-quality videos from images with top-tier visual quality and motion coherence. Optimized for fast inference on consumer-grade GPUs. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing. You can call it programmatically or try it from the playground above.

How do I call the Hunyuan Video 1.5 Image To Video API?

POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID. Poll the result endpoint starting around every 2 seconds, increase the interval for long-running tasks, and stop on any terminal status. The playground generates production-oriented Python, JavaScript, and cURL examples with timeouts, transient-error handling, and safe GET retries. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/wavespeed-ai/hunyuan-video-1.5-image-to-video.

How much does Hunyuan Video 1.5 Image To Video cost per run?

Hunyuan Video 1.5 Image To Video starts at $0.10 per run. That figure is the base price — the final charge scales with the parameters you set in the form (output size, length, count, references, or whatever knobs this model exposes), so a higher-quality or larger output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.

What inputs does Hunyuan Video 1.5 Image To Video accept?

Key inputs: `prompt`, `image`, `resolution`, `duration`, `seed`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/wavespeed-ai/hunyuan-video-1.5-image-to-video.

How long does Hunyuan Video 1.5 Image To Video take to generate?

Median end-to-end generation time on WaveSpeedAI is around 216 seconds per request, based on recent successful runs. Queue time varies with global demand; live status is visible in the prediction record.

Can I use Hunyuan Video 1.5 Image To Video outputs commercially?

Commercial usage rights depend on the model's license, set by its provider (WaveSpeedAI). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.

Hunyuan Video 1.5 Image to Video | Fast Image-to-Video API | WaveSpeedAI

PrzykładyZobacz wszystkie

Powiązane modele

README

HunyuanVideo-1.5 Image-to-Video

Key features

Limits and performance

Pricing

How to use

Tips for best results

Notes

Hunyuan Video 1.5 Image To Video API — Quick start

Hunyuan Video 1.5 Image To Video API — Frequently asked questions

Dowiedz się więcej

Informacje prawne

Zasoby

Modele

Narzędzia