50% zniżki na modele Vidu Q3 i Q3 Pro · Tylko w WaveSpeedAI | 20 maja – 2 czerwca

Hunyuan Video 1.5 Text to Video

wavespeed-ai /

HunyuanVideo-1.5 (t2v) is a lightweight 8.3B parameter text-to-video model that generates high-quality videos with top-tier visual quality and motion coherence. Optimized for fast inference on consumer-grade GPUs. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

text-to-video
Wejście

Bezczynny

$0.1za uruchomienie·~10 / $1

Dalej:

PrzykładyZobacz wszystkie

Macro cinematography. A tiny, 3-inch tall robot made of scrap metal and old watch parts is hiking across a wooden kitchen table like it is a mountain range. It struggles to climb over a giant ceramic coffee mug. Suddenly, a giant house cat's yellow eye appears in the blurred background, blinking and staring at the robot. The robot freezes and its lens-eye zooms in. whimsical, detailed textures, depth of field.

Commercial slow-motion shot. Extreme close-up of fresh milk being poured into a clear glass of dark coffee. The liquids swirl and mix, creating intricate marble patterns. Coffee droplets splash upwards in slow motion. Warm morning sunlight backlighting the glass. Crystal clear focus, 8k resolution, high-speed photography aesthetic.

Cinematic close-up. An astronaut in a weathered, futuristic space suit is kneeling on a desolate, red dusty planet surface. The camera reflects off his golden visor. He reaches out a gloved hand, trembling slightly, to touch a single, glowing white flower growing out of a crack in the rock. The contrast between the dead environment and the delicate life is striking. Dust swirls around him. 4k, emotional sci-fi atmosphere.

Surreal dreamscape. A massive, translucent blue humpback whale is swimming slowly and gracefully through the thick white clouds above a modern city skyline at sunset. Golden sunlight passes through the whale's fins. The camera pans to follow the whale's movement. Below, the city lights are just starting to turn on. Magical realism, ethereal, breathtaking scale, Studio Ghibli vibe but photorealistic.

High-speed automotive commercial. A silver sports car is driving aggressively through a well-lit tunnel. The camera is mounted low on the front bumper (POV), rushing forward. The tunnel lights above streak past as continuous lines of light due to the extreme speed. Reflections on the car hood change rapidly. The sense of speed is intense. 4k, hyper-realistic.

Powiązane modele

README

HunyuanVideo-1.5 Text-to-Video

HunyuanVideo-1.5 is Tencent’s lightweight text-to-video generation model that delivers state-of-the-art visual quality and motion coherence with only 8.3B parameters. It is designed to be both powerful and efficient, making high-quality video generation accessible for everyday creators and production workflows on WaveSpeedAI.

Key Features

  • High-quality video generation directly from text prompts
  • Lightweight 8.3B parameters for fast inference on consumer-grade GPUs
  • Video durations: 5 s, 8 s, and 10 s
  • Strong motion coherence and stable subject identity

Pricing

ResolutionPrice per second
480p$0.02 / s
720p$0.04 / s

How to Use

  1. Write your text prompt describing the scene, characters, motion, camera movement, and overall style.
  2. Select the duration: 5 s, 8 s, or 10 s.
  3. Optionally tweak inference steps or seed to balance speed, quality, and reproducibility.
  4. Run the job from the WaveSpeedAI interface.
  5. Preview the generated clip and download it from the dashboard.

Tips for Best Results

  • Be explicit: describe who is in the scene, what they are doing, where they are, and how the camera moves.
  • Mention style and mood (for example, “cinematic lighting,” “handheld documentary,” “anime style,” “neon cyberpunk city”).
  • Shorter clips (5–8 s) generally produce the most coherent and visually stable results.
  • Reuse similar prompts and seeds when you want a series of related shots that share style and characters.

Upscaling for Higher Quality

After generating your base video with HunyuanVideo-1.5, you can use WaveSpeedAI’s dedicated video super-resolution models to enhance clarity and sharpness:

Generate efficiently at 480p or 720p, then upscale to higher resolutions for a better final viewing experience.

Dostępność:Ta strona korzysta z modeli AI udostępnianych przez podmioty trzecie.

Hunyuan Video 1.5 Text To Video API — Quick start

Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/wavespeed-ai/hunyuan-video-1.5/text-to-video with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status flips to completed, then read the output URL from data.outputs[0]. Examples for Hunyuan Video 1.5 Text To Video below.

HTTP example
# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/wavespeed-ai/hunyuan-video-1.5/text-to-video" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY" \
  -d '{
    "prompt": "A cinematic shot of a city at sunset, soft golden light",
    "negative_prompt": "blurry, low quality, distorted",
    "size": "1280*720",
    "duration": 5,
    "seed": -1
}'

# Response includes a prediction id. Poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY"

# When status is "completed", read the output from data.outputs[0].
Node.js example
// npm install wavespeed
const WaveSpeed = require('wavespeed');

const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env

const result = await client.run("wavespeed-ai/hunyuan-video-1.5/text-to-video", {
        "prompt": "A cinematic shot of a city at sunset, soft golden light",
        "negative_prompt": "blurry, low quality, distorted",
        "size": "1280*720",
        "duration": 5,
        "seed": -1
});

console.log(result.outputs[0]); // → URL of the generated output
Python example
# pip install wavespeed
import wavespeed

output = wavespeed.run(
    "wavespeed-ai/hunyuan-video-1.5/text-to-video",
    {
    "prompt": "A cinematic shot of a city at sunset, soft golden light",
    "negative_prompt": "blurry, low quality, distorted",
    "size": "1280*720",
    "duration": 5,
    "seed": -1
}
)

print(output["outputs"][0])  # → URL of the generated output

Hunyuan Video 1.5 Text To Video API — Frequently asked questions

What is the Hunyuan Video 1.5 Text To Video API?

Hunyuan Video 1.5 Text To Video is a WaveSpeedAI model for video generation, exposed as a REST API on WaveSpeedAI. HunyuanVideo-1.5 (t2v) is a lightweight 8.3B parameter text-to-video model that generates high-quality videos with top-tier visual quality and motion coherence. Optimized for fast inference on consumer-grade GPUs. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing. You can call it programmatically or try it from the playground above.

How do I call the Hunyuan Video 1.5 Text To Video API?

POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/wavespeed-ai/hunyuan-video-1.5-text-to-video.

How much does Hunyuan Video 1.5 Text To Video cost per run?

Hunyuan Video 1.5 Text To Video starts at $0.10 per run. That figure is the base price — the final charge scales with the parameters you set in the form (output size, length, count, references, or whatever knobs this model exposes), so a higher-quality or larger output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.

What inputs does Hunyuan Video 1.5 Text To Video accept?

Key inputs: `prompt`, `duration`, `size`, `seed`, `negative_prompt`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/wavespeed-ai/hunyuan-video-1.5-text-to-video.

How long does Hunyuan Video 1.5 Text To Video take to generate?

Average end-to-end generation time on WaveSpeedAI is around 99 seconds per request — measured across recent runs. Queue time scales with global demand; live status is visible in the prediction record.

Can I use Hunyuan Video 1.5 Text To Video outputs commercially?

Commercial usage rights depend on the model's license, set by its provider (WaveSpeedAI). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.