Alibaba Happy Horse 1.0 (Reference-to-Video) generates new video scenes guided by reference images, maintaining consistent characters, styles, and visual identity. Ready-to-use REST API, best performance, no coldstarts, affordable pricing.
Ocioso
$0.7por execução·~14 / $10
A lone female explorer wearing a sand-colored cloak walks across a vast desert at dawn, carrying an ancient map and a small lantern. Wind blows sand across the dunes. In the distance, a massive forgotten temple slowly emerges from the morning mist. She stops, unfolds the map, and realizes she has found the lost city. Begin with an aerial wide shot over endless dunes, then track behind her footsteps in the sand. Cut to a close-up of her hands holding the old map, then a dramatic low-angle shot of the temple rising in the distance. Epic cinematic adventure, golden sunrise, sweeping camera movement, mysterious atmosphere, highly detailed environment.
Alibaba Happy Horse 1.0 Reference-to-Video generates new video scenes guided by one or more reference images, helping maintain consistent characters, styles, and visual identity across the output. It combines reference-image grounding with natural-language prompting to create cinematic videos in 720p or 1080p.
Reference-guided consistency Use up to multiple reference images to preserve character identity, visual style, outfit details, and overall scene language.
Prompt + image control Combine reference images with a text prompt to control the scene, action, mood, and camera behavior more precisely.
Cinematic motion Generate smooth, expressive video motion while keeping important visual elements stable and recognizable.
Flexible output settings Choose output resolution, aspect ratio, duration, and seed to match your creative and production needs.
Production-ready API Access the model through a REST inference API with no cold starts for scalable integration into apps and workflows.
| Parameter | Required | Description |
|---|---|---|
| images | Yes | Reference image URLs. Supports 1–9 images. |
| prompt | Yes | Text description of the desired scene, action, style, or motion. |
| resolution | No | Output resolution: 720p (default) or 1080p. |
| aspect_ratio | No | Output aspect ratio. Default: 16:9. |
| duration | No | Video length in seconds. Range: 3–15, default 5. |
| seed | No | Random seed for reproducibility. Range: 0–2147483647. |
1–9 image URLs that define the character, style, or visual identity you want to preserve.720p for lower-cost iteration or 1080p for higher-quality final output.3 and 15 seconds.A cinematic fashion scene with the same character walking through a softly lit modern city street at night, gentle camera tracking, subtle wind in the hair and clothing, elegant movement, realistic lighting, premium commercial style
| Resolution | Cost |
|---|---|
| 720p | $0.70 |
| 1080p | $1.40 |
| Resolution | 3s | 5s | 10s | 15s |
|---|---|---|---|---|
| 720p | $0.42 | $0.70 | $1.40 | $2.10 |
| 1080p | $0.84 | $1.40 | $2.80 | $4.20 |
720p costs $0.70 per 5 seconds1080p costs 2× the 720p ratetotal_price = 0.70 × (resolution == "1080p" ? 2 : 1) × duration / 5720p for rapid testing, then switch to 1080p for final-quality renders.seed when you want more reproducible outputs.images and prompt are required.images supports 1–9 reference image URLs.3–15 seconds.720p and 1080p.duration.1080p pricing is exactly 2× the 720p rate.Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/alibaba/happyhorse-1.0/reference-to-video with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status flips to completed, then read the output URL from data.outputs[0]. Examples for Happyhorse 1.0 Reference To Video below.
# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/alibaba/happyhorse-1.0/reference-to-video" \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $WAVESPEED_API_KEY" \
-d '{
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"resolution": "720p",
"aspect_ratio": "16:9",
"duration": 5,
"seed": 0
}'
# Response includes a prediction id. Poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
-H "Authorization: Bearer $WAVESPEED_API_KEY"
# When status is "completed", read the output from data.outputs[0].// npm install wavespeed
const WaveSpeed = require('wavespeed');
const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env
const result = await client.run("alibaba/happyhorse-1.0/reference-to-video", {
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"resolution": "720p",
"aspect_ratio": "16:9",
"duration": 5,
"seed": 0
});
console.log(result.outputs[0]); // → URL of the generated output# pip install wavespeed
import wavespeed
output = wavespeed.run(
"alibaba/happyhorse-1.0/reference-to-video",
{
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"resolution": "720p",
"aspect_ratio": "16:9",
"duration": 5,
"seed": 0
}
)
print(output["outputs"][0]) # → URL of the generated outputHappyhorse 1.0 Reference To Video is a Alibaba model for video generation from images, exposed as a REST API on WaveSpeedAI. Alibaba Happy Horse 1.0 (Reference-to-Video) generates new video scenes guided by reference images, maintaining consistent characters, styles, and visual identity. Ready-to-use REST API, best performance, no coldstarts, affordable pricing. You can call it programmatically or try it from the playground above.
POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/alibaba/alibaba-happyhorse-1.0-reference-to-video.
Happyhorse 1.0 Reference To Video starts at $0.70 per run. That figure is the base price — the final charge scales with the parameters you set in the form (output size, length, count, references, or whatever knobs this model exposes), so a higher-quality or larger output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.
Key inputs: `prompt`, `images`, `aspect_ratio`, `resolution`, `duration`, `seed`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/alibaba/alibaba-happyhorse-1.0-reference-to-video.
Sign up for a free WaveSpeedAI account to claim starter credits, copy your API key from /accesskey, then call the endpoint shown in the API tab of the playground. The playground also auto-generates a code sample in Python, JavaScript, or cURL for the parameters you've set.
Commercial usage rights depend on the model's license, set by its provider (Alibaba). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.