Alibaba Wan 2.6 Image To Video Flash

Playground

WAN 2.6 Flash converts images into videos (720p/1080p) with optional audio, optimized for speed and cost. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Features

Wan 2.6 Image-to-Video Flash is a fast image-to-video generation model from. Upload an image, describe the motion you want, and generate videos up to 15 seconds with optional synchronized audio — all with quick turnaround and flexible pricing.

Why Choose This?

Image-driven video generation Transform static images into dynamic videos with AI-generated motion.
Optional audio generation Generate videos with synchronized audio or output silent videos.
Custom audio input Optionally upload your own audio to sync with the generated video.
Multiple shot types Choose single or multi-shot modes for different creative needs.
Prompt Enhancer Built-in tool to automatically optimize your prompts for better results.
Fast generation Flash variant optimized for quick turnaround.

Parameters

Parameter	Required	Description
image	Yes	Source image to animate (upload or URL)
prompt	Yes	Describe the motion and action you want
audio	No	Custom audio file to sync with video
negative_prompt	No	Describe what to avoid in the output
resolution	No	Output resolution: 720p or 1080p (default: 720p)
duration	No	Video length in seconds, up to 15 (default: 15)
shot_type	No	Shot mode: single or multi (default: single)
enable_prompt_expansion	No	Enable prompt optimizer for better results
enable_audio	No	Output video with audio (default: true)
seed	No	Random seed for reproducibility (-1 for random)

Shot Type Options

Mode	Description
single	Single continuous shot
multi	Multiple shots with scene transitions

How to Use

Upload your image — the starting frame for your video.
Write your prompt — describe the motion and action you want.
Upload audio (optional) — add custom audio to sync with video.
Add negative prompt (optional) — specify what to avoid.
Set resolution — choose 720p or 1080p.
Set duration — select video length up to 15 seconds.
Choose shot type — single for continuous, multi for scene cuts.
Enable audio — check for video with sound, uncheck for silent.
Run — submit and download the generated video.

Pricing

Resolution	Audio	Cost per 5 seconds
720p	Off	$0.125
720p	On	$0.25
1080p	Off	$0.1875
1080p	On	$0.375

Billing Rules

Base rate: $0.125 per 5 seconds (720p, no audio)
1080p multiplier: 1.5×
Audio multiplier: 2×
Total cost = base_price × (duration / 5) × resolution_multiplier × audio_multiplier

Examples

5s @ 720p, no audio → $0.125
10s @ 720p, with audio → $0.125 × 2 × 2 = $0.50
15s @ 1080p, with audio → $0.125 × 3 × 1.5 × 2 = $1.125

Best Use Cases

Social Media Content — Create engaging short videos from photos.
Product Animation — Bring product images to life with motion.
Creative Projects — Transform artwork and illustrations into animated videos.
Marketing — Generate dynamic content from static promotional images.
Storytelling — Create video sequences from key story moments.

Pro Tips

Use clear, high-quality source images for best results.
Be specific about motion direction and speed in your prompt.
Use negative prompts to avoid unwanted movements or artifacts.
Single shot mode is better for smooth, continuous motion.
Multi shot mode works well for dynamic, action-packed content.
Disable audio if you plan to add your own soundtrack later.

Notes

Maximum video duration is 15 seconds.
Custom audio will be synced to the generated video.
1080p resolution costs 1.5× more than 720p.
Audio generation doubles the cost.

Wan 2.6 Text-to-Video — Generate videos from text prompts only.
Wan 2.6 Image-to-Video — Standard quality image-to-video generation.
LTX-2 19B Image-to-Video — Alternative I2V model with audio support.

Authentication

For authentication details, please refer to the Authentication Guide.

API Endpoints

Submit Task & Query Result

set -euo pipefail

export WAVESPEED_API_KEY="your-api-key"

REQUEST_BODY=$(cat <<'JSON'
{
  "prompt": "A cinematic ocean wave at sunrise, highly detailed",
  "image": "https://interactive-examples.mdn.mozilla.net/media/cc0-images/painted-hand-298-332.jpg",
  "resolution": "720p",
  "duration": 5,
  "shot_type": "single",
  "enable_prompt_expansion": false,
  "enable_audio": true,
  "seed": -1
}
JSON
)

# 1. Submit the prediction.
SUBMIT_RESPONSE=$(curl --silent --show-error --fail-with-body \
  -X POST "https://api.wavespeed.ai/api/v3/alibaba/wan-2.6/image-to-video-flash" \
  -H "Authorization: Bearer ${WAVESPEED_API_KEY}" \
  -H "Content-Type: application/json" \
  -d "${REQUEST_BODY}")

TASK=$(printf '%s' "${SUBMIT_RESPONSE}" | jq 'if type == "object" and has("data") then .data else . end')
PREDICTION_ID=$(printf '%s' "${TASK}" | jq -r '.id // empty')
if [ -z "${PREDICTION_ID}" ]; then
  printf 'Submission response did not contain a prediction id
' >&2
  exit 1
fi
RESULT_URL=$(printf '%s' "${TASK}" | jq -r '.urls.get // empty')
if [ -z "${RESULT_URL}" ]; then RESULT_URL="https://api.wavespeed.ai/api/v3/predictions/${PREDICTION_ID}/result"; fi

# 2. Poll until the prediction finishes.
while true; do
  RESPONSE=$(curl --silent --show-error --fail-with-body \
    "${RESULT_URL}" \
    -H "Authorization: Bearer ${WAVESPEED_API_KEY}")
  RESULT=$(printf '%s' "${RESPONSE}" | jq 'if type == "object" and has("data") then .data else . end')
  STATUS=$(printf '%s' "${RESULT}" | jq -r '.status // empty')

  case "${STATUS}" in
    completed) printf '%s\n' "${RESULT}" | jq '.outputs'; break ;;
    failed|cancelled|timeout) printf '%s\n' "${RESULT}" | jq . >&2; exit 1 ;;
    created|processing) sleep 2 ;;
    *) printf 'Unexpected status: %s
' "${STATUS}" >&2; exit 1 ;;
  esac
done

Parameters

Task Submission Parameters

Request Parameters

Parameter	Type	Required	Default	Range	Description
prompt	string	Yes		-	The positive prompt for the generation.
image	string	Yes		-	The image for generating the output.
audio	string	No	-	-	Audio URL to guide generation (optional).
negative_prompt	string	No		-	The negative prompt for the generation.
resolution	string	No	720p	720p, 1080p	The resolution of the generated media.
duration	integer	No	5	2 ~ 15	The duration of the generated media in seconds.
shot_type	string	No	single	single, multi	The type of shots to generate.
enable_prompt_expansion	boolean	No	false	-	If set to true, the prompt optimizer will be enabled.
enable_audio	boolean	No	true	-	If set to true, outputs video with audio. If false, outputs silent video.
seed	integer	No	-1	-	The random seed to use for the generation. -1 means a random seed will be used.

Response Parameters

Parameter	Type	Description
code	integer	HTTP status code (e.g., 200 for success)
message	string	Status message (e.g., “success”)
data.id	string	Unique identifier for the prediction, Task Id
data.model	string	Model ID used for the prediction
data.outputs	array	Output values, usually URL strings; some models return text strings or structured result objects (empty when status is not `completed`)
data.urls	object	Object containing related API endpoints
data.urls.get	string	URL to retrieve the prediction result
data.status	string	Status of the task: `created`, `processing`, `completed`, or `failed`
data.created_at	string	ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.error	string	Error message (empty if no error occurred)
data.timings	object	Object containing timing details
data.timings.inference	integer	Inference time in milliseconds

Result Request Parameters

Parameter	Type	Required	Default	Description
id	string	Yes	-	Task ID

Result Response Parameters

Parameter	Type	Description
code	integer	HTTP status code (e.g., 200 for success)
message	string	Status message (e.g., “success”)
data	object	The prediction data object containing all details
data.id	string	Unique identifier for the prediction
data.model	string	Model ID used for the prediction
data.outputs	array<string \| object>	Array of generated outputs (empty when status is not completed). Items are usually URL strings, but may be text strings or structured result objects, depending on the model.
data.urls	object	Object containing related API endpoints
data.urls.get	string	URL to poll for the prediction result
data.status	string	Status: `created`, `processing`, `completed`, or `failed`
data.created_at	string	ISO timestamp of when the request was created
data.error	string	Error message (empty if no error occurred)
data.timings	object	Object containing timing details
data.timings.inference	integer	Inference time in milliseconds

Overview

Alibaba Wan 2.6 Image To Video Flash

Playground

Features

Why Choose This?

Parameters

Shot Type Options

How to Use

Pricing

Billing Rules

Examples

Best Use Cases

Pro Tips

Notes

Related Models

Authentication

API Endpoints

Submit Task & Query Result

Parameters

Task Submission Parameters

Request Parameters

Response Parameters

Result Request Parameters

Result Response Parameters