Skywork Ai Skyreels V4 Image To Video

Playground

SkyReels V4 Image to Video is a fast AI image-to-video generation model that creates high-quality videos from image references and text prompts using the SkyReels V4 image2video workflow. Ready-to-use REST inference API for animating images, product videos, character motion, branded storytelling, social media clips, advertising creatives, and professional image-to-video workflows with simple integration, no coldstarts, and affordable pricing.

Features

Skywork AI SkyReels V4 Image-to-Video

Skywork AI SkyReels V4 Image-to-Video generates videos from a starting image, with optional middle-frame and end-frame guidance for stronger visual control. It supports standard and fast generation modes, multiple resolutions, optional sound effects, and prompt-driven motion design for cinematic, product, and storytelling workflows.

Why Choose This?

Image-guided video generation Start from a first-frame image and turn it into a motion video clip with prompt-based control.
Multi-frame guidance Optionally add middle-frame images and an end-frame image to better control progression, structure, and visual consistency.
Two generation modes Choose std for higher-quality output or fast for quicker, lower-cost generation.
Multiple resolution options Supports 480p, 720p, and 1080p to balance quality and budget.
Optional sound effects Enable sound when you want the video generated with audio effects.
Production-ready workflow Suitable for product videos, stylized motion design, short-form storytelling, and visual prototyping.

Parameters

Parameter	Required	Description
prompt	Yes	The prompt describing the video motion or camera behavior.
first_frame_image	Yes	First frame image URL.
end_frame_image	No	Optional end frame image URL.
images	No	Optional middle frame image URLs. Upload up to `6` images.
duration	No	Duration of the generated video in seconds. Range: `3–15`. Default: `5`.
resolution	No	Output video resolution. Supported values: `480p`, `720p`, `1080p`. Default: `1080p`.
sound	No	Whether to generate sound effects with the video. Default: `false`.
mode	No	Quality/performance mode. Supported values: `std`, `fast`. Default: `std`. `fast` mode currently requires `sound=false`.

How to Use

Upload the first frame — provide the starting image for the video.
Write your prompt — describe the motion, camera movement, pacing, and visual behavior you want.
Add guide frames (optional) — upload middle-frame images and/or an end-frame image for more controlled progression.
Choose duration — select a clip length between 3 and 15 seconds.
Choose resolution — use 480p, 720p, or 1080p depending on quality and budget needs.
Choose mode — use std for higher quality or fast for quicker generation.
Enable sound (optional) — turn this on if you want generated sound effects. If using fast, keep sound=false.
Submit — run the model and download the generated video.

Example Prompt

A cinematic product reveal with smooth forward camera motion, soft reflections, elegant studio lighting, subtle object rotation, and premium commercial pacing.

Pricing

Pricing depends on duration, resolution, and mode.

Standard Mode

Resolution	Per Second	5s Cost
480p	$0.11	$0.55
720p	$0.14	$0.70
1080p	$0.35	$1.75

Fast Mode

Resolution	Per Second	5s Cost
480p	$0.08	$0.40
720p	$0.11	$0.55
1080p	$0.275	$1.375

Example Costs

Standard Mode

Resolution	3s	5s	10s	15s
480p	$0.33	$0.55	$1.10	$1.65
720p	$0.42	$0.70	$1.40	$2.10
1080p	$1.05	$1.75	$3.50	$5.25

Fast Mode

Resolution	3s	5s	10s	15s
480p	$0.24	$0.40	$0.80	$1.20
720p	$0.33	$0.55	$1.10	$1.65
1080p	$0.825	$1.375	$2.75	$4.125

Billing Rules

Base multiplier starts from $0.10 per second
Pricing scales linearly with duration
std and fast use different resolution multipliers
sound does not affect pricing directly
fast mode currently requires sound=false

Best Use Cases

Product motion videos — Turn still product shots into polished reveal clips.
Storytelling sequences — Use start, middle, and end guidance to shape a clear visual arc.
Creative prototyping — Test motion concepts quickly with fast mode.
Social and ad content — Generate short-form videos with clear visual direction.
Cinematic image animation — Create controlled motion from a sequence of reference frames.

Pro Tips

Use a strong first-frame image for better visual consistency.
Add middle or end frames when scene progression matters more than freeform motion.
Keep prompts focused on motion, pacing, and camera behavior rather than static visual details already present in the images.
Use fast mode for quick iteration, then switch to std for final-quality output.
Keep sound=false when using fast mode.
Start with shorter durations to validate motion before generating longer clips.

Notes

prompt and first_frame_image are required.
images supports up to 6 optional middle-frame images.
duration supports 3–15 seconds.
resolution defaults to 1080p.
mode defaults to std.
fast mode currently requires sound=false.
Pricing depends on duration, resolution, and mode.

Skywork AI SkyReels V4 Text-to-Video — Generate videos directly from text prompts.
Skywork AI SkyReels V3 Reference-to-Video — Generate videos from one to four reference images and a prompt.
Skywork AI SkyReels V3 Extend Video — Continue an existing video clip with newly generated footage.
Skywork AI SkyReels V3 Pro Multi Avatar — Higher-tier two-speaker avatar generation from one scene image.

Authentication

For authentication details, please refer to the Authentication Guide.

API Endpoints

Submit Task & Query Result


# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/skywork-ai/skyreels-v4/image-to-video" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
    "duration": 5,
    "resolution": "1080p",
    "sound": false,
    "mode": "std"
}'

# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"

Parameters

Task Submission Parameters

Request Parameters

Parameter	Type	Required	Default	Range	Description
prompt	string	Yes		-	The prompt describing the video motion or camera behavior.
first_frame_image	string	Yes	-	-	First frame image URL.
end_frame_image	string	No	-	-	Optional end frame image URL.
images	array	No	[]	-	Optional middle frame image URLs. Upload up to 6 images.
duration	integer	No	5	3 ~ 15	Duration of the generated video in seconds.
resolution	string	No	1080p	480p, 720p, 1080p	Output video resolution.
sound	boolean	No	false	-	Whether to generate sound effects with the video.
mode	string	No	std	std, fast	Quality/performance mode. Fast mode currently requires sound to be false.

Response Parameters

Parameter	Type	Description
code	integer	HTTP status code (e.g., 200 for success)
message	string	Status message (e.g., “success”)
data.id	string	Unique identifier for the prediction, Task Id
data.model	string	Model ID used for the prediction
data.outputs	array	Array of URLs to the generated content (empty when status is not `completed`)
data.urls	object	Object containing related API endpoints
data.urls.get	string	URL to retrieve the prediction result
data.status	string	Status of the task: `created`, `processing`, `completed`, or `failed`
data.created_at	string	ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.error	string	Error message (empty if no error occurred)
data.timings	object	Object containing timing details
data.timings.inference	integer	Inference time in milliseconds

Result Request Parameters

Parameter	Type	Required	Default	Description
id	string	Yes	-	Task ID

Result Response Parameters

Parameter	Type	Description
code	integer	HTTP status code (e.g., 200 for success)
message	string	Status message (e.g., “success”)
data	object	The prediction data object containing all details
data.id	string	Unique identifier for the prediction, the ID of the prediction to get
data.model	string	Model ID used for the prediction
data.outputs	string	Array of URLs to the generated content.
data.urls	object	Object containing related API endpoints
data.urls.get	string	URL to retrieve the prediction result
data.status	string	Status of the task: `created`, `processing`, `completed`, or `failed`
data.created_at	string	ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.error	string	Error message (empty if no error occurred)
data.timings	object	Object containing timing details
data.timings.inference	integer	Inference time in milliseconds

Skywork AI Skyreels V3 Standard Single Avatar Skywork AI Skyreels V4 Reference To Video

Skywork Ai Skyreels V4 Image To Video

Playground

Features

Skywork AI SkyReels V4 Image-to-Video

Why Choose This?

Parameters

How to Use

Example Prompt

Pricing

Standard Mode

Fast Mode

Example Costs

Standard Mode

Fast Mode

Billing Rules

Best Use Cases

Pro Tips

Notes

Related Models

Authentication

API Endpoints

Submit Task & Query Result

Parameters

Task Submission Parameters

Request Parameters

Response Parameters

Result Request Parameters

Result Response Parameters