Browse ModelsSkywork AISkywork AI Skyreels V4 Image To Video

Skywork Ai Skyreels V4 Image To Video

Skywork Ai Skyreels V4 Image To Video

Playground

Try it on WavespeedAI!

SkyReels V4 Image to Video is a fast AI image-to-video generation model that creates high-quality videos from image references and text prompts using the SkyReels V4 image2video workflow. Ready-to-use REST inference API for animating images, product videos, character motion, branded storytelling, social media clips, advertising creatives, and professional image-to-video workflows with simple integration, no coldstarts, and affordable pricing.

Features

Skywork AI SkyReels V4 Image-to-Video

Skywork AI SkyReels V4 Image-to-Video generates videos from a starting image, with optional middle-frame and end-frame guidance for stronger visual control. It supports standard and fast generation modes, multiple resolutions, optional sound effects, and prompt-driven motion design for cinematic, product, and storytelling workflows.


Why Choose This?

  • Image-guided video generation Start from a first-frame image and turn it into a motion video clip with prompt-based control.

  • Multi-frame guidance Optionally add middle-frame images and an end-frame image to better control progression, structure, and visual consistency.

  • Two generation modes Choose std for higher-quality output or fast for quicker, lower-cost generation.

  • Multiple resolution options Supports 480p, 720p, and 1080p to balance quality and budget.

  • Optional sound effects Enable sound when you want the video generated with audio effects.

  • Production-ready workflow Suitable for product videos, stylized motion design, short-form storytelling, and visual prototyping.


Parameters

ParameterRequiredDescription
promptYesThe prompt describing the video motion or camera behavior.
first_frame_imageYesFirst frame image URL.
end_frame_imageNoOptional end frame image URL.
imagesNoOptional middle frame image URLs. Upload up to 6 images.
durationNoDuration of the generated video in seconds. Range: 3–15. Default: 5.
resolutionNoOutput video resolution. Supported values: 480p, 720p, 1080p. Default: 1080p.
soundNoWhether to generate sound effects with the video. Default: false.
modeNoQuality/performance mode. Supported values: std, fast. Default: std. fast mode currently requires sound=false.

How to Use

  1. Upload the first frame — provide the starting image for the video.
  2. Write your prompt — describe the motion, camera movement, pacing, and visual behavior you want.
  3. Add guide frames (optional) — upload middle-frame images and/or an end-frame image for more controlled progression.
  4. Choose duration — select a clip length between 3 and 15 seconds.
  5. Choose resolution — use 480p, 720p, or 1080p depending on quality and budget needs.
  6. Choose mode — use std for higher quality or fast for quicker generation.
  7. Enable sound (optional) — turn this on if you want generated sound effects. If using fast, keep sound=false.
  8. Submit — run the model and download the generated video.

Example Prompt

A cinematic product reveal with smooth forward camera motion, soft reflections, elegant studio lighting, subtle object rotation, and premium commercial pacing.


Pricing

Pricing depends on duration, resolution, and mode.

Standard Mode

ResolutionPer Second5s Cost
480p$0.11$0.55
720p$0.14$0.70
1080p$0.35$1.75

Fast Mode

ResolutionPer Second5s Cost
480p$0.08$0.40
720p$0.11$0.55
1080p$0.275$1.375

Example Costs

Standard Mode

Resolution3s5s10s15s
480p$0.33$0.55$1.10$1.65
720p$0.42$0.70$1.40$2.10
1080p$1.05$1.75$3.50$5.25

Fast Mode

Resolution3s5s10s15s
480p$0.24$0.40$0.80$1.20
720p$0.33$0.55$1.10$1.65
1080p$0.825$1.375$2.75$4.125

Billing Rules

  • Base multiplier starts from $0.10 per second
  • Pricing scales linearly with duration
  • std and fast use different resolution multipliers
  • sound does not affect pricing directly
  • fast mode currently requires sound=false

Best Use Cases

  • Product motion videos — Turn still product shots into polished reveal clips.
  • Storytelling sequences — Use start, middle, and end guidance to shape a clear visual arc.
  • Creative prototyping — Test motion concepts quickly with fast mode.
  • Social and ad content — Generate short-form videos with clear visual direction.
  • Cinematic image animation — Create controlled motion from a sequence of reference frames.

Pro Tips

  • Use a strong first-frame image for better visual consistency.
  • Add middle or end frames when scene progression matters more than freeform motion.
  • Keep prompts focused on motion, pacing, and camera behavior rather than static visual details already present in the images.
  • Use fast mode for quick iteration, then switch to std for final-quality output.
  • Keep sound=false when using fast mode.
  • Start with shorter durations to validate motion before generating longer clips.

Notes

  • prompt and first_frame_image are required.
  • images supports up to 6 optional middle-frame images.
  • duration supports 3–15 seconds.
  • resolution defaults to 1080p.
  • mode defaults to std.
  • fast mode currently requires sound=false.
  • Pricing depends on duration, resolution, and mode.

Authentication

For authentication details, please refer to the Authentication Guide.

API Endpoints

Submit Task & Query Result


# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/skywork-ai/skyreels-v4/image-to-video" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
    "duration": 5,
    "resolution": "1080p",
    "sound": false,
    "mode": "std"
}'

# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"

Parameters

Task Submission Parameters

Request Parameters

ParameterTypeRequiredDefaultRangeDescription
promptstringYes-The prompt describing the video motion or camera behavior.
first_frame_imagestringYes--First frame image URL.
end_frame_imagestringNo--Optional end frame image URL.
imagesarrayNo[]-Optional middle frame image URLs. Upload up to 6 images.
durationintegerNo53 ~ 15Duration of the generated video in seconds.
resolutionstringNo1080p480p, 720p, 1080pOutput video resolution.
soundbooleanNofalse-Whether to generate sound effects with the video.
modestringNostdstd, fastQuality/performance mode. Fast mode currently requires sound to be false.

Response Parameters

ParameterTypeDescription
codeintegerHTTP status code (e.g., 200 for success)
messagestringStatus message (e.g., “success”)
data.idstringUnique identifier for the prediction, Task Id
data.modelstringModel ID used for the prediction
data.outputsarrayArray of URLs to the generated content (empty when status is not completed)
data.urlsobjectObject containing related API endpoints
data.urls.getstringURL to retrieve the prediction result
data.statusstringStatus of the task: created, processing, completed, or failed
data.created_atstringISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.errorstringError message (empty if no error occurred)
data.timingsobjectObject containing timing details
data.timings.inferenceintegerInference time in milliseconds

Result Request Parameters

ParameterTypeRequiredDefaultDescription
idstringYes-Task ID

Result Response Parameters

ParameterTypeDescription
codeintegerHTTP status code (e.g., 200 for success)
messagestringStatus message (e.g., “success”)
dataobjectThe prediction data object containing all details
data.idstringUnique identifier for the prediction, the ID of the prediction to get
data.modelstringModel ID used for the prediction
data.outputsstringArray of URLs to the generated content.
data.urlsobjectObject containing related API endpoints
data.urls.getstringURL to retrieve the prediction result
data.statusstringStatus of the task: created, processing, completed, or failed
data.created_atstringISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.errorstringError message (empty if no error occurred)
data.timingsobjectObject containing timing details
data.timings.inferenceintegerInference time in milliseconds
© 2025 WaveSpeedAI. All rights reserved.