Browse ModelsSkywork AISkywork AI Skyreels V4 Reference To Video

Skywork Ai Skyreels V4 Reference To Video

Skywork Ai Skyreels V4 Reference To Video

Playground

Try it on WavespeedAI!

SkyReels V4 Reference to Video is a fast AI reference-to-video generation model that creates high-quality videos from reference images or a reference video and a text prompt using the SkyReels V4 omni-video workflow. Ready-to-use REST inference API for character-consistent videos, product visuals, branded storytelling, social media clips, advertising creatives, concept videos, and professional reference-based video generation workflows with simple integration, no coldstarts, and affordable pricing.

Features

Skywork AI SkyReels V4 Reference-to-Video

Skywork AI SkyReels V4 Reference-to-Video generates videos from reference images, with optional reference video guidance for stronger motion and scene control. It supports standard and fast modes, multiple resolutions, optional sound effects, and prompt-driven generation for character consistency, product motion, scene transfer, and other reference-based video workflows.


Why Choose This?

  • Reference-guided video generation Use up to 3 reference images to guide identity, appearance, style, or scene elements.

  • Optional reference video support Add up to 1 reference video when you want stronger motion or temporal guidance.

  • Two generation modes Choose std for higher-quality output or fast for quicker, lower-cost generation.

  • Multiple resolutions Supports 480p, 720p, and 1080p to balance quality and budget.

  • Flexible aspect ratios Choose 16:9, 9:16, or 1:1 depending on your target platform.

  • Production-ready workflow Suitable for character-driven video, product clips, stylized motion design, and reference-based storytelling.


Parameters

ParameterRequiredDescription
promptYesThe prompt describing the generated video. Reference tags are added automatically when missing.
imagesNoReference image URLs. Upload up to 3 images.
ref_videosNoOptional reference video URL. Upload at most 1 video.
aspect_ratioNoAspect ratio of the generated video. Supported values: 16:9, 9:16, 1:1. Default: 16:9.
durationNoDuration of the generated video in seconds. Range: 3–15. Default: 5.
resolutionNoOutput video resolution. Supported values: 480p, 720p, 1080p. Default: 1080p.
soundNoWhether to generate sound effects with the video. Default: false.
modeNoQuality/performance mode. Supported values: std, fast. Default: std. fast mode currently requires sound=false.

How to Use

  1. Write your prompt — describe the motion, scene, camera movement, and visual direction you want.
  2. Add reference images (optional) — upload up to 3 images to guide identity, look, or scene consistency.
  3. Add a reference video (optional) — upload 1 video if you want stronger motion or temporal guidance.
  4. Choose aspect ratio — select 16:9, 9:16, or 1:1.
  5. Set duration — choose a clip length between 3 and 15 seconds.
  6. Choose resolution — use 480p, 720p, or 1080p depending on quality and cost needs.
  7. Choose mode — use std for higher quality or fast for quicker generation.
  8. Enable sound (optional) — turn this on if you want generated sound effects. If using fast, keep sound=false.
  9. Submit — run the model and download the generated video.

Example Prompt

A cinematic fashion shot with smooth camera movement, elegant character motion, soft studio lighting, premium commercial pacing, and stable identity across the sequence.


Pricing

Pricing depends on duration, resolution, mode, and whether you use a reference video.

Without Reference Video

Standard Mode

ResolutionPer Second5s Cost
480p$0.11$0.55
720p$0.14$0.70
1080p$0.35$1.75

Fast Mode

ResolutionPer Second5s Cost
480p$0.08$0.40
720p$0.11$0.55
1080p$0.275$1.375

With Reference Video

Standard Mode

ResolutionPer Second5s Cost
480p$0.18$0.90
720p$0.25$1.25
1080p$0.625$3.125

Fast Mode

ResolutionPer Second5s Cost
480p$0.15$0.75
720p$0.20$1.00
1080p$0.50$2.50

Example Costs

Without Reference Video · Standard Mode

Resolution3s5s10s15s
480p$0.33$0.55$1.10$1.65
720p$0.42$0.70$1.40$2.10
1080p$1.05$1.75$3.50$5.25

Without Reference Video · Fast Mode

Resolution3s5s10s15s
480p$0.24$0.40$0.80$1.20
720p$0.33$0.55$1.10$1.65
1080p$0.825$1.375$2.75$4.125

With Reference Video · Standard Mode

Resolution3s5s10s15s
480p$0.54$0.90$1.80$2.70
720p$0.75$1.25$2.50$3.75
1080p$1.875$3.125$6.25$9.375

With Reference Video · Fast Mode

Resolution3s5s10s15s
480p$0.45$0.75$1.50$2.25
720p$0.60$1.00$2.00$3.00
1080p$1.50$2.50$5.00$7.50

Billing Rules

  • Base multiplier starts from $0.10 per second
  • Pricing scales linearly with duration
  • Prices differ between std and fast
  • Adding a ref_videos input increases the rate
  • sound does not affect pricing directly
  • fast mode currently requires sound=false

Best Use Cases

  • Character consistency — Use reference images to keep the same person or style across a clip.
  • Motion-guided generation — Add a reference video when movement or pacing matters.
  • Product and fashion videos — Build polished, controlled motion clips from still references.
  • Creative prototyping — Use fast mode for quick iteration before moving to std.
  • Reference-based storytelling — Combine image identity guidance with optional motion guidance.

Pro Tips

  • Use reference images when identity, styling, or appearance matters most.
  • Add a reference video only when you need stronger motion or temporal guidance.
  • Keep prompts focused on motion, camera behavior, and scene progression.
  • Use fast mode for rough iteration, then switch to std for final-quality output.
  • Keep sound=false in fast mode.
  • Start with shorter durations to validate the concept before generating longer clips.

Notes

  • prompt is required.
  • images supports up to 3 reference images.
  • ref_videos supports up to 1 reference video.
  • duration supports 3–15 seconds.
  • resolution defaults to 1080p.
  • mode defaults to std.
  • fast mode currently requires sound=false.
  • Pricing depends on duration, resolution, mode, and whether a reference video is included.

Authentication

For authentication details, please refer to the Authentication Guide.

API Endpoints

Submit Task & Query Result


# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/skywork-ai/skyreels-v4/reference-to-video" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
    "aspect_ratio": "16:9",
    "duration": 5,
    "resolution": "1080p",
    "sound": false,
    "mode": "std"
}'

# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"

Parameters

Task Submission Parameters

Request Parameters

ParameterTypeRequiredDefaultRangeDescription
promptstringYes-The prompt describing the generated video. Reference tags are added automatically when missing.
imagesarrayNo[]-Reference image URLs. Upload up to 3 images.
ref_videosarrayNo--Optional reference video URL. Upload at most 1 video.
aspect_ratiostringNo16:916:9, 9:16, 1:1Aspect ratio of the generated video.
durationintegerNo53 ~ 15Duration of the generated video in seconds.
resolutionstringNo1080p480p, 720p, 1080pOutput video resolution.
soundbooleanNofalse-Whether to generate sound effects with the video.
modestringNostdstd, fastQuality/performance mode. Fast mode currently requires sound to be false.

Response Parameters

ParameterTypeDescription
codeintegerHTTP status code (e.g., 200 for success)
messagestringStatus message (e.g., “success”)
data.idstringUnique identifier for the prediction, Task Id
data.modelstringModel ID used for the prediction
data.outputsarrayArray of URLs to the generated content (empty when status is not completed)
data.urlsobjectObject containing related API endpoints
data.urls.getstringURL to retrieve the prediction result
data.statusstringStatus of the task: created, processing, completed, or failed
data.created_atstringISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.errorstringError message (empty if no error occurred)
data.timingsobjectObject containing timing details
data.timings.inferenceintegerInference time in milliseconds

Result Request Parameters

ParameterTypeRequiredDefaultDescription
idstringYes-Task ID

Result Response Parameters

ParameterTypeDescription
codeintegerHTTP status code (e.g., 200 for success)
messagestringStatus message (e.g., “success”)
dataobjectThe prediction data object containing all details
data.idstringUnique identifier for the prediction, the ID of the prediction to get
data.modelstringModel ID used for the prediction
data.outputsstringArray of URLs to the generated content.
data.urlsobjectObject containing related API endpoints
data.urls.getstringURL to retrieve the prediction result
data.statusstringStatus of the task: created, processing, completed, or failed
data.created_atstringISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.errorstringError message (empty if no error occurred)
data.timingsobjectObject containing timing details
data.timings.inferenceintegerInference time in milliseconds
© 2025 WaveSpeedAI. All rights reserved.