Midjourney Image to Video turns a single image into an artistically rich, high quality AI video with strong visual style and creative motion. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.
Idle
$0.35per run·~28 / $10
A determined young Steampunk adventurer, looking forward with focus, adjusting her elaborate brass and leather goggles on her head, wearing a gear-adorned top hat, dramatic low-key lighting, deep warm tones, cinematic shot, hyper-detailed, action pose.
A detailed illustration of an anthropomorphic fox wearing glasses, reading in an alchemist's workshop filled with magical artifacts, finely detailed illustration, warm oil lamp lighting, high level of detail,
Zoom in and girl smiling at the camera.
The girl fiddled with her pose and winked at the camera.
A holographic swarm of butterflies flickering around the fingertips of a girl in a high-tech fiber suit, Digital Art, Octane Render, neon green and electric blue light sources, semi-transparent material texture, extremely sharp image fidelity.
鱼在池塘里游荡,高级感,电影质感
Explosive close-up of a shattered biomechanical heart, crafted from dark polished chrome and intricate clockwork gears, mechanical debris flying outwards. Suspended in a deep crimson void, dramatic low-key lighting, highly reflective surfaces, concept art, ultra-detailed, cinematic composition.
A bird flew over from outside the picture and stopped on the flowers.
The camera zooms in, and the characters are looking for light.
Midjourney Image-to-Video generates a video from a single input image. Upload an image, optionally describe the desired motion or scene, choose the output resolution, and select the motion strength for the generated video.
Image-to-video generation
Convert a single input image into a generated video.
Optional motion guidance
Add a prompt to describe the motion, camera movement, or scene direction.
Resolution options
Choose between 480p and 720p depending on your quality and cost requirements.
Motion strength control
Select low or high motion strength to control how much movement appears in the generated video.
Simple workflow
Only an input image is required. If no prompt is provided, a gentle cinematic camera movement is used.
| Parameter | Required | Description |
|---|---|---|
| image | Yes | Input image URL used as the first frame. |
| prompt | No | Optional motion or scene guidance. If omitted, a gentle cinematic camera movement is used. |
| resolution | No | Output video resolution. Supported values: 480p, 720p. Default: 480p. |
| motion | No | Motion strength for the generated video. Supported values: low, high. Default: low. |
480p or 720p. The default is 480p.low or high. The default is low.Returns generated video URL(s) in the standard WaveSpeed prediction response.
| Resolution | Cost |
|---|---|
| 480p | $0.35 |
| 720p | $1.10 |
480p costs $0.35 per generation.720p costs $1.10 per generation.resolution.prompt empty when you want the default gentle cinematic camera movement.low motion for more subtle movement.high motion when you want stronger movement.480p for lower-cost testing and 720p for higher-resolution output.image is the only required field.prompt, resolution, and motion are optional.resolution is 480p.motion is low.prompt is omitted, a gentle cinematic camera movement is used.Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/midjourney/image-to-video with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status flips to completed, then read the output URL from data.outputs[0]. Examples for Image To Video below.
# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/midjourney/image-to-video" \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $WAVESPEED_API_KEY" \
-d '{
"image": "https://example.com/your-input.jpg",
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"resolution": "480p",
"motion": "low"
}'
# Response includes a prediction id. Poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
-H "Authorization: Bearer $WAVESPEED_API_KEY"
# When status is "completed", read the output from data.outputs[0].// npm install wavespeed
const WaveSpeed = require('wavespeed');
const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env
const result = await client.run("midjourney/image-to-video", {
"image": "https://example.com/your-input.jpg",
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"resolution": "480p",
"motion": "low"
});
console.log(result.outputs[0]); // → URL of the generated output# pip install wavespeed
import wavespeed
output = wavespeed.run(
"midjourney/image-to-video",
{
"image": "https://example.com/your-input.jpg",
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"resolution": "480p",
"motion": "low"
}
)
print(output["outputs"][0]) # → URL of the generated outputImage To Video is a Midjourney model for video generation from images, exposed as a REST API on WaveSpeedAI. Midjourney Image to Video turns a single image into an artistically rich, high quality AI video with strong visual style and creative motion. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing. You can call it programmatically or try it from the playground above.
POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/midjourney/midjourney-image-to-video.
Image To Video starts at $0.35 per run. That figure is the base price — the final charge scales with the parameters you set in the form (output size, length, count, references, or whatever knobs this model exposes), so a higher-quality or larger output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.
Key inputs: `prompt`, `image`, `resolution`, `motion`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/midjourney/midjourney-image-to-video.
Average end-to-end generation time on WaveSpeedAI is around 85 seconds per request — measured across recent runs. Queue time scales with global demand; live status is visible in the prediction record.
Commercial usage rights depend on the model's license, set by its provider (Midjourney). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.