Midjourney Image-to-Video turns a single image into an artistically rich, high-quality video using Midjourney's creative AI. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.
$0.025 per run · ~40 runs per $1
A determined young Steampunk adventurer, looking forward with focus, adjusting her elaborate brass and leather goggles on her head, wearing a gear-adorned top hat, dramatic low-key lighting, deep warm tones, cinematic shot, hyper-detailed, action pose.
A detailed illustration of an anthropomorphic fox wearing glasses, reading in an alchemist's workshop filled with magical artifacts, finely detailed illustration, warm oil lamp lighting, high level of detail.
Zoom in and girl smiling at the camera.
The girl fiddled with her pose and winked at the camera.
A holographic swarm of butterflies flickering around the fingertips of a girl in a high-tech fiber suit, Digital Art, Octane Render, neon green and electric blue light sources, semi-transparent material texture, extremely sharp image fidelity.
Fish drifting around a pond, premium feel, cinematic texture.
Explosive close-up of a shattered biomechanical heart, crafted from dark polished chrome and intricate clockwork gears, mechanical debris flying outwards. Suspended in a deep crimson void, dramatic low-key lighting, highly reflective surfaces, concept art, ultra-detailed, cinematic composition.
A bird flew over from outside the picture and stopped on the flowers.
The camera zooms in, and the characters are looking for light.
Midjourney Image-to-Video transforms static artworks or illustrations into cinematic video clips with motion, lighting, and depth — while maintaining Midjourney’s signature artistic fidelity. This model captures subtle camera movement, breathing effects, and dynamic perspective shifts, bringing your images vividly to life.
Adjustable style parameters: chaos, stylize, and weird.
Input: one reference image (JPEG / PNG / WebP).
Output duration: fixed short cinematic clip (~5 seconds).
Resolutions:
480p for faster, lower-cost generation.
720p for enhanced clarity and professional output.
Supported aspect ratios: 1:1, 16:9, 4:3, 3:4, 2:3, 9:16, etc.
Estimated processing time: 30–60 seconds per video (depending on prompt complexity).
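Because the model takes one reference image and a fixed set of aspect ratios, it can help to pick the ratio closest to the source image before submitting. The following is a minimal sketch of that idea, not part of the WaveSpeedAI SDK: `closest_aspect_ratio` and `SUPPORTED_RATIOS` are hypothetical names, and the ratio list only covers the values named above (the original list ends in "etc.", so it may be incomplete).

```python
import math

# Ratios named in the spec list; the source list ends in "etc.", so treating
# this set as complete is an assumption.
SUPPORTED_RATIOS = ["1:1", "16:9", "4:3", "3:4", "2:3", "9:16"]


def closest_aspect_ratio(width: int, height: int) -> str:
    """Return the listed ratio string nearest to width/height."""
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    target = math.log(width / height)

    def distance(ratio: str) -> float:
        w, h = ratio.split(":")
        # Compare in log space so 2:1 and 1:2 are equally far from 1:1.
        return abs(math.log(int(w) / int(h)) - target)

    return min(SUPPORTED_RATIOS, key=distance)


print(closest_aspect_ratio(1920, 1080))  # 16:9
print(closest_aspect_ratio(1080, 1350))  # 3:4
```

The returned string can be passed as the `aspect_ratio` input so the generated clip is cropped or padded as little as possible.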
| Resolution | Example Price |
|---|---|
| 480p | $0.15 |
| 720p | $0.48 |
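For budgeting a batch, the example prices above can be turned into a quick estimate. This is only a sketch under the assumption that each video is billed at the flat example price for its resolution; the helper names are hypothetical, and the live price shown in the playground takes precedence.

```python
from decimal import Decimal

# Example per-video prices copied from the table above; live pricing may differ.
EXAMPLE_PRICE_USD = {"480p": Decimal("0.15"), "720p": Decimal("0.48")}


def estimate_cost(resolution: str, videos: int) -> Decimal:
    """Total example cost in USD for a number of videos at one resolution."""
    if resolution not in EXAMPLE_PRICE_USD:
        raise ValueError(f"unknown resolution: {resolution}")
    if videos < 0:
        raise ValueError("videos must be non-negative")
    return EXAMPLE_PRICE_USD[resolution] * videos


def videos_within_budget(resolution: str, budget_usd: str) -> int:
    """How many whole videos fit in a budget given as a decimal string."""
    if resolution not in EXAMPLE_PRICE_USD:
        raise ValueError(f"unknown resolution: {resolution}")
    return int(Decimal(budget_usd) // EXAMPLE_PRICE_USD[resolution])


print(estimate_cost("720p", 25))           # 12.00
print(videos_within_budget("480p", "10"))  # 66
```

`Decimal` is used instead of floats so that currency arithmetic stays exact.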
| Parameter | Description |
|---|---|
| resolution | Choose 480p or 720p. |
| aspect_ratio | Controls video composition (e.g., 16:9 for widescreen). |
| quality | Higher values produce more refined visuals but increase render time. |
| stylize | Adjusts how strongly Midjourney’s artistic style influences the animation. |
| chaos | Adds controlled randomness to camera and scene motion. |
| weird | Introduces surreal, experimental effects for creative exploration. |
| version | Model version selector (7 is current). |
| niji | Toggles anime-style adaptation (turn off for a realistic style). |
| seed | Fix this for reproducible motion results. |
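One way to keep requests consistent is to assemble the input from these parameters in a single place and reject obvious mistakes before submitting. The sketch below is illustrative only: `build_payload` is a hypothetical helper, the key names follow the request examples on this page, and only `resolution` is validated because the table does not state value ranges for the other knobs.

```python
from typing import Any, Dict

ALLOWED_RESOLUTIONS = {"480p", "720p"}                 # from the parameter table
STYLE_KEYS = {"quality", "stylize", "chaos", "weird"}  # numeric knobs in the table


def build_payload(image_url: str, prompt: str, *, resolution: str = "480p",
                  aspect_ratio: str = "16:9", seed: int = -1,
                  **style: Any) -> Dict[str, Any]:
    """Assemble a request body; seed=-1 mirrors the request example on this page."""
    if not image_url:
        raise ValueError("image_url is required")
    if resolution not in ALLOWED_RESOLUTIONS:
        raise ValueError(f"resolution must be one of {sorted(ALLOWED_RESOLUTIONS)}")
    unknown = set(style) - STYLE_KEYS
    if unknown:
        raise ValueError(f"unknown parameters: {sorted(unknown)}")
    payload: Dict[str, Any] = {
        "image": image_url,
        "prompt": prompt,
        "resolution": resolution,
        "aspect_ratio": aspect_ratio,
        "seed": seed,
    }
    payload.update(style)  # only the knobs the caller actually set
    return payload


print(build_payload("https://example.com/your-input.jpg",
                    "Zoom in slowly", stylize=100, seed=1234))
```

Passing a fixed `seed`, as in the last call, keeps the motion reproducible while other settings are adjusted.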
stylize for artistry
chaos and weird for creativity
quality for detail
The stylize parameter defines how strongly Midjourney’s internal aesthetic “rules” guide your output.
“Salvador Dali painting of surreal architecture, by Salvador Dali --stylize 250”
Midjourney has confirmed that the higher the --stylize value, the more closely results align with the average user’s preferred visual aesthetic.
| Goal | Suggested Settings |
|---|---|
| Precise control / faithful to prompt | stylize=100, chaos=0, weird=0 |
| Visually beautiful / cohesive | stylize=300, chaos=0, weird=0 |
| Creative exploration | stylize=150, chaos=20–40, weird=100–400 |
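The suggested settings can be kept as named presets and merged into a request. This is a sketch with hypothetical names (`PRESETS`, `apply_preset`); where the table gives a range, the low end is chosen as a starting point, which is an assumption rather than a recommendation from the source.

```python
from typing import Any, Dict

PRESETS: Dict[str, Dict[str, int]] = {
    "faithful": {"stylize": 100, "chaos": 0, "weird": 0},
    "beautiful": {"stylize": 300, "chaos": 0, "weird": 0},
    # The table gives ranges here (chaos 20-40, weird 100-400);
    # the low end of each range is used as a starting point.
    "exploratory": {"stylize": 150, "chaos": 20, "weird": 100},
}


def apply_preset(payload: Dict[str, Any], goal: str) -> Dict[str, Any]:
    """Return a copy of payload with the preset's style values applied."""
    if goal not in PRESETS:
        raise KeyError(f"unknown goal: {goal}")
    merged = dict(payload)  # leave the caller's dict untouched
    merged.update(PRESETS[goal])
    return merged


base = {"prompt": "The camera zooms in", "image": "https://example.com/your-input.jpg"}
print(apply_preset(base, "exploratory"))
```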
Lower stylize to make Midjourney listen more closely to your wording.
Raise stylize for a more polished or realistic look.
Avoid chaos or weird unless experimenting with concept art.
Fix seed to refine results iteratively.
High weird values can produce abstract deformation effects; experiment with them for creative results.

Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/midjourney/image-to-video with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status flips to completed, then read the output URL from data.outputs[0]. Examples for Image To Video follow.
# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/midjourney/image-to-video" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY" \
  -d '{
    "prompt": "A cinematic shot of a city at sunset, soft golden light",
    "image": "https://example.com/your-input.jpg",
    "resolution": "480p",
    "aspect_ratio": "1:1",
    "motion": "low",
    "quality": 1,
    "stylize": 0,
    "chaos": 0,
    "weird": 0,
    "seed": -1,
    "enable_base64_output": false
  }'

# Response includes a prediction id. Poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY"

# When status is "completed", read the output from data.outputs[0].

// npm install wavespeed
const WaveSpeed = require('wavespeed');
const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env

// CommonJS has no top-level await, so the call runs inside an async function.
async function main() {
  const result = await client.run("midjourney/image-to-video", {
    "prompt": "A cinematic shot of a city at sunset, soft golden light",
    "image": "https://example.com/your-input.jpg",
    "resolution": "480p",
    "aspect_ratio": "1:1",
    "motion": "low",
    "quality": 1,
    "stylize": 0,
    "chaos": 0,
    "weird": 0,
    "seed": -1,
    "enable_base64_output": false
  });
  console.log(result.outputs[0]); // → URL of the generated output
}

main().catch(console.error);

# pip install wavespeed
import wavespeed

output = wavespeed.run(
    "midjourney/image-to-video",
    {
        "prompt": "A cinematic shot of a city at sunset, soft golden light",
        "image": "https://example.com/your-input.jpg",
        "resolution": "480p",
        "aspect_ratio": "1:1",
        "motion": "low",
        "quality": 1,
        "stylize": 0,
        "chaos": 0,
        "weird": 0,
        "seed": -1,
        "enable_base64_output": false
    }
)
print(output["outputs"][0])  # → URL of the generated output

Image To Video is a Midjourney model for video generation from images, exposed as a REST API on WaveSpeedAI. You can call it programmatically or try it from the playground above.
POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/midjourney/midjourney-image-to-video.
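The submit-and-poll flow described above can be sketched with only the Python standard library. The endpoint URLs are the ones shown in the cURL example on this page; everything else is an assumption made for illustration: the helper names are hypothetical, the prediction id is assumed to sit at `data.id`, the status at `data.status`, and `"failed"` is an assumed failure status. Check the API reference for the actual response shape.

```python
import json
import os
import time
import urllib.request
from typing import Any, Callable, Dict, Optional

API_BASE = "https://api.wavespeed.ai/api/v3"


def _call(method: str, url: str, body: Optional[Dict[str, Any]] = None) -> Dict[str, Any]:
    """Send one authenticated JSON request and decode the JSON response."""
    data = json.dumps(body).encode("utf-8") if body is not None else None
    request = urllib.request.Request(url, data=data, method=method, headers={
        "Content-Type": "application/json",
        "Authorization": f"Bearer {os.environ['WAVESPEED_API_KEY']}",
    })
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)


def submit(payload: Dict[str, Any]) -> str:
    """POST the inputs and return the prediction id (assumed at data.id)."""
    return _call("POST", f"{API_BASE}/midjourney/image-to-video", payload)["data"]["id"]


def fetch_result(prediction_id: str) -> Dict[str, Any]:
    """GET the current prediction record."""
    return _call("GET", f"{API_BASE}/predictions/{prediction_id}/result")


def wait_for_output(prediction_id: str,
                    fetch: Callable[[str], Dict[str, Any]] = fetch_result,
                    interval: float = 2.0, timeout: float = 300.0,
                    sleep: Callable[[float], None] = time.sleep) -> str:
    """Poll until status is "completed", then return data.outputs[0]."""
    deadline = time.monotonic() + timeout
    while True:
        data = fetch(prediction_id)["data"]
        if data["status"] == "completed":
            return data["outputs"][0]
        if data["status"] == "failed":  # assumed failure status
            raise RuntimeError(data.get("error", "prediction failed"))
        if time.monotonic() >= deadline:
            raise TimeoutError(f"prediction {prediction_id} did not finish in {timeout}s")
        sleep(interval)
```

With `WAVESPEED_API_KEY` set, `wait_for_output(submit(payload))` would return the output URL. The `fetch` and `sleep` arguments exist so the polling loop can be exercised without network access.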
Image To Video starts at $0.025 per run. That figure is the base price — the final charge scales with the parameters you set in the form (output size, length, count, references, or whatever knobs this model exposes), so a higher-quality or larger output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.
Key inputs: `prompt`, `image`, `aspect_ratio`, `resolution`, `seed`, `chaos`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/midjourney/midjourney-image-to-video.
Average end-to-end generation time on WaveSpeedAI is around 85 seconds per request — measured across recent runs. Queue time scales with global demand; live status is visible in the prediction record.
Commercial usage rights depend on the model's license, set by its provider (Midjourney). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.