Davinci Magihuman Text To Video

Playground

daVinci MagiHuman Text-to-Video API — a 15B parameter omni video generation model, the new open-source king on par with WAN 2.5. Generates high-quality AI videos from text prompts with optional audio input. Supports digital humans, talking heads, flexible aspect ratios, durations, and resolutions. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Features

DaVinci MagiHuman Text-to-Video

DaVinci MagiHuman Text-to-Video is a cinematic text-to-video model specialized in generating realistic human-centered scenes. Describe your scene in natural language, optionally provide an audio track to synchronize, and get a high-quality video with natural human motion, expressive storytelling, and cinematic atmosphere — at multiple resolution and duration options.

Why Choose This?

Human-focused generation Optimized for realistic human motion, expressions, and interactions — ideal for portrait-style and character-driven video content.
Audio input support Upload an audio track to guide the rhythm, mood, and pacing of the generated video for synchronized results.
Multiple resolution tiers Generate at 256p, 720p, or 1080p to match your delivery requirements and budget.
Flexible aspect ratio Supports 16:9 landscape and 9:16 portrait orientations for both cinematic and social media formats.
Adjustable duration Generate clips from 5 to 10 seconds with per-second granularity.
Reproducible results Use the seed parameter to lock in a specific output for exact reproduction.
Prompt Enhancer Built-in tool to automatically improve your scene descriptions for richer output.

Parameters

Parameter	Required	Description
prompt	Yes	Text description of the scene, subject, motion, camera style, and mood.
audio	No	Optional audio track to synchronize with the generated video.
aspect_ratio	No	Output aspect ratio: 16:9 (default) or 9:16.
resolution	No	Output resolution: 256p, 720p (default), or 1080p.
duration	No	Clip length in seconds. Options: 5, 6, 7, 8, 9, 10. Default: 5.
seed	No	Random seed for reproducible results. Use -1 for a random seed.

How to Use

Write your prompt — describe the subject, scene, motion, camera movement, and atmosphere. Use the Prompt Enhancer for better results.
Upload audio (optional) — provide an audio file or URL to synchronize the video to a specific track.
Select aspect ratio — 16:9 for landscape/cinematic, 9:16 for portrait/social.
Select resolution — 256p for drafts, 720p for standard output, 1080p for final production.
Set duration — choose between 5 and 10 seconds.
Set seed (optional) — fix the seed to reproduce a specific result in future runs.
Submit — generate, preview, and download your video.

Pricing

Duration	256p	720p	1080p
5s	$0.10	$0.15	$0.20
6s	$0.12	$0.18	$0.24
7s	$0.14	$0.21	$0.28
8s	$0.16	$0.24	$0.32
9s	$0.18	$0.27	$0.36
10s	$0.20	$0.30	$0.40

Billing Rules

256p: $0.02 per second
720p: $0.03 per second
1080p: $0.04 per second
Duration options: 5–10 seconds

Best Use Cases

Cinematic Storytelling — Generate atmospheric, character-driven scenes from detailed text descriptions.
Social Media Content — Produce portrait-format human-centered clips for Reels, TikTok, and Shorts.
Music Video & Audio-Visual — Synchronize generated video to a music track or voiceover for cohesive results.
Marketing & Brand Video — Quickly produce human-focused promotional content without a film crew.
Concept Visualization — Bring narrative ideas and moods to life for pitching and storyboarding.

Pro Tips

Include specific camera style references in your prompt (handheld, dolly, bokeh, cinematic color grade) for more expressive results.
Use 256p at shorter durations to rapidly test prompts before committing to a 1080p final render.
Providing an audio track significantly improves rhythm and pacing alignment in the output.
Fix the seed once you find a result you like to iterate on it consistently across resolution and duration changes.
9:16 aspect ratio works best for close-up portrait and upper-body compositions.

Notes

Only prompt is required; all other parameters are optional.
Duration is selectable in 1-second increments from 5 to 10 seconds.
Ensure audio URLs are publicly accessible if using a link rather than a direct upload.

DaVinci MagiHuman Image-to-Video — Animate a reference image into a human-centered cinematic video.

Authentication

For authentication details, please refer to the Authentication Guide.

API Endpoints

Submit Task & Query Result


# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/wavespeed-ai/davinci-magihuman/text-to-video" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
    "aspect_ratio": "16:9",
    "resolution": "720p",
    "duration": 5,
    "seed": -1
}'

# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"

Parameters

Task Submission Parameters

Request Parameters

Parameter	Type	Required	Default	Range	Description
prompt	string	Yes		-	The positive prompt for the generation.
audio	string	No	-	-	The audio URL for generating the output. If provided, the model will generate a video synchronized with the audio.
aspect_ratio	string	No	16:9	16:9, 9:16	The aspect ratio of the generated video.
resolution	string	No	720p	256p, 720p, 1080p	The resolution of the generated video.
duration	integer	No	5	5, 6, 7, 8, 9, 10	The duration of the generated video in seconds.
seed	integer	No	-1	-1 ~ 2147483647	The random seed to use for the generation. -1 means a random seed will be used.

Response Parameters

Parameter	Type	Description
code	integer	HTTP status code (e.g., 200 for success)
message	string	Status message (e.g., “success”)
data.id	string	Unique identifier for the prediction, Task Id
data.model	string	Model ID used for the prediction
data.outputs	array	Array of URLs to the generated content (empty when status is not `completed`)
data.urls	object	Object containing related API endpoints
data.urls.get	string	URL to retrieve the prediction result
data.status	string	Status of the task: `created`, `processing`, `completed`, or `failed`
data.created_at	string	ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.error	string	Error message (empty if no error occurred)
data.timings	object	Object containing timing details
data.timings.inference	integer	Inference time in milliseconds

Result Request Parameters

Parameter	Type	Required	Default	Description
id	string	Yes	-	Task ID

Result Response Parameters

Parameter	Type	Description
code	integer	HTTP status code (e.g., 200 for success)
message	string	Status message (e.g., “success”)
data	object	The prediction data object containing all details
data.id	string	Unique identifier for the prediction, the ID of the prediction to get
data.model	string	Model ID used for the prediction
data.outputs	string	Array of URLs to the generated content (empty when status is not completed).
data.urls	object	Object containing related API endpoints
data.urls.get	string	URL to retrieve the prediction result
data.status	string	Status of the task: `created`, `processing`, `completed`, or `failed`
data.created_at	string	ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.error	string	Error message (empty if no error occurred)
data.timings	object	Object containing timing details
data.timings.inference	integer	Inference time in milliseconds

Davinci Magihuman Image To Video Depth Anything Video

Davinci Magihuman Text To Video

Playground

Features

DaVinci MagiHuman Text-to-Video

Why Choose This?

Parameters

How to Use

Pricing

Billing Rules

Best Use Cases

Pro Tips

Notes

Related Models

Authentication

API Endpoints

Submit Task & Query Result

Parameters

Task Submission Parameters

Request Parameters

Response Parameters

Result Request Parameters

Result Response Parameters