
image-to-video
Idle
您的请求将花费 $0.1 每次运行。
使用 $1 您可以运行此模型大约 10 次。
还有一件事:
LTX-2.3 is a significant update to the LTX-2 model, featuring improved audio and visual quality with enhanced prompt adherence. As a DiT-based (Diffusion Transformer) audio-video foundation model, it animates your input image into a high-fidelity video with synchronized audio generated in a single pass.
Improved quality Enhanced audio and visual quality compared to LTX-2, with better prompt adherence and more coherent outputs.
Image-conditioned video with audio Transforms a static image into a moving video with synchronized audio in a single model pass.
Preserves input composition Maintains the subject, framing, and lighting of your reference image while adding natural motion.
DiT-based architecture Built on Diffusion Transformer technology for detailed, temporally consistent video generation.
Flexible resolution Supports 480p, 720p, and 1080p outputs to balance quality and cost.
Variable duration Generate clips from 5 to 20 seconds.
| Parameter | Required | Description |
|---|---|---|
| image | Yes | Reference image to animate (JPG or PNG) |
| prompt | Yes | Text description of motion, action, and audio cues |
| resolution | No | Output resolution: 480p, 720p (default), or 1080p |
| duration | No | Video length in seconds (5-20) |
| seed | No | Random seed for reproducibility (-1 for random) |
| Resolution | Best For |
|---|---|
| 480p | Fast previews, iteration, lowest cost |
| 720p | Balanced quality and cost (default) |
| 1080p | Final delivery, maximum detail |
| Resolution | 5s | 10s | 15s | 20s |
|---|---|---|---|---|
| 480p | $0.10 | $0.20 | $0.30 | $0.40 |
| 720p | $0.15 | $0.30 | $0.45 | $0.60 |
| 1080p | $0.20 | $0.40 | $0.60 | $0.80 |