
lora-support
Idle
您的請求將花費 $0.1 每次運行。
使用 $1 您可以運行此模型大約 10 次。
LTX-2.3 is a significant update to the LTX-2 model, featuring improved audio and visual quality with enhanced prompt adherence. As a DiT-based (Diffusion Transformer) audio-video foundation model, it generates synchronized video and audio from text prompts in a single pass, bringing together the core building blocks of modern video generation with open weights and practical execution.
Improved quality Enhanced audio and visual quality compared to LTX-2, with better prompt adherence and more coherent outputs.
Synchronized audio-video Generates video with matching audio in a single model pass, no separate audio production needed.
DiT-based architecture Built on Diffusion Transformer technology for high-fidelity, temporally consistent video generation.
Flexible resolution Supports 480p, 720p, and 1080p outputs to balance quality and cost.
Variable duration Generate clips from 5 to 20 seconds.
| Parameter | Required | Description |
|---|---|---|
| loras | No | List of LoRA models to apply (max 3, each with path and scale) |
| prompt | Yes | Text description of the video scene, motion, and audio |
| resolution | No | Output resolution: 480p, 720p (default), or 1080p |
| duration | No | Video length in seconds (5-20) |
| seed | No | Random seed for reproducibility (-1 for random) |
| Resolution | Best For |
|---|---|
| 480p | Fast previews, iteration, lowest cost |
| 720p | Balanced quality and cost (default) |
| 1080p | Final delivery, maximum detail |
| Resolution | 5s | 10s | 15s | 20s |
|---|---|---|---|---|
| 480p | $0.15 | $0.30 | $0.45 | $0.60 |
| 720p | $0.20 | $0.40 | $0.60 | $0.80 |
| 1080p | $0.25 | $0.50 | $0.75 | $1.00 |