LTX-2.3 is a DiT-based audio-video foundation model that generates synchronized video and audio within a single model, with improved audio and visual quality and stronger prompt adherence. It is available through a ready-to-use REST inference API with no cold starts, strong performance, and affordable pricing.
$0.50 per run · ~20 runs per $10
LTX-2.3 Video Extend seamlessly extends existing videos by generating additional frames that naturally continue your content. Upload a video clip and specify how many seconds to add — the model generates smooth, coherent footage that matches the original motion, style, and atmosphere.
- Seamless extension: Generates new frames that naturally continue the motion and style of your original video.
- Flexible duration: Extend videos by 1 to 20 seconds based on your needs.
- Prompt guidance: Optional prompts describe how the video should continue.
- LTX-2.3 quality: Built on the improved LTX-2.3 architecture for better temporal consistency.
- Simple pricing: Straightforward per-second billing at $0.10/second.
- Prompt Enhancer: Built-in tool that automatically improves your continuation descriptions.

| Parameter | Required | Description |
|---|---|---|
| video | Yes | Source video to extend (URL or upload) |
| duration | No | Extension length in seconds (1–20, default: 6) |
| prompt | No | Describe how the video should continue |
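
The parameter table above can be turned into a small client-side check before a request is sent. The sketch below is illustrative only: the function name `build_extend_payload`, the JSON-style dictionary shape, and the restriction to whole seconds are assumptions, not part of the documented API. Only the field names, the 1 to 20 second range, and the default of 6 come from the table.

```python
from typing import Optional

# Limits and default taken from the parameter table.
MIN_DURATION = 1
MAX_DURATION = 20
DEFAULT_DURATION = 6


def build_extend_payload(
    video: str,
    duration: int = DEFAULT_DURATION,
    prompt: Optional[str] = None,
) -> dict:
    """Validate inputs and return a request body (hypothetical shape)."""
    if not video:
        raise ValueError("video is required (URL of the source clip)")
    # Assumption: durations are whole seconds within the documented range.
    if not MIN_DURATION <= duration <= MAX_DURATION:
        raise ValueError(
            f"duration must be between {MIN_DURATION} and {MAX_DURATION} seconds"
        )
    payload = {"video": video, "duration": duration}
    if prompt:
        # prompt is optional, so it is only included when provided.
        payload["prompt"] = prompt
    return payload


if __name__ == "__main__":
    print(build_extend_payload("https://example.com/clip.mp4", 10, "The camera pans left"))
```

Validating locally avoids paying for, or waiting on, a request that the service would reject for an out-of-range duration.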

Cost by extension length, billed at $0.10 per second:

| Duration | Cost |
|---|---|
| 1 s | $0.10 |
| 5 s | $0.50 |
| 10 s | $1.00 |
| 20 s | $2.00 |
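
The pricing table follows directly from the $0.10 per second rate. As a minimal sketch, the helper below reproduces those rows; the function name `extension_cost` is hypothetical, and `Decimal` is used only to avoid floating-point rounding in currency values.

```python
from decimal import Decimal

# Rate and duration limits as stated in the pricing and parameter tables.
PRICE_PER_SECOND = Decimal("0.10")
MIN_DURATION = 1
MAX_DURATION = 20


def extension_cost(duration_seconds: int) -> Decimal:
    """Return the cost in USD of extending a video by the given number of seconds."""
    if not MIN_DURATION <= duration_seconds <= MAX_DURATION:
        raise ValueError("duration must be between 1 and 20 seconds")
    return PRICE_PER_SECOND * duration_seconds


if __name__ == "__main__":
    for seconds in (1, 5, 10, 20):
        print(f"{seconds} s costs ${extension_cost(seconds):.2f}")
```

For durations not listed in the table, the same multiplication applies; for example, the default 6 second extension comes to $0.60.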