LTX-2.3
by Lightricks · Mar 2026
The open-source flagship. First open model to close the gap with proprietary leaders.
Specifications
Max Resolution: 4K
Max Duration: 20s
FPS: Up to 50
Native Audio: Yes
ComfyUI Support: Yes
Fine-tunable: Yes
Min VRAM: 32GB (full) / 12GB (distilled)
Cost / Second: $0.04
Architecture: DiT (22B) + Rebuilt VAE
Parameters: 22B
Inputs: T2V, I2V, V2V, Audio-cond
License: Apache 2.0 (<$10M rev)
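The per-second price and the maximum duration listed above make it easy to estimate what a clip costs. The sketch below is only an illustration: it assumes the $0.04 rate is flat per second of output regardless of resolution or FPS, which the spec sheet does not state, and `estimate_cost` is a hypothetical helper name.

```python
from decimal import Decimal

# Figures taken from the spec list above. Treating the rate as flat per
# second of output is an assumption of this sketch.
COST_PER_SECOND_USD = Decimal("0.04")
MAX_DURATION_S = 20


def estimate_cost(duration_s: int) -> Decimal:
    """Return the estimated cost in USD for a clip of the given length."""
    if not 0 < duration_s <= MAX_DURATION_S:
        raise ValueError(f"duration must be between 1 and {MAX_DURATION_S} seconds")
    return COST_PER_SECOND_USD * duration_s


print(estimate_cost(20))  # 0.80
```

Under that assumption, a maximum-length 20-second clip comes to $0.80.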
Strengths & Trade-offs
Strengths
- 22B params
- true 4K at 50fps
- first open model with synced audio
- rebuilt VAE
- native portrait
- 4 variants
Weaknesses
- Full 4K needs 48GB of VRAM
- dialogue lip-sync is inconsistent
- in-scene text rendering is unreliable
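The memory figures on this page (12GB for the distilled variant, 32GB for the full model, 48GB for full 4K) can be turned into a rough hardware check. This is a sketch under assumptions: it treats the 48GB figure as a VRAM requirement, it does not know whether the distilled variant can produce 4K, and `pick_variant` is a hypothetical helper, not part of any LTX tooling.

```python
from typing import Optional

# Thresholds quoted on this page.
MIN_VRAM_DISTILLED_GB = 12
MIN_VRAM_FULL_GB = 32
MIN_VRAM_FULL_4K_GB = 48  # assumed here to be a VRAM figure


def pick_variant(vram_gb: int, want_4k: bool = False) -> Optional[str]:
    """Suggest a variant for the available VRAM, or None if nothing fits."""
    if want_4k:
        # Only the full model at 48GB is documented for 4K on this page.
        return "full" if vram_gb >= MIN_VRAM_FULL_4K_GB else None
    if vram_gb >= MIN_VRAM_FULL_GB:
        return "full"
    if vram_gb >= MIN_VRAM_DISTILLED_GB:
        return "distilled"
    return None


print(pick_variant(24))                # distilled
print(pick_variant(32, want_4k=True))  # None
```

Returning `None` for 4K below 48GB is a conservative choice: the page gives no figure for 4K on smaller cards.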
Best For
- Local 4K production
- open-source audio-video
- studios needing IP control
Workflows on Floyo
LTX 2.3 Pro Image to Video
Upload a still image and describe the motion you want. The model reads composition, lighting, and depth from your image, then animates it with prompt-controlled camera moves, particle effects, and environmental dynamics. Supports an optional end frame for locked start-to-finish transitions. Outputs up to 2160p with built-in audio generation.
LTX 2.3 Audio to Video
Feed in an audio file and the model generates video that follows the rhythm, intensity, and structure of the sound. Works with music, speech, or sound effects. Fully automated pipeline with no manual parameter tuning required. Ideal for music visuals, audio-reactive content, and quick audio-driven animations.
LTX 2.3 Pro Text to Video
Generate video from a text prompt using the Pro flow. Higher fidelity output with enhanced detail and stability across longer sequences. Supports resolutions up to 4K, multiple FPS options (24/25/48/50), and durations up to 20 seconds. Built-in audio generation included.
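The FPS options and duration cap above determine how many frames a request asks for. The sketch below checks a request against those listed limits and computes a nominal frame count as FPS times duration. That formula is an assumption: the model may round or constrain frame counts in ways this page does not describe, and `frame_count` is a hypothetical helper.

```python
# Limits quoted in the workflow description above.
ALLOWED_FPS = (24, 25, 48, 50)
MAX_DURATION_S = 20


def frame_count(fps: int, duration_s: float) -> int:
    """Validate a request and return its nominal frame count."""
    if fps not in ALLOWED_FPS:
        raise ValueError(f"fps must be one of {ALLOWED_FPS}")
    if not 0 < duration_s <= MAX_DURATION_S:
        raise ValueError(f"duration must be in (0, {MAX_DURATION_S}] seconds")
    # Nominal count only; the real pipeline may adjust it.
    return round(fps * duration_s)


print(frame_count(50, 20))  # 1000
```

At the top settings, a 20-second clip at 50 FPS works out to a nominal 1000 frames.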
LTX 2.3 T2V (Community)
Community-built text-to-video workflow using LTX 2.3. Lightweight setup for quickly generating video from a text prompt.
Compare with other models