LTX API Pricing
Usage-based pricing by endpoint and output quality.
Text-to-Video Fast
Designed for quick iteration, previews, and fast creative exploration.
- Same pricing applies for text input and pure prompt-based generation.
Text-to-Video Pro
Optimized for higher fidelity and increased temporal stability. Best for production-ready output and final renders.
- Ideal for client-facing content or polished deliverables.
- Higher compute level → higher visual quality.
Image-to-Video Fast
Designed for quick iteration, previews, and fast creative exploration.
- Same compute cost as Text-to-Video Fast.
- Resolution and duration determine total cost.
Image-to-Video Pro
For detailed, stable motion derived from a still image. Best for high-quality sequences, storytelling, and production use.
- Uses the Pro rendering path for maximum fidelity.
- Ideal when visual consistency is critical.
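Because resolution and duration determine total cost for the generation endpoints, a rough budget can be worked out before sending a request. The sketch below assumes a flat per-second rate for each tier and resolution; that billing shape, the function name `estimate_generation_cost`, and every number in `EXAMPLE_RATES` are illustrative assumptions, not published LTX prices. Rates are keyed by tier only, since Image-to-Video Fast is listed at the same compute cost as Text-to-Video Fast.

```python
from decimal import Decimal

# Hypothetical per-second rates in USD, keyed by (tier, resolution).
# Placeholder values only: replace them with the rates on the pricing page.
EXAMPLE_RATES = {
    ("fast", "1080p"): Decimal("0.01"),
    ("pro", "1080p"): Decimal("0.02"),
}


def estimate_generation_cost(tier, resolution, duration_seconds, rates=EXAMPLE_RATES):
    """Estimate cost as (assumed per-second rate) x (output duration)."""
    if duration_seconds <= 0:
        raise ValueError("duration_seconds must be positive")
    try:
        rate = rates[(tier, resolution)]
    except KeyError:
        raise ValueError(f"no rate configured for {tier} at {resolution}") from None
    # Convert through str so float durations do not carry binary rounding noise.
    return rate * Decimal(str(duration_seconds))


if __name__ == "__main__":
    # Compare a quick preview against a final render of the same length.
    for tier in ("fast", "pro"):
        print(tier, estimate_generation_cost(tier, "1080p", 8))
```

Using `Decimal` rather than floats keeps the estimate exact to the cent, which matters when many short clips are summed.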
Retake - Video Editing
Refine only the parts that need adjustment, without regenerating the whole video. Perfect for fixing scenes, adjusting elements, or improving localized areas.
- Currently available in 1080p only.
- Billed per second of input video.
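Since Retake is billed per second of input video, the charge appears to follow the length of the clip that is submitted rather than the length of the region being changed. A minimal sketch of that arithmetic follows; the function name `estimate_retake_cost` and the rate passed in are hypothetical, and the reading that the whole input clip is billed is an assumption drawn from the bullet above.

```python
from decimal import Decimal


def estimate_retake_cost(input_video_seconds, rate_per_second):
    """Estimate one Retake request: input clip length x assumed per-second rate."""
    if input_video_seconds <= 0:
        raise ValueError("input_video_seconds must be positive")
    # rate_per_second is a placeholder supplied by the caller, not a published price.
    return Decimal(str(input_video_seconds)) * Decimal(str(rate_per_second))


if __name__ == "__main__":
    # A 12-second clip at a made-up rate of 0.05 per second.
    print(estimate_retake_cost(12, "0.05"))
```

Under this reading, trimming a clip to the shot that needs fixing before submitting it would lower the cost of the request.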
Audio to Video (A2V)
Generate video directly from audio, with voice, music, and sound defining structure, pacing, and motion.
- Audio: WAV, MP3, M4A, OGG
- Image (optional): PNG, JPEG, WEBP
- Billed per second of input audio.
- Generates up to ~20 seconds per request.
- Full-length videos can be created by chaining multiple requests.
- Currently available in 1080p only.
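Chaining requests for a longer track mostly comes down to cutting the audio into windows that each fit within the per-request limit. The sketch below plans those windows from a track's duration alone. It treats 20 seconds as a hard cap, which is an assumption (the limit is stated only as "~20 seconds"), and `plan_segments` is a hypothetical helper, not part of the LTX API. Slicing the audio file and submitting each window are left out.

```python
import math

# Assumed hard cap per request; the stated limit is approximate ("~20 seconds").
MAX_SEGMENT_SECONDS = 20.0


def plan_segments(total_seconds, max_segment=MAX_SEGMENT_SECONDS):
    """Split a track into equal (start, end) windows, each at most max_segment long."""
    if total_seconds <= 0:
        raise ValueError("total_seconds must be positive")
    if max_segment <= 0:
        raise ValueError("max_segment must be positive")
    count = math.ceil(total_seconds / max_segment)
    length = total_seconds / count
    segments = []
    for i in range(count):
        start = i * length
        # Pin the final window to the exact track end to avoid float drift.
        end = total_seconds if i == count - 1 else (i + 1) * length
        segments.append((start, end))
    return segments


if __name__ == "__main__":
    # A 45-second track becomes three 15-second requests.
    for start, end in plan_segments(45):
        print(f"{start:.1f}s -> {end:.1f}s")
```

Equal-length windows avoid a very short final request. Because billing is per second of input audio, the windows add up to the same billed duration as the full track however it is split.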