LTX API Pricing
Usage-based pricing by endpoint and output quality.
Text-to-Video
Designed for quick iteration, previews, and fast creative exploration.
- Same pricing applies for text input and pure prompt-based generation.
Text-to-Video
Optimized for higher fidelity and increased temporal stability. Best for production-ready output and final renders.
- Deal for client-facing content or polished deliverables.
- Higher compute level → higher visual quality.
Text-to-Video
Optimized for higher fidelity and increased temporal stability. Best for production-ready output and final renders.
- Deal for client-facing content or polished deliverables.
- Higher compute level → higher visual quality.
Text-to-Video
Designed for quick iteration, previews, and fast creative exploration.
- Same pricing applies for text input and pure prompt-based generation.
Image-to-Video
Designed for quick iteration, previews, and fast creative exploration.
- Same compute cost as Text-to-Video Fast.
- Resolution and duration determine total cost.
Image-to-Video
For detailed, stable motion derived from a still image. Best for high-quality sequences, storytelling, and production use.
- Uses the Pro rendering path for maximum fidelity.
- Ideal when visual consistency is critical.
Image-to-Video
For detailed, stable motion derived from a still image. Best for high-quality sequences, storytelling, and production use.
- Uses the Pro rendering path for maximum fidelity.
- Ideal when visual consistency is critical.
Image-to-Video
Designed for quick iteration, previews, and fast creative exploration.
- Same compute cost as Text-to-Video Fast.
- Resolution and duration determine total cost.
Retake - Video Editing
Refine only the parts that need adjustment - no need to regenerate the whole video. Perfect for fixing scenes, adjusting elements, or improving localized areas.
- Currently available in 1080p only.
- Billed per second of input video.
Retake - Video Editing
Refine only the parts that need adjustment - no need to regenerate the whole video. Perfect for fixing scenes, adjusting elements, or improving localized areas.
- Currently available in 1080p only.
- Billed per second of input video.
Audio to Video (A2V)
Generate video directly from audio — where voice, music, and sound define structure, pacing, and motion.
- Audio: WAV, MP3, M4A, OGG
- Image (optional): PNG, JPEG, WEBP
- Billed per second of input audio.
- Generates up to ~20 seconds per request.
- Full-length videos can be created by chaining multiple requests.
- Currently available in 1080p only.
Audio to Video (A2V)
Generate video directly from audio — where voice, music, and sound define structure, pacing, and motion.
- Audio: WAV, MP3, M4A, OGG
- Image (optional): PNG, JPEG, WEBP
- Billed per second of input audio.
- Generates up to ~20 seconds per request.
- Full-length videos can be created by chaining multiple requests.
- Currently available in 1080p only.
Beta - HDR Video Generation
Convert SDR video to 16-bit HDR for greater dynamic range and post-production flexibility — built for professional grading and finishing workflows.
- Up to 1920×1080 — $0.20/sec
(~7s max per request) - Up to 2560×1440 — $0.40/sec
(~4s max per request)
- Video-to-video only (SDR → HDR)
- Output delivered as per-frame 16-bit EXR (ZIP)
- Billed per second of input video
- Max duration depends on resolution tier (up to ~7s at 1080p)