AI Video to Video model

Transform existing footage with precise control over structure, motion, and continuity. Reimagine scenes while preserving lighting, camera behavior, and temporal flow.

//

Key Capabilities

  • Edit without starting over

    Modify existing footage with surgical precision. Use Retake to rewrite a specific segment with a new prompt, or Extend Scene to lengthen existing clips while preserving motion and continuity
  • Restore and enhance video quality

    Improve output quality from footage you already have. Detail Upscaling enhances resolution and recovers fine detail without regenerating the clip.
  • Precise control over motion, structure, and camera

    Direct every element of the transformation with intention, not guesswork. From body pose-driven motion and depth-aware spatial structure to explicit camera behavior like static, dolly, and l

AI video retakes & revisions

Regenerate specific moments, fix dialogue delivery, adjust emotion, or change action without re-rendering the entire video. Define a start time and duration, update the prompt, and only the targeted segment is regenerated while surrounding content stays intact.

Footage restyling for brand & campaign

You have existing footage but need a different visual treatment, a new color palette, a cinematic look, or an animated aesthetic. Run it through the video-to-video pipeline to reinterpret the style while keeping the original motion and composition intact.

Guided generation from reference video

Start with a rough cut, a storyboard animatic, or a body-motion reference and use IC-LoRA conditioning (depth, pose, edge) to generate a polished video that follows the same spatial layout and movement without building it from scratch.

Rapid iteration on AI-generated content

Generate a full video and test variations, different actions, alternate dialogue, adjusted pacing. Combine Retake for segment-level changes with full-clip transformations to iterate fast without regenerating everything from zero.

How the Video to Video model works

Input:

  • Video (required): MP4, MOV, or MKV
  • Supported codecs: H.264, H.265
  • Maximum resolution: 3840Γ—2160 (4K)
  • Maximum duration: ~21 seconds

Output:

  • Edited MP4 video
  • Segment-based editing regenerates only the selected range, surrounding frames remain unchanged.
  • Full-scene mode reinterprets the entire clip while preserving structural motion dynamics.

How the Video to Video model works

Input

Provide a source video clip and define what to change: a segment, the full scene, or just the audio. A text prompt guides the transformation.

Technical characteristics:

  • Video (required): MP4, MOV, or MKV
  • Supported codecs: H.264, H.265
  • Maximum resolution: 3840Γ—2160 (4K)
  • Maximum duration: ~21 seconds

Output

Receive an edited MP4 with only the targeted segment regenerated, or the full clip reinterpreted while preserving structural motion dynamics.

Technical characteristics:

  • Edited MP4 video
  • Segment-based editing regenerates only the selected range, surrounding frames remain unchanged.
  • Full-scene mode reinterprets the entire clip while preserving structural motion dynamics.

Retake - Video Editing

LTX-2
Pro

Refine only the parts that need adjustment - no need to regenerate the whole video. Perfect for fixing scenes, adjusting elements, or improving localized areas.

URL path:
/v1/retake
Pricing:
  • 1920Γ—1080 β€” $0.10/sec
Notes:
  • Currently available in 1080p only.
  • Billed per second of input video.

Retake - Video Editing

LTX-2.3
Pro

Refine only the parts that need adjustment - no need to regenerate the whole video. Perfect for fixing scenes, adjusting elements, or improving localized areas.

URL path:
/v1/retake
Pricing:
  • 1920Γ—1080 β€” $0.10/sec
Notes:
  • Currently available in 1080p only.
  • Billed per second of input video.