AI Video to Video model

Transform existing footage with precise control over structure, motion, and continuity. Reimagine scenes while preserving lighting, camera behavior, and temporal flow.

//

Key Capabilities

  • Edit without starting over

    Modify existing footage with surgical precision. Use Retake to rewrite a specific segment with a new prompt, or Extend Scene to lengthen existing clips while preserving motion and continuity
  • Restore and enhance video quality

    Improve output quality from footage you already have. Detail Upscaling enhances resolution and recovers fine detail without regenerating the clip.
  • Precise control over motion, structure, and camera

    Direct every element of the transformation with intention, not guesswork. From body pose-driven motion and depth-aware spatial structure to explicit camera behavior like static, dolly, and l

AI video retakes & revisions

Regenerate specific moments, fix dialogue delivery, adjust emotion, or change action without re-rendering the entire video. Define a start time and duration, update the prompt, and only the targeted segment is regenerated while surrounding content stays intact.

Footage restyling for brand & campaign

You have existing footage but need a different visual treatment, a new color palette, a cinematic look, or an animated aesthetic. Run it through the video-to-video pipeline to reinterpret the style while keeping the original motion and composition intact.

Guided generation from reference video

Start with a rough cut, a storyboard animatic, or a body-motion reference and use IC-LoRA conditioning (depth, pose, edge) to generate a polished video that follows the same spatial layout and movement without building it from scratch.

Rapid iteration on AI-generated content

Generate a full video and test variations, different actions, alternate dialogue, adjusted pacing. Combine Retake for segment-level changes with full-clip transformations to iterate fast without regenerating everything from zero.

How the Video to Video model works

Input:

  • Video (required): MP4, MOV, or MKV
  • Supported codecs: H.264, H.265
  • Maximum resolution: 3840Γ—2160 (4K)
  • Maximum duration: ~21 seconds

Output:

  • Edited MP4 video
  • Segment-based editing regenerates only the selected range, surrounding frames remain unchanged.
  • Full-scene mode reinterprets the entire clip while preserving structural motion dynamics.

Designed for real-world deployment

A production-ready video-to-video AI model for teams building scalable, controllable video generation workflows.

Builders

Product teams, AI startups, and developers building AI-powered video features. Add production-grade video generation as a product capability, not a research project. One API, production-ready results, and no custom orchestration.

Producers at scale

Brands, agencies, and creative teams producing high volumes of content. Turn existing assets into video at scale. Faster iteration, lower production cost, and more output from what you already have.

On-prem operators

Teams that require full control over deployment and data. Run video generation in your own environment. On-premises, no cloud dependency, and full infrastructure ownership.

Platform teams

Platforms powering creative tools with multiple AI models. Upgrade your video output with a best-in-class engine. Improve generation quality, retain users, and differentiate with a model built for production, not prototypes.

How the Video to Video model works

Input

Provide a source video clip and define what to change: a segment, the full scene, or just the audio. A text prompt guides the transformation.

Technical characteristics:

  • Video (required): MP4, MOV, or MKV
  • Supported codecs: H.264, H.265
  • Maximum resolution: 3840Γ—2160 (4K)
  • Maximum duration: ~21 seconds

Output

Receive an edited MP4 with only the targeted segment regenerated, or the full clip reinterpreted while preserving structural motion dynamics.

Technical characteristics:

  • Edited MP4 video
  • Segment-based editing regenerates only the selected range, surrounding frames remain unchanged.
  • Full-scene mode reinterprets the entire clip while preserving structural motion dynamics.

Retake - Video Editing

LTX-2
Pro

Refine only the parts that need adjustment - no need to regenerate the whole video. Perfect for fixing scenes, adjusting elements, or improving localized areas.

URL path:
/v1/retake
Pricing:
  • 1920Γ—1080 β€” $0.10/sec
Notes:
  • Currently available in 1080p only.
  • Billed per second of input video.

Retake - Video Editing

LTX-2.3
Pro

Refine only the parts that need adjustment - no need to regenerate the whole video. Perfect for fixing scenes, adjusting elements, or improving localized areas.

URL path:
/v1/retake
Pricing:
  • 1920Γ—1080 β€” $0.10/sec
Notes:
  • Currently available in 1080p only.
  • Billed per second of input video.