lightricks/ltx-video

160.4K runs • Nov 2024 • Cog 0.13.0 • GitHub • License
image-to-video real-time text-to-video

About

LTX-Video is the first DiT-based (Diffusion Transformer) video generation model capable of generating high-quality videos in real time. It produces 24 FPS videos at 768x512 resolution faster than they can be watched.

Example Output

Prompt:

"A woman with long brown hair and light skin smiles at another woman with long blonde hair. The woman with brown hair wears a black jacket and has a small, barely noticeable mole on her right cheek. The camera angle is a close-up, focused on the woman with brown hair's face. The lighting is warm and natural, likely from the setting sun, casting a soft glow on the scene. The scene appears to be real-life footage."

Output

Performance Metrics

11.80s Prediction Time
11.81s Total Time
All Input Parameters
{
  "cfg": 3,
  "steps": 30,
  "length": 97,
  "prompt": "A woman with long brown hair and light skin smiles at another woman with long blonde hair. The woman with brown hair wears a black jacket and has a small, barely noticeable mole on her right cheek. The camera angle is a close-up, focused on the woman with brown hair's face. The lighting is warm and natural, likely from the setting sun, casting a soft glow on the scene. The scene appears to be real-life footage.",
  "target_size": 640,
  "aspect_ratio": "16:9",
  "negative_prompt": "low quality, worst quality, deformed, distorted, watermark"
}
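
The payload above can also be submitted from code. The following is a minimal sketch, assuming the `replicate` Python client is installed and `REPLICATE_API_TOKEN` is set in the environment. `build_input` and `run_ltx_video` are hypothetical helper names, and the model reference joins the model name with the version ID listed under Version Details.

```python
# Sketch only: the helper names are illustrative, not part of the model's API.
MODEL_REF = (
    "lightricks/ltx-video:"
    "8c47da666861d081eeb4d1261853087de23923a268a69b63febdf5dc1dee08e4"
)

NEGATIVE_PROMPT = "low quality, worst quality, deformed, distorted, watermark"


def build_input(prompt, seed=None):
    """Return the example payload from this page, with an optional fixed seed."""
    payload = {
        "cfg": 3,
        "steps": 30,
        "length": 97,
        "prompt": prompt,
        "target_size": 640,
        "aspect_ratio": "16:9",
        "negative_prompt": NEGATIVE_PROMPT,
    }
    if seed is not None:
        payload["seed"] = seed  # omit the key to keep the random default
    return payload


def run_ltx_video(prompt, seed=None):
    """Submit one prediction; needs network access and an API token."""
    import replicate  # imported lazily so build_input works without the client

    return replicate.run(MODEL_REF, input=build_input(prompt, seed))
```

Calling `run_ltx_video("...long descriptive prompt...", seed=2624027272)` should return the output array described under Output Schema. Depending on the client version, each item may be a URL string or a file-like object.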
Input Parameters
cfg Type: number • Default: 3 • Range: 1 - 20
How strongly the video follows the prompt
seed Type: integer
Set a seed for reproducibility. Random by default.
image Type: string
Optional input image to use as the starting frame
model Default: 0.9.1
Model version to use
steps Type: integer • Default: 30 • Range: 1 - 50
Number of sampling steps
length Default: 97
Length of the output video in frames
prompt Type: string • Default: best quality, 4k, HDR, a tracking shot of a beautiful scene
Text prompt for the video. This model needs long, descriptive prompts; if the prompt is too short, the quality won't be good.
target_size Default: 640
Target size for the output video
aspect_ratio Default: 3:2
Aspect ratio of the output video. Ignored if an image is provided.
negative_prompt Type: string • Default: low quality, worst quality, deformed, distorted
Things you do not want to see in your video
image_noise_scale Type: number • Default: 0.15 • Range: 0 - 1
Lower numbers stick more closely to the input image
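
The documented ranges can be checked locally before a prediction is submitted. This is a minimal sketch: `validate_inputs` is a hypothetical helper, the ranges are copied from the list above, and treating both bounds as inclusive is an assumption. Values are expected to be numeric.

```python
# Ranges copied from the parameter list; inclusive bounds are an assumption.
RANGES = {
    "cfg": (1, 20),
    "steps": (1, 50),
    "image_noise_scale": (0, 1),
}

# Parameters documented as "Type: integer".
INTEGER_PARAMS = ("steps", "seed")


def validate_inputs(inputs):
    """Return a list of problems; an empty list means the checks passed."""
    problems = []
    for name, (low, high) in RANGES.items():
        if name in inputs and not (low <= inputs[name] <= high):
            problems.append(f"{name}={inputs[name]} is outside {low} - {high}")
    for name in INTEGER_PARAMS:
        if name in inputs and not isinstance(inputs[name], int):
            problems.append(f"{name} must be an integer")
    return problems


print(validate_inputs({"cfg": 3, "steps": 30}))  # []
print(validate_inputs({"cfg": 25}))  # ['cfg=25 is outside 1 - 20']
```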
Output Schema

Output

Type: array • Items Type: string • Items Format: uri
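
Because the output is an array of URI strings, each item can be saved with the standard library alone. The sketch below assumes the items are plain HTTP(S) URLs; `filename_from_uri` and `download_outputs` are hypothetical helpers, and the `example.com` address is a placeholder.

```python
import os
import posixpath
import urllib.request
from urllib.parse import urlparse


def filename_from_uri(uri, fallback="output.mp4"):
    """Use the last path segment of the URI as the local file name."""
    name = posixpath.basename(urlparse(uri).path)
    return name or fallback


def download_outputs(uris, directory="."):
    """Download every URI in the output array and return the local paths."""
    paths = []
    for uri in uris:
        path = os.path.join(directory, filename_from_uri(uri))
        urllib.request.urlretrieve(uri, path)  # performs a network request
        paths.append(path)
    return paths


# Placeholder URI, used only to show how the file name is derived.
print(filename_from_uri("https://example.com/out/R8_LTX_00001.mp4?x=1"))
```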

Example Execution Logs
Random seed set to: 2624027272
Checking inputs
====================================
Running workflow
[ComfyUI] got prompt
Executing node 85, title: Width and height from aspect ratio 🪴, class type: Width and height from aspect ratio 🪴
Executing node 84, title: EmptyLTXVLatentVideo, class type: EmptyLTXVLatentVideo
Executing node 71, title: LTXVScheduler, class type: LTXVScheduler
Executing node 72, title: SamplerCustom, class type: SamplerCustom
[ComfyUI]
[ComfyUI] 0%|          | 0/30 [00:00<?, ?it/s]
[ComfyUI] 3%|▎         | 1/30 [00:00<00:04,  6.37it/s]
[ComfyUI] 7%|▋         | 2/30 [00:00<00:07,  3.95it/s]
[ComfyUI] 10%|█         | 3/30 [00:00<00:07,  3.52it/s]
[ComfyUI] 13%|█▎        | 4/30 [00:01<00:07,  3.34it/s]
[ComfyUI] 17%|█▋        | 5/30 [00:01<00:07,  3.25it/s]
[ComfyUI] 20%|██        | 6/30 [00:01<00:07,  3.19it/s]
[ComfyUI] 23%|██▎       | 7/30 [00:02<00:07,  3.16it/s]
[ComfyUI] 27%|██▋       | 8/30 [00:02<00:07,  3.14it/s]
[ComfyUI] 30%|███       | 9/30 [00:02<00:06,  3.12it/s]
[ComfyUI] 33%|███▎      | 10/30 [00:03<00:06,  3.12it/s]
[ComfyUI] 37%|███▋      | 11/30 [00:03<00:06,  3.11it/s]
[ComfyUI] 40%|████      | 12/30 [00:03<00:05,  3.10it/s]
[ComfyUI] 43%|████▎     | 13/30 [00:04<00:05,  3.10it/s]
[ComfyUI] 47%|████▋     | 14/30 [00:04<00:05,  3.10it/s]
[ComfyUI] 50%|█████     | 15/30 [00:04<00:04,  3.10it/s]
[ComfyUI] 53%|█████▎    | 16/30 [00:05<00:04,  3.10it/s]
[ComfyUI] 57%|█████▋    | 17/30 [00:05<00:04,  3.09it/s]
[ComfyUI] 60%|██████    | 18/30 [00:05<00:03,  3.09it/s]
[ComfyUI] 63%|██████▎   | 19/30 [00:05<00:03,  3.09it/s]
[ComfyUI] 67%|██████▋   | 20/30 [00:06<00:03,  3.09it/s]
[ComfyUI] 70%|███████   | 21/30 [00:06<00:02,  3.09it/s]
[ComfyUI] 73%|███████▎  | 22/30 [00:06<00:02,  3.09it/s]
[ComfyUI] 77%|███████▋  | 23/30 [00:07<00:02,  3.09it/s]
[ComfyUI] 80%|████████  | 24/30 [00:07<00:01,  3.09it/s]
[ComfyUI] 83%|████████▎ | 25/30 [00:07<00:01,  3.09it/s]
[ComfyUI] 87%|████████▋ | 26/30 [00:08<00:01,  3.09it/s]
[ComfyUI] 90%|█████████ | 27/30 [00:08<00:00,  3.09it/s]
[ComfyUI] 93%|█████████▎| 28/30 [00:08<00:00,  3.09it/s]
[ComfyUI] 97%|█████████▋| 29/30 [00:09<00:00,  3.09it/s]
[ComfyUI] 100%|██████████| 30/30 [00:09<00:00,  3.09it/s]
Executing node 8, title: VAE Decode, class type: VAEDecode
Executing node 79, title: Video Combine 🎥🅥🅗🅢, class type: VHS_VideoCombine
[ComfyUI] 100%|██████████| 30/30 [00:09<00:00,  3.15it/s]
[ComfyUI] Prompt executed in 11.64 seconds
outputs:  {'79': {'gifs': [{'filename': 'R8_LTX_00001.mp4', 'subfolder': '', 'type': 'output', 'format': 'video/h264-mp4', 'frame_rate': 25.0}]}}
====================================
R8_LTX_00001.png
R8_LTX_00001.mp4
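
The `outputs:` line in the log above is a Python dict literal, so it can be parsed to recover the file name and frame rate. The sketch below is illustrative only: `parse_outputs` and `clip_seconds` are hypothetical helpers. It combines the logged frame rate (25.0) with the `length` of 97 frames from the example input to estimate the clip duration.

```python
import ast

# The "outputs:" line copied from the execution log above.
LOG_LINE = (
    "outputs:  {'79': {'gifs': [{'filename': 'R8_LTX_00001.mp4', "
    "'subfolder': '', 'type': 'output', 'format': 'video/h264-mp4', "
    "'frame_rate': 25.0}]}}"
)


def parse_outputs(line):
    """Parse the dict literal that follows the 'outputs:' prefix."""
    _, _, payload = line.partition("outputs:")
    return ast.literal_eval(payload.strip())


def clip_seconds(num_frames, frame_rate):
    """Duration of a clip with num_frames frames at frame_rate frames/second."""
    return num_frames / frame_rate


video = parse_outputs(LOG_LINE)["79"]["gifs"][0]
duration = clip_seconds(97, video["frame_rate"])
print(video["filename"], f"{duration:.2f} s")  # R8_LTX_00001.mp4 3.88 s
```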
Version Details
Version ID
8c47da666861d081eeb4d1261853087de23923a268a69b63febdf5dc1dee08e4
Version Created
January 6, 2025