cjwbw/tokenflow 🔢🖼️📝 → 🖼️
About
Consistent Diffusion Features for Consistent Video Editing

Example Output
Output
Performance Metrics
215.74s
Prediction Time
217.04s
Total Time
All Input Parameters
{ "fps": 20, "video": "https://replicate.delivery/pbxt/KGcnUKDlfys2uAo8eHbWSE8Pzc17hI5IOGdstViaBvfPJWHY/wolf.mp4", "width": 512, "height": 512, "n_frames": 40, "guidance_scale": 7.5, "diffusion_prompt": "a shiny silver robotic wolf", "inversion_prompt": "", "num_diffusion_steps": 50, "num_inversion_steps": 50, "negative_diffusion_prompt": "ugly, blurry, low res, unrealistic, unaesthetic" }
Input Parameters
- fps
- Frames per second in the output video
- seed
- Random seed. Leave blank to randomize the seed.
- video (required)
- Input video.
- width
- Width of the output video. For non-square videos, we recommend using 672 x 384 or 384 x 672, aspect ratio 1.75.
- height
- Height of the output video. For non-square videos, we recommend using 672 x 384 or 384 x 672, aspect ratio 1.75.
- n_frames
- Number of frames in the video to process.
- guidance_scale
- Scale for classifier-free guidance
- diffusion_prompt (required)
- Describe your output video.
- inversion_prompt
- Describe your input video or leave it empty.
- num_diffusion_steps
- Number of sampling step.
- num_inversion_steps
- Number of inversion step.
- negative_diffusion_prompt
- Specify things to not see in the output
Output Schema
Output
Example Execution Logs
Using seed: 60822 /root/.pyenv/versions/3.11.7/lib/python3.11/site-packages/torchvision/io/video.py:161: UserWarning: The pts_unit 'pts' gives wrong results. Please use pts_unit 'sec'. warnings.warn("The pts_unit 'pts' gives wrong results. Please use pts_unit 'sec'.") 0%| | 0/50 [00:00<?, ?it/s] 2%|▏ | 1/50 [00:00<00:24, 1.99it/s] 4%|▍ | 2/50 [00:01<00:24, 1.98it/s] 6%|▌ | 3/50 [00:01<00:23, 1.99it/s] 8%|▊ | 4/50 [00:02<00:23, 1.99it/s] 10%|█ | 5/50 [00:02<00:22, 1.99it/s] 12%|█▏ | 6/50 [00:03<00:22, 1.99it/s] 14%|█▍ | 7/50 [00:03<00:21, 1.99it/s] 16%|█▌ | 8/50 [00:04<00:21, 1.99it/s] 18%|█▊ | 9/50 [00:04<00:20, 1.99it/s] 20%|██ | 10/50 [00:05<00:20, 1.99it/s] 22%|██▏ | 11/50 [00:05<00:19, 1.98it/s] 24%|██▍ | 12/50 [00:06<00:19, 1.98it/s] 26%|██▌ | 13/50 [00:06<00:18, 1.98it/s] 28%|██▊ | 14/50 [00:07<00:18, 1.98it/s] 30%|███ | 15/50 [00:07<00:17, 1.98it/s] 32%|███▏ | 16/50 [00:08<00:17, 1.99it/s] 34%|███▍ | 17/50 [00:08<00:16, 1.99it/s] 36%|███▌ | 18/50 [00:09<00:16, 2.00it/s] 38%|███▊ | 19/50 [00:09<00:15, 2.00it/s] 40%|████ | 20/50 [00:10<00:15, 2.00it/s] 42%|████▏ | 21/50 [00:10<00:14, 1.99it/s] 44%|████▍ | 22/50 [00:11<00:14, 2.00it/s] 46%|████▌ | 23/50 [00:11<00:13, 2.00it/s] 48%|████▊ | 24/50 [00:12<00:13, 2.00it/s] 50%|█████ | 25/50 [00:12<00:12, 1.99it/s] 52%|█████▏ | 26/50 [00:13<00:12, 2.00it/s] 54%|█████▍ | 27/50 [00:13<00:11, 1.99it/s] 56%|█████▌ | 28/50 [00:14<00:11, 1.99it/s] 58%|█████▊ | 29/50 [00:14<00:10, 1.99it/s] 60%|██████ | 30/50 [00:15<00:10, 1.99it/s] 62%|██████▏ | 31/50 [00:15<00:09, 1.99it/s] 64%|██████▍ | 32/50 [00:16<00:09, 1.99it/s] 66%|██████▌ | 33/50 [00:16<00:08, 1.99it/s] 68%|██████▊ | 34/50 [00:17<00:08, 1.99it/s] 70%|███████ | 35/50 [00:17<00:07, 1.99it/s] 72%|███████▏ | 36/50 [00:18<00:07, 1.99it/s] 74%|███████▍ | 37/50 [00:18<00:06, 1.99it/s] 76%|███████▌ | 38/50 [00:19<00:06, 1.99it/s] 78%|███████▊ | 39/50 [00:19<00:05, 1.99it/s] 80%|████████ | 40/50 [00:20<00:05, 1.99it/s] 82%|████████▏ | 41/50 [00:20<00:04, 1.99it/s] 84%|████████▍ | 42/50 [00:21<00:04, 1.99it/s] 86%|████████▌ | 43/50 [00:21<00:03, 1.99it/s] 88%|████████▊ | 44/50 [00:22<00:03, 1.99it/s] 90%|█████████ | 45/50 [00:22<00:02, 1.99it/s] 92%|█████████▏| 46/50 [00:23<00:02, 1.99it/s] 94%|█████████▍| 47/50 [00:23<00:01, 1.99it/s] 96%|█████████▌| 48/50 [00:24<00:01, 1.99it/s] 98%|█████████▊| 49/50 [00:24<00:00, 1.99it/s] 100%|██████████| 50/50 [00:25<00:00, 1.98it/s] 100%|██████████| 50/50 [00:25<00:00, 1.99it/s] 0%| | 0/50 [00:00<?, ?it/s] 2%|▏ | 1/50 [00:00<00:05, 9.28it/s] 4%|▍ | 2/50 [00:00<00:16, 2.95it/s] 6%|▌ | 3/50 [00:01<00:19, 2.41it/s] 8%|▊ | 4/50 [00:01<00:20, 2.23it/s] 10%|█ | 5/50 [00:02<00:21, 2.14it/s] 12%|█▏ | 6/50 [00:02<00:21, 2.08it/s] 14%|█▍ | 7/50 [00:03<00:20, 2.05it/s] 16%|█▌ | 8/50 [00:03<00:20, 2.03it/s] 18%|█▊ | 9/50 [00:04<00:20, 2.02it/s] 20%|██ | 10/50 [00:04<00:19, 2.01it/s] 22%|██▏ | 11/50 [00:05<00:19, 2.00it/s] 24%|██▍ | 12/50 [00:05<00:19, 2.00it/s] 26%|██▌ | 13/50 [00:06<00:18, 1.99it/s] 28%|██▊ | 14/50 [00:06<00:18, 1.99it/s] 30%|███ | 15/50 [00:07<00:17, 1.99it/s] 32%|███▏ | 16/50 [00:07<00:17, 1.99it/s] 34%|███▍ | 17/50 [00:08<00:16, 1.99it/s] 36%|███▌ | 18/50 [00:08<00:16, 1.99it/s] 38%|███▊ | 19/50 [00:09<00:15, 1.99it/s] 40%|████ | 20/50 [00:09<00:15, 1.99it/s] 42%|████▏ | 21/50 [00:10<00:14, 1.99it/s] 44%|████▍ | 22/50 [00:10<00:14, 1.99it/s] 46%|████▌ | 23/50 [00:11<00:13, 1.98it/s] 48%|████▊ | 24/50 [00:11<00:13, 1.99it/s] 50%|█████ | 25/50 [00:12<00:12, 1.98it/s] 52%|█████▏ | 26/50 [00:12<00:12, 1.99it/s] 54%|█████▍ | 27/50 [00:13<00:11, 1.98it/s] 56%|█████▌ | 28/50 [00:13<00:11, 1.99it/s] 58%|█████▊ | 29/50 [00:14<00:10, 1.98it/s] 60%|██████ | 30/50 [00:14<00:10, 1.99it/s] 62%|██████▏ | 31/50 [00:15<00:09, 1.98it/s] 64%|██████▍ | 32/50 [00:15<00:09, 1.98it/s] 66%|██████▌ | 33/50 [00:16<00:08, 1.98it/s] 68%|██████▊ | 34/50 [00:16<00:08, 1.98it/s] 70%|███████ | 35/50 [00:17<00:07, 1.98it/s] 72%|███████▏ | 36/50 [00:17<00:07, 1.98it/s] 74%|███████▍ | 37/50 [00:18<00:06, 1.98it/s] 76%|███████▌ | 38/50 [00:18<00:06, 1.98it/s] 78%|███████▊ | 39/50 [00:19<00:05, 1.98it/s] 80%|████████ | 40/50 [00:19<00:05, 1.98it/s] 82%|████████▏ | 41/50 [00:20<00:04, 1.98it/s] 84%|████████▍ | 42/50 [00:20<00:04, 1.98it/s] 86%|████████▌ | 43/50 [00:21<00:03, 1.98it/s] 88%|████████▊ | 44/50 [00:21<00:03, 1.98it/s] 90%|█████████ | 45/50 [00:22<00:02, 1.98it/s] 92%|█████████▏| 46/50 [00:22<00:02, 1.98it/s] 94%|█████████▍| 47/50 [00:23<00:01, 1.98it/s] 96%|█████████▌| 48/50 [00:23<00:01, 1.98it/s] 98%|█████████▊| 49/50 [00:24<00:00, 1.98it/s] 100%|██████████| 50/50 [00:24<00:00, 1.98it/s] 100%|██████████| 50/50 [00:24<00:00, 2.02it/s] Inversion completed! Number of frames for editing: 40 Sampling: 0%| | 0/50 [00:00<?, ?it/s] Sampling: 2%|▏ | 1/50 [00:02<02:24, 2.95s/it] Sampling: 4%|▍ | 2/50 [00:05<02:21, 2.94s/it] Sampling: 6%|▌ | 3/50 [00:08<02:18, 2.94s/it] Sampling: 8%|▊ | 4/50 [00:11<02:15, 2.94s/it] Sampling: 10%|█ | 5/50 [00:14<02:12, 2.95s/it] Sampling: 12%|█▏ | 6/50 [00:17<02:09, 2.94s/it] Sampling: 14%|█▍ | 7/50 [00:20<02:06, 2.94s/it] Sampling: 16%|█▌ | 8/50 [00:23<02:03, 2.94s/it] Sampling: 18%|█▊ | 9/50 [00:26<02:00, 2.94s/it] Sampling: 20%|██ | 10/50 [00:29<01:57, 2.94s/it] Sampling: 22%|██▏ | 11/50 [00:32<01:54, 2.94s/it] Sampling: 24%|██▍ | 12/50 [00:35<01:51, 2.94s/it] Sampling: 26%|██▌ | 13/50 [00:38<01:48, 2.94s/it] Sampling: 28%|██▊ | 14/50 [00:41<01:45, 2.94s/it] Sampling: 30%|███ | 15/50 [00:44<01:43, 2.94s/it] Sampling: 32%|███▏ | 16/50 [00:47<01:40, 2.94s/it] Sampling: 34%|███▍ | 17/50 [00:50<01:37, 2.94s/it] Sampling: 36%|███▌ | 18/50 [00:52<01:34, 2.94s/it] Sampling: 38%|███▊ | 19/50 [00:55<01:31, 2.94s/it] Sampling: 40%|████ | 20/50 [00:58<01:28, 2.94s/it] Sampling: 42%|████▏ | 21/50 [01:01<01:25, 2.94s/it] Sampling: 44%|████▍ | 22/50 [01:04<01:22, 2.94s/it] Sampling: 46%|████▌ | 23/50 [01:07<01:19, 2.95s/it] Sampling: 48%|████▊ | 24/50 [01:10<01:16, 2.95s/it] Sampling: 50%|█████ | 25/50 [01:13<01:13, 2.95s/it] Sampling: 52%|█████▏ | 26/50 [01:16<01:10, 2.94s/it] Sampling: 54%|█████▍ | 27/50 [01:19<01:07, 2.94s/it] Sampling: 56%|█████▌ | 28/50 [01:22<01:04, 2.95s/it] Sampling: 58%|█████▊ | 29/50 [01:25<01:01, 2.95s/it] Sampling: 60%|██████ | 30/50 [01:28<00:58, 2.95s/it] Sampling: 62%|██████▏ | 31/50 [01:31<00:55, 2.95s/it] Sampling: 64%|██████▍ | 32/50 [01:34<00:53, 2.95s/it] Sampling: 66%|██████▌ | 33/50 [01:37<00:50, 2.95s/it] Sampling: 68%|██████▊ | 34/50 [01:40<00:47, 2.95s/it] Sampling: 70%|███████ | 35/50 [01:43<00:44, 2.95s/it] Sampling: 72%|███████▏ | 36/50 [01:46<00:41, 2.95s/it] Sampling: 74%|███████▍ | 37/50 [01:48<00:38, 2.95s/it] Sampling: 76%|███████▌ | 38/50 [01:51<00:35, 2.95s/it] Sampling: 78%|███████▊ | 39/50 [01:54<00:32, 2.95s/it] Sampling: 80%|████████ | 40/50 [01:57<00:29, 2.95s/it] Sampling: 82%|████████▏ | 41/50 [02:00<00:26, 2.95s/it] Sampling: 84%|████████▍ | 42/50 [02:03<00:23, 2.96s/it] Sampling: 86%|████████▌ | 43/50 [02:06<00:20, 2.97s/it] Sampling: 88%|████████▊ | 44/50 [02:09<00:17, 2.97s/it] Sampling: 90%|█████████ | 45/50 [02:12<00:14, 2.96s/it] Sampling: 92%|█████████▏| 46/50 [02:15<00:11, 2.97s/it] Sampling: 94%|█████████▍| 47/50 [02:18<00:08, 2.97s/it] Sampling: 96%|█████████▌| 48/50 [02:21<00:05, 2.96s/it] Sampling: 98%|█████████▊| 49/50 [02:24<00:02, 2.96s/it] Sampling: 100%|██████████| 50/50 [02:27<00:00, 2.95s/it] Sampling: 100%|██████████| 50/50 [02:27<00:00, 2.95s/it] Done!
Version Details
- Version ID
d456a26c0c1764ac4d9b933a564fe70e660c46423ba0afb5d3200f1a46060728
- Version Created
- January 23, 2024