cjwbw/tokenflow 🔢🖼️📝 → 🖼️

▶️ 2.0K runs 📅 Jan 2024 ⚙️ Cog 0.9.3 🔗 GitHub 📄 Paper ⚖️ License
video-editing video-to-video

About

Consistent Diffusion Features for Consistent Video Editing

Example Output

Output

Performance Metrics

215.74s Prediction Time
217.04s Total Time
All Input Parameters
{
  "fps": 20,
  "video": "https://replicate.delivery/pbxt/KGcnUKDlfys2uAo8eHbWSE8Pzc17hI5IOGdstViaBvfPJWHY/wolf.mp4",
  "width": 512,
  "height": 512,
  "n_frames": 40,
  "guidance_scale": 7.5,
  "diffusion_prompt": "a shiny silver robotic wolf",
  "inversion_prompt": "",
  "num_diffusion_steps": 50,
  "num_inversion_steps": 50,
  "negative_diffusion_prompt": "ugly, blurry, low res, unrealistic, unaesthetic"
}
Input Parameters
fps Type: integerDefault: 10
Frames per second in the output video
seed Type: integer
Random seed. Leave blank to randomize the seed.
video (required) Type: string
Input video.
width Type: integerDefault: 512
Width of the output video. For non-square videos, we recommend using 672 x 384 or 384 x 672, aspect ratio 1.75.
height Type: integerDefault: 512
Height of the output video. For non-square videos, we recommend using 672 x 384 or 384 x 672, aspect ratio 1.75.
n_frames Type: integerDefault: 40
Number of frames in the video to process.
guidance_scale Type: numberDefault: 7.5Range: 1 - 20
Scale for classifier-free guidance
diffusion_prompt (required) Type: string
Describe your output video.
inversion_prompt Type: stringDefault:
Describe your input video or leave it empty.
num_diffusion_steps Type: integerDefault: 50
Number of sampling step.
num_inversion_steps Type: integerDefault: 50
Number of inversion step.
negative_diffusion_prompt Type: stringDefault: ugly, blurry, low res, unrealistic, unaesthetic
Specify things to not see in the output
Output Schema

Output

Type: stringFormat: uri

Example Execution Logs
Using seed: 60822
/root/.pyenv/versions/3.11.7/lib/python3.11/site-packages/torchvision/io/video.py:161: UserWarning: The pts_unit 'pts' gives wrong results. Please use pts_unit 'sec'.
warnings.warn("The pts_unit 'pts' gives wrong results. Please use pts_unit 'sec'.")
  0%|          | 0/50 [00:00<?, ?it/s]
  2%|▏         | 1/50 [00:00<00:24,  1.99it/s]
  4%|▍         | 2/50 [00:01<00:24,  1.98it/s]
  6%|▌         | 3/50 [00:01<00:23,  1.99it/s]
  8%|▊         | 4/50 [00:02<00:23,  1.99it/s]
 10%|█         | 5/50 [00:02<00:22,  1.99it/s]
 12%|█▏        | 6/50 [00:03<00:22,  1.99it/s]
 14%|█▍        | 7/50 [00:03<00:21,  1.99it/s]
 16%|█▌        | 8/50 [00:04<00:21,  1.99it/s]
 18%|█▊        | 9/50 [00:04<00:20,  1.99it/s]
 20%|██        | 10/50 [00:05<00:20,  1.99it/s]
 22%|██▏       | 11/50 [00:05<00:19,  1.98it/s]
 24%|██▍       | 12/50 [00:06<00:19,  1.98it/s]
 26%|██▌       | 13/50 [00:06<00:18,  1.98it/s]
 28%|██▊       | 14/50 [00:07<00:18,  1.98it/s]
 30%|███       | 15/50 [00:07<00:17,  1.98it/s]
 32%|███▏      | 16/50 [00:08<00:17,  1.99it/s]
 34%|███▍      | 17/50 [00:08<00:16,  1.99it/s]
 36%|███▌      | 18/50 [00:09<00:16,  2.00it/s]
 38%|███▊      | 19/50 [00:09<00:15,  2.00it/s]
 40%|████      | 20/50 [00:10<00:15,  2.00it/s]
 42%|████▏     | 21/50 [00:10<00:14,  1.99it/s]
 44%|████▍     | 22/50 [00:11<00:14,  2.00it/s]
 46%|████▌     | 23/50 [00:11<00:13,  2.00it/s]
 48%|████▊     | 24/50 [00:12<00:13,  2.00it/s]
 50%|█████     | 25/50 [00:12<00:12,  1.99it/s]
 52%|█████▏    | 26/50 [00:13<00:12,  2.00it/s]
 54%|█████▍    | 27/50 [00:13<00:11,  1.99it/s]
 56%|█████▌    | 28/50 [00:14<00:11,  1.99it/s]
 58%|█████▊    | 29/50 [00:14<00:10,  1.99it/s]
 60%|██████    | 30/50 [00:15<00:10,  1.99it/s]
 62%|██████▏   | 31/50 [00:15<00:09,  1.99it/s]
 64%|██████▍   | 32/50 [00:16<00:09,  1.99it/s]
 66%|██████▌   | 33/50 [00:16<00:08,  1.99it/s]
 68%|██████▊   | 34/50 [00:17<00:08,  1.99it/s]
 70%|███████   | 35/50 [00:17<00:07,  1.99it/s]
 72%|███████▏  | 36/50 [00:18<00:07,  1.99it/s]
 74%|███████▍  | 37/50 [00:18<00:06,  1.99it/s]
 76%|███████▌  | 38/50 [00:19<00:06,  1.99it/s]
 78%|███████▊  | 39/50 [00:19<00:05,  1.99it/s]
 80%|████████  | 40/50 [00:20<00:05,  1.99it/s]
 82%|████████▏ | 41/50 [00:20<00:04,  1.99it/s]
 84%|████████▍ | 42/50 [00:21<00:04,  1.99it/s]
 86%|████████▌ | 43/50 [00:21<00:03,  1.99it/s]
 88%|████████▊ | 44/50 [00:22<00:03,  1.99it/s]
 90%|█████████ | 45/50 [00:22<00:02,  1.99it/s]
 92%|█████████▏| 46/50 [00:23<00:02,  1.99it/s]
 94%|█████████▍| 47/50 [00:23<00:01,  1.99it/s]
 96%|█████████▌| 48/50 [00:24<00:01,  1.99it/s]
 98%|█████████▊| 49/50 [00:24<00:00,  1.99it/s]
100%|██████████| 50/50 [00:25<00:00,  1.98it/s]
100%|██████████| 50/50 [00:25<00:00,  1.99it/s]
  0%|          | 0/50 [00:00<?, ?it/s]
  2%|▏         | 1/50 [00:00<00:05,  9.28it/s]
  4%|▍         | 2/50 [00:00<00:16,  2.95it/s]
  6%|▌         | 3/50 [00:01<00:19,  2.41it/s]
  8%|▊         | 4/50 [00:01<00:20,  2.23it/s]
 10%|█         | 5/50 [00:02<00:21,  2.14it/s]
 12%|█▏        | 6/50 [00:02<00:21,  2.08it/s]
 14%|█▍        | 7/50 [00:03<00:20,  2.05it/s]
 16%|█▌        | 8/50 [00:03<00:20,  2.03it/s]
 18%|█▊        | 9/50 [00:04<00:20,  2.02it/s]
 20%|██        | 10/50 [00:04<00:19,  2.01it/s]
 22%|██▏       | 11/50 [00:05<00:19,  2.00it/s]
 24%|██▍       | 12/50 [00:05<00:19,  2.00it/s]
 26%|██▌       | 13/50 [00:06<00:18,  1.99it/s]
 28%|██▊       | 14/50 [00:06<00:18,  1.99it/s]
 30%|███       | 15/50 [00:07<00:17,  1.99it/s]
 32%|███▏      | 16/50 [00:07<00:17,  1.99it/s]
 34%|███▍      | 17/50 [00:08<00:16,  1.99it/s]
 36%|███▌      | 18/50 [00:08<00:16,  1.99it/s]
 38%|███▊      | 19/50 [00:09<00:15,  1.99it/s]
 40%|████      | 20/50 [00:09<00:15,  1.99it/s]
 42%|████▏     | 21/50 [00:10<00:14,  1.99it/s]
 44%|████▍     | 22/50 [00:10<00:14,  1.99it/s]
 46%|████▌     | 23/50 [00:11<00:13,  1.98it/s]
 48%|████▊     | 24/50 [00:11<00:13,  1.99it/s]
 50%|█████     | 25/50 [00:12<00:12,  1.98it/s]
 52%|█████▏    | 26/50 [00:12<00:12,  1.99it/s]
 54%|█████▍    | 27/50 [00:13<00:11,  1.98it/s]
 56%|█████▌    | 28/50 [00:13<00:11,  1.99it/s]
 58%|█████▊    | 29/50 [00:14<00:10,  1.98it/s]
 60%|██████    | 30/50 [00:14<00:10,  1.99it/s]
 62%|██████▏   | 31/50 [00:15<00:09,  1.98it/s]
 64%|██████▍   | 32/50 [00:15<00:09,  1.98it/s]
 66%|██████▌   | 33/50 [00:16<00:08,  1.98it/s]
 68%|██████▊   | 34/50 [00:16<00:08,  1.98it/s]
 70%|███████   | 35/50 [00:17<00:07,  1.98it/s]
 72%|███████▏  | 36/50 [00:17<00:07,  1.98it/s]
 74%|███████▍  | 37/50 [00:18<00:06,  1.98it/s]
 76%|███████▌  | 38/50 [00:18<00:06,  1.98it/s]
 78%|███████▊  | 39/50 [00:19<00:05,  1.98it/s]
 80%|████████  | 40/50 [00:19<00:05,  1.98it/s]
 82%|████████▏ | 41/50 [00:20<00:04,  1.98it/s]
 84%|████████▍ | 42/50 [00:20<00:04,  1.98it/s]
 86%|████████▌ | 43/50 [00:21<00:03,  1.98it/s]
 88%|████████▊ | 44/50 [00:21<00:03,  1.98it/s]
 90%|█████████ | 45/50 [00:22<00:02,  1.98it/s]
 92%|█████████▏| 46/50 [00:22<00:02,  1.98it/s]
 94%|█████████▍| 47/50 [00:23<00:01,  1.98it/s]
 96%|█████████▌| 48/50 [00:23<00:01,  1.98it/s]
 98%|█████████▊| 49/50 [00:24<00:00,  1.98it/s]
100%|██████████| 50/50 [00:24<00:00,  1.98it/s]
100%|██████████| 50/50 [00:24<00:00,  2.02it/s]
Inversion completed!
Number of frames for editing:  40
Sampling:   0%|          | 0/50 [00:00<?, ?it/s]
Sampling:   2%|▏         | 1/50 [00:02<02:24,  2.95s/it]
Sampling:   4%|▍         | 2/50 [00:05<02:21,  2.94s/it]
Sampling:   6%|▌         | 3/50 [00:08<02:18,  2.94s/it]
Sampling:   8%|▊         | 4/50 [00:11<02:15,  2.94s/it]
Sampling:  10%|█         | 5/50 [00:14<02:12,  2.95s/it]
Sampling:  12%|█▏        | 6/50 [00:17<02:09,  2.94s/it]
Sampling:  14%|█▍        | 7/50 [00:20<02:06,  2.94s/it]
Sampling:  16%|█▌        | 8/50 [00:23<02:03,  2.94s/it]
Sampling:  18%|█▊        | 9/50 [00:26<02:00,  2.94s/it]
Sampling:  20%|██        | 10/50 [00:29<01:57,  2.94s/it]
Sampling:  22%|██▏       | 11/50 [00:32<01:54,  2.94s/it]
Sampling:  24%|██▍       | 12/50 [00:35<01:51,  2.94s/it]
Sampling:  26%|██▌       | 13/50 [00:38<01:48,  2.94s/it]
Sampling:  28%|██▊       | 14/50 [00:41<01:45,  2.94s/it]
Sampling:  30%|███       | 15/50 [00:44<01:43,  2.94s/it]
Sampling:  32%|███▏      | 16/50 [00:47<01:40,  2.94s/it]
Sampling:  34%|███▍      | 17/50 [00:50<01:37,  2.94s/it]
Sampling:  36%|███▌      | 18/50 [00:52<01:34,  2.94s/it]
Sampling:  38%|███▊      | 19/50 [00:55<01:31,  2.94s/it]
Sampling:  40%|████      | 20/50 [00:58<01:28,  2.94s/it]
Sampling:  42%|████▏     | 21/50 [01:01<01:25,  2.94s/it]
Sampling:  44%|████▍     | 22/50 [01:04<01:22,  2.94s/it]
Sampling:  46%|████▌     | 23/50 [01:07<01:19,  2.95s/it]
Sampling:  48%|████▊     | 24/50 [01:10<01:16,  2.95s/it]
Sampling:  50%|█████     | 25/50 [01:13<01:13,  2.95s/it]
Sampling:  52%|█████▏    | 26/50 [01:16<01:10,  2.94s/it]
Sampling:  54%|█████▍    | 27/50 [01:19<01:07,  2.94s/it]
Sampling:  56%|█████▌    | 28/50 [01:22<01:04,  2.95s/it]
Sampling:  58%|█████▊    | 29/50 [01:25<01:01,  2.95s/it]
Sampling:  60%|██████    | 30/50 [01:28<00:58,  2.95s/it]
Sampling:  62%|██████▏   | 31/50 [01:31<00:55,  2.95s/it]
Sampling:  64%|██████▍   | 32/50 [01:34<00:53,  2.95s/it]
Sampling:  66%|██████▌   | 33/50 [01:37<00:50,  2.95s/it]
Sampling:  68%|██████▊   | 34/50 [01:40<00:47,  2.95s/it]
Sampling:  70%|███████   | 35/50 [01:43<00:44,  2.95s/it]
Sampling:  72%|███████▏  | 36/50 [01:46<00:41,  2.95s/it]
Sampling:  74%|███████▍  | 37/50 [01:48<00:38,  2.95s/it]
Sampling:  76%|███████▌  | 38/50 [01:51<00:35,  2.95s/it]
Sampling:  78%|███████▊  | 39/50 [01:54<00:32,  2.95s/it]
Sampling:  80%|████████  | 40/50 [01:57<00:29,  2.95s/it]
Sampling:  82%|████████▏ | 41/50 [02:00<00:26,  2.95s/it]
Sampling:  84%|████████▍ | 42/50 [02:03<00:23,  2.96s/it]
Sampling:  86%|████████▌ | 43/50 [02:06<00:20,  2.97s/it]
Sampling:  88%|████████▊ | 44/50 [02:09<00:17,  2.97s/it]
Sampling:  90%|█████████ | 45/50 [02:12<00:14,  2.96s/it]
Sampling:  92%|█████████▏| 46/50 [02:15<00:11,  2.97s/it]
Sampling:  94%|█████████▍| 47/50 [02:18<00:08,  2.97s/it]
Sampling:  96%|█████████▌| 48/50 [02:21<00:05,  2.96s/it]
Sampling:  98%|█████████▊| 49/50 [02:24<00:02,  2.96s/it]
Sampling: 100%|██████████| 50/50 [02:27<00:00,  2.95s/it]
Sampling: 100%|██████████| 50/50 [02:27<00:00,  2.95s/it]
Done!
Version Details
Version ID
d456a26c0c1764ac4d9b933a564fe70e660c46423ba0afb5d3200f1a46060728
Version Created
January 23, 2024
Run on Replicate →