thudm/cogvideox-i2v 🔢🖼️📝 → 🖼️

▶️ 1.1K runs 📅 Sep 2024 ⚙️ Cog 0.9.23 🔗 GitHub 📄 Paper ⚖️ License
image-to-video

About

Image-to-Video Diffusion Models with An Expert Transformer

Example Output

Prompt:

"Starry sky slowly rotating."

Output

Performance Metrics

406.22s Prediction Time
555.34s Total Time
All Input Parameters
{
  "image": "https://replicate.delivery/pbxt/Lf97CMO0Sz0sZ0IuQarZRT8TbcMz4pCurtiLSKWDBPSTMb1S/input.jpg",
  "prompt": "Starry sky slowly rotating.",
  "num_frames": 49,
  "guidance_scale": 6,
  "num_inference_steps": 50
}
Input Parameters
seed Type: integer
Random seed. Leave blank to randomize the seed
image (required) Type: string
Input image
prompt Type: stringDefault: Starry sky slowly rotating.
Input prompt
num_frames Type: integerDefault: 49
Number of frames for the output video
guidance_scale Type: numberDefault: 6Range: 1 - 20
Scale for classifier-free guidance
num_inference_steps Type: integerDefault: 50Range: 1 - 500
Number of denoising steps
Output Schema

Output

Type: stringFormat: uri

Example Execution Logs
Using seed: 10643
  0%|          | 0/50 [00:00<?, ?it/s]
  2%|▏         | 1/50 [00:12<10:16, 12.59s/it]
  4%|▍         | 2/50 [00:20<07:41,  9.62s/it]
  6%|▌         | 3/50 [00:27<06:47,  8.66s/it]
  8%|▊         | 4/50 [00:35<06:18,  8.22s/it]
 10%|█         | 5/50 [00:42<05:59,  7.99s/it]
 12%|█▏        | 6/50 [00:50<05:45,  7.85s/it]
 14%|█▍        | 7/50 [00:57<05:34,  7.77s/it]
 16%|█▌        | 8/50 [01:05<05:24,  7.72s/it]
 18%|█▊        | 9/50 [01:13<05:14,  7.68s/it]
 20%|██        | 10/50 [01:20<05:06,  7.65s/it]
 22%|██▏       | 11/50 [01:28<04:57,  7.64s/it]
 24%|██▍       | 12/50 [01:35<04:49,  7.63s/it]
 26%|██▌       | 13/50 [01:43<04:42,  7.62s/it]
 28%|██▊       | 14/50 [01:51<04:34,  7.62s/it]
 30%|███       | 15/50 [01:58<04:26,  7.62s/it]
 32%|███▏      | 16/50 [02:06<04:19,  7.63s/it]
 34%|███▍      | 17/50 [02:14<04:11,  7.63s/it]
 36%|███▌      | 18/50 [02:21<04:04,  7.63s/it]
 38%|███▊      | 19/50 [02:29<03:56,  7.63s/it]
 40%|████      | 20/50 [02:37<03:49,  7.63s/it]
 42%|████▏     | 21/50 [02:44<03:41,  7.63s/it]
 44%|████▍     | 22/50 [02:52<03:33,  7.64s/it]
 46%|████▌     | 23/50 [02:59<03:26,  7.64s/it]
 48%|████▊     | 24/50 [03:07<03:18,  7.64s/it]
 50%|█████     | 25/50 [03:15<03:10,  7.64s/it]
 52%|█████▏    | 26/50 [03:22<03:03,  7.64s/it]
 54%|█████▍    | 27/50 [03:30<02:55,  7.64s/it]
 56%|█████▌    | 28/50 [03:38<02:48,  7.64s/it]
 58%|█████▊    | 29/50 [03:45<02:40,  7.64s/it]
 60%|██████    | 30/50 [03:53<02:32,  7.64s/it]
 62%|██████▏   | 31/50 [04:01<02:25,  7.65s/it]
 64%|██████▍   | 32/50 [04:08<02:17,  7.65s/it]
 66%|██████▌   | 33/50 [04:16<02:10,  7.66s/it]
 68%|██████▊   | 34/50 [04:24<02:02,  7.66s/it]
 70%|███████   | 35/50 [04:31<01:54,  7.66s/it]
 72%|███████▏  | 36/50 [04:39<01:47,  7.66s/it]
 74%|███████▍  | 37/50 [04:47<01:39,  7.66s/it]
 76%|███████▌  | 38/50 [04:54<01:31,  7.66s/it]
 78%|███████▊  | 39/50 [05:02<01:24,  7.66s/it]
 80%|████████  | 40/50 [05:10<01:16,  7.66s/it]
 82%|████████▏ | 41/50 [05:17<01:08,  7.66s/it]
 84%|████████▍ | 42/50 [05:25<01:01,  7.66s/it]
 86%|████████▌ | 43/50 [05:32<00:53,  7.66s/it]
 88%|████████▊ | 44/50 [05:40<00:45,  7.66s/it]
 90%|█████████ | 45/50 [05:48<00:38,  7.66s/it]
 92%|█████████▏| 46/50 [05:55<00:30,  7.66s/it]
 94%|█████████▍| 47/50 [06:03<00:22,  7.66s/it]
 96%|█████████▌| 48/50 [06:11<00:15,  7.66s/it]
 98%|█████████▊| 49/50 [06:18<00:07,  7.66s/it]
100%|██████████| 50/50 [06:26<00:00,  7.66s/it]
100%|██████████| 50/50 [06:26<00:00,  7.73s/it]
Version Details
Version ID
82caaa79520e03b3963c975e38dd68ac8a6b18a8a39c19fd8dc03b4ed4b91c58
Version Created
September 21, 2024
Run on Replicate →