cjwbw/kandinsky-2-2-controlnet-depth 🔢❓🖼️📝 → 🖼️
About
Kandinsky Image Generation with ControlNet Conditioning

Example Output
Prompt:
"A robot, 4k photo"
Output

Performance Metrics
9.33s
Prediction Time
284.38s
Total Time
All Input Parameters
{ "task": "img2img", "image": "https://replicate.delivery/pbxt/JBQjVyAYINXgKMvXGfE2ykyLgNE6E7ytZLC8b26BM2D0IRoG/cat.png", "width": 768, "height": 768, "prompt": "A robot, 4k photo", "num_outputs": 1, "negative_prompt": "lowres, text, error, cropped, worst quality, low quality, jpeg artifacts, ugly, duplicate, morbid, mutilated, out of frame, extra fingers, mutated hands, poorly drawn hands, poorly drawn face, mutation, deformed, blurry, dehydrated, bad anatomy, bad proportions, extra limbs, cloned face, disfigured, gross proportions, malformed limbs, missing arms, missing legs, extra arms, extra legs, fused fingers, too many fingers, long neck, username, watermark, signature", "num_inference_steps": 75 }
Input Parameters
- seed
- Random seed. Leave blank to randomize the seed
- task
- Choose a task
- image
- Input image
- width
- Width of output image. Lower the setting if hits memory limits.
- height
- Height of output image. Lower the setting if hits memory limits.
- prompt
- Input prompt
- num_outputs
- Number of images to output.
- negative_prompt
- Specify things to not see in the output
- num_inference_steps
- Number of denoising steps
Output Schema
Output
Example Execution Logs
Using seed: 49968 0%| | 0/21 [00:00<?, ?it/s] 19%|█▉ | 4/21 [00:00<00:00, 39.59it/s] 43%|████▎ | 9/21 [00:00<00:00, 40.19it/s] 67%|██████▋ | 14/21 [00:00<00:00, 40.28it/s] 90%|█████████ | 19/21 [00:00<00:00, 40.10it/s] 100%|██████████| 21/21 [00:00<00:00, 40.17it/s] Token indices sequence length is longer than the specified maximum sequence length for this model (112 > 77). Running this sequence through the model will result in indexing errors The following part of your input was truncated because CLIP can only handle sequences up to 77 tokens: ['gross proportions, malformed limbs, missing arms, missing legs, extra arms, extra legs, fused fingers, too many fingers, long neck, username, watermark, signature'] 0%| | 0/25 [00:00<?, ?it/s] 20%|██ | 5/25 [00:00<00:00, 40.73it/s] 40%|████ | 10/25 [00:00<00:00, 39.75it/s] 60%|██████ | 15/25 [00:00<00:00, 39.91it/s] 80%|████████ | 20/25 [00:00<00:00, 39.96it/s] 96%|█████████▌| 24/25 [00:00<00:00, 39.72it/s] 100%|██████████| 25/25 [00:00<00:00, 39.86it/s] 0%| | 0/37 [00:00<?, ?it/s] 3%|▎ | 1/37 [00:00<00:09, 3.78it/s] 8%|▊ | 3/37 [00:00<00:04, 8.39it/s] 14%|█▎ | 5/37 [00:00<00:02, 10.77it/s] 19%|█▉ | 7/37 [00:00<00:02, 12.40it/s] 24%|██▍ | 9/37 [00:00<00:02, 13.46it/s] 30%|██▉ | 11/37 [00:00<00:01, 14.10it/s] 35%|███▌ | 13/37 [00:01<00:01, 14.55it/s] 41%|████ | 15/37 [00:01<00:01, 14.79it/s] 46%|████▌ | 17/37 [00:01<00:01, 15.03it/s] 51%|█████▏ | 19/37 [00:01<00:01, 15.20it/s] 57%|█████▋ | 21/37 [00:01<00:01, 15.28it/s] 62%|██████▏ | 23/37 [00:01<00:00, 15.37it/s] 68%|██████▊ | 25/37 [00:01<00:00, 15.45it/s] 73%|███████▎ | 27/37 [00:01<00:00, 15.53it/s] 78%|███████▊ | 29/37 [00:02<00:00, 15.62it/s] 84%|████████▍ | 31/37 [00:02<00:00, 15.39it/s] 89%|████████▉ | 33/37 [00:02<00:00, 15.50it/s] 95%|█████████▍| 35/37 [00:02<00:00, 15.63it/s] 100%|██████████| 37/37 [00:02<00:00, 15.70it/s] 100%|██████████| 37/37 [00:02<00:00, 14.28it/s]
Version Details
- Version ID
98b54ca0b42be225e927f1dae2d9c506e69fe5b3bce301e13718d662a227a12b
- Version Created
- July 16, 2023