ai-forever/kandinsky-2.2 🔢❓📝 → 🖼️
About
multilingual text2image latent diffusion model

Example Output
Prompt:
"A moss covered astronaut with a black background"
Output

Performance Metrics
9.03s
Prediction Time
9.00s
Total Time
All Input Parameters
{ "width": 1024, "height": 1024, "prompt": "A moss covered astronaut with a black background", "num_outputs": 1, "num_inference_steps": 75 }
Input Parameters
- seed
- Random seed. Leave blank to randomize the seed
- width
- Width of output image. Lower the setting if hits memory limits.
- height
- Height of output image. Lower the setting if hits memory limits.
- prompt
- Input prompt
- num_outputs
- Number of images to output.
- output_format
- Output image format
- negative_prompt
- Specify things to not see in the output
- num_inference_steps
- Number of denoising steps
- num_inference_steps_prior
- Number of denoising steps for priors
Output Schema
Output
Example Execution Logs
Using seed: 4697 0%| | 0/25 [00:00<?, ?it/s] 20%|██ | 5/25 [00:00<00:00, 40.13it/s] 40%|████ | 10/25 [00:00<00:00, 40.19it/s] 60%|██████ | 15/25 [00:00<00:00, 40.05it/s] 80%|████████ | 20/25 [00:00<00:00, 39.66it/s] 100%|██████████| 25/25 [00:00<00:00, 39.93it/s] 100%|██████████| 25/25 [00:00<00:00, 39.90it/s] The following part of your input was truncated because CLIP can only handle sequences up to 77 tokens: ['gross proportions, malformed limbs, missing arms, missing legs, extra arms, extra legs, fused fingers, too many fingers, long neck, username, watermark, signature'] 0%| | 0/25 [00:00<?, ?it/s] 16%|█▌ | 4/25 [00:00<00:00, 37.91it/s] 36%|███▌ | 9/25 [00:00<00:00, 39.60it/s] 56%|█████▌ | 14/25 [00:00<00:00, 39.87it/s] 76%|███████▌ | 19/25 [00:00<00:00, 40.26it/s] 96%|█████████▌| 24/25 [00:00<00:00, 40.42it/s] 100%|██████████| 25/25 [00:00<00:00, 40.12it/s] 0%| | 0/75 [00:00<?, ?it/s] 1%|▏ | 1/75 [00:00<00:14, 5.06it/s] 4%|▍ | 3/75 [00:00<00:08, 8.75it/s] 7%|▋ | 5/75 [00:00<00:06, 10.26it/s] 9%|▉ | 7/75 [00:00<00:06, 11.00it/s] 12%|█▏ | 9/75 [00:00<00:05, 11.45it/s] 15%|█▍ | 11/75 [00:01<00:05, 11.72it/s] 17%|█▋ | 13/75 [00:01<00:05, 11.89it/s] 20%|██ | 15/75 [00:01<00:04, 12.01it/s] 23%|██▎ | 17/75 [00:01<00:04, 12.09it/s] 25%|██▌ | 19/75 [00:01<00:04, 12.12it/s] 28%|██▊ | 21/75 [00:01<00:04, 12.16it/s] 31%|███ | 23/75 [00:02<00:04, 12.19it/s] 33%|███▎ | 25/75 [00:02<00:04, 12.21it/s] 36%|███▌ | 27/75 [00:02<00:03, 12.20it/s] 39%|███▊ | 29/75 [00:02<00:03, 12.21it/s] 41%|████▏ | 31/75 [00:02<00:03, 12.12it/s] 44%|████▍ | 33/75 [00:02<00:03, 12.11it/s] 47%|████▋ | 35/75 [00:02<00:03, 12.04it/s] 49%|████▉ | 37/75 [00:03<00:03, 11.90it/s] 52%|█████▏ | 39/75 [00:03<00:03, 11.98it/s] 55%|█████▍ | 41/75 [00:03<00:02, 12.04it/s] 57%|█████▋ | 43/75 [00:03<00:02, 12.08it/s] 60%|██████ | 45/75 [00:03<00:02, 12.12it/s] 63%|██████▎ | 47/75 [00:03<00:02, 12.09it/s] 65%|██████▌ | 49/75 [00:04<00:02, 12.06it/s] 68%|██████▊ | 51/75 [00:04<00:01, 12.10it/s] 71%|███████ | 53/75 [00:04<00:01, 12.14it/s] 73%|███████▎ | 55/75 [00:04<00:01, 12.14it/s] 76%|███████▌ | 57/75 [00:04<00:01, 12.16it/s] 79%|███████▊ | 59/75 [00:04<00:01, 12.18it/s] 81%|████████▏ | 61/75 [00:05<00:01, 12.11it/s] 84%|████████▍ | 63/75 [00:05<00:00, 12.15it/s] 87%|████████▋ | 65/75 [00:05<00:00, 12.14it/s] 89%|████████▉ | 67/75 [00:05<00:00, 12.14it/s] 92%|█████████▏| 69/75 [00:05<00:00, 12.16it/s] 95%|█████████▍| 71/75 [00:05<00:00, 12.19it/s] 97%|█████████▋| 73/75 [00:06<00:00, 12.17it/s] 100%|██████████| 75/75 [00:06<00:00, 12.18it/s] 100%|██████████| 75/75 [00:06<00:00, 11.92it/s]
Version Details
- Version ID
ad9d7879fbffa2874e1d909d1d37d9bc682889cc65b31f7bb00d2362619f194a
- Version Created
- April 10, 2024