ai-forever/kandinsky-2.2

10.0M runs, Jul 2023, Cog 0.9.4
Tags: controlnet, multilingual, text-to-image

About

Multilingual text-to-image latent diffusion model.

Example Output

Prompt:

"A moss covered astronaut with a black background"

Performance Metrics

9.03s Prediction Time
9.00s Total Time
All Input Parameters
{
  "width": 1024,
  "height": 1024,
  "prompt": "A moss covered astronaut with a black background",
  "num_outputs": 1,
  "num_inference_steps": 75
}
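The inputs above can also be submitted programmatically. The following is a minimal sketch, assuming the official `replicate` Python client is installed and that `REPLICATE_API_TOKEN` is set in the environment. The model reference combines the model name with the version ID listed under Version Details; the helper names `build_input` and `run_example` are illustrative and not part of the model page.

```python
import os

MODEL = "ai-forever/kandinsky-2.2"
VERSION = "ad9d7879fbffa2874e1d909d1d37d9bc682889cc65b31f7bb00d2362619f194a"


def build_input():
    """Return the input payload from the example above."""
    return {
        "width": 1024,
        "height": 1024,
        "prompt": "A moss covered astronaut with a black background",
        "num_outputs": 1,
        "num_inference_steps": 75,
    }


def run_example():
    """Submit the example prediction (needs network access and an API token)."""
    import replicate  # assumption: installed with `pip install replicate`

    return replicate.run(f"{MODEL}:{VERSION}", input=build_input())


# Only call the API when a token is available, so the file can be imported safely.
if __name__ == "__main__" and os.environ.get("REPLICATE_API_TOKEN"):
    print(run_example())
```

Parameters left out of the payload, such as `seed` or `negative_prompt`, should fall back to the defaults described under Input Parameters.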
Input Parameters
seed Type: integer
Random seed. Leave blank to use a random seed.
width Default: 512
Width of the output image. Lower this value if the run hits memory limits.
height Default: 512
Height of the output image. Lower this value if the run hits memory limits.
prompt Type: string, Default: A moss covered astronaut with a black background
Input prompt
num_outputs Type: integer, Default: 1, Range: 1 - 4
Number of images to output.
output_format Default: webp
Output image format
negative_prompt Type: string
Things you do not want to see in the output
num_inference_steps Type: integer, Default: 75, Range: 1 - 500
Number of denoising steps
num_inference_steps_prior Type: integer, Default: 25, Range: 1 - 500
Number of denoising steps for priors
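The defaults and ranges listed above can be checked on the client side before a request is sent. This is a sketch under stated assumptions: `prepare_input`, `DEFAULTS`, and `INT_RANGES` are hypothetical names, and only the integer ranges documented above are enforced, since the page gives no explicit bounds for `width`, `height`, or `output_format`.

```python
# Ranges and defaults copied from the parameter list above.
INT_RANGES = {
    "num_outputs": (1, 4),
    "num_inference_steps": (1, 500),
    "num_inference_steps_prior": (1, 500),
}
DEFAULTS = {
    "width": 512,
    "height": 512,
    "prompt": "A moss covered astronaut with a black background",
    "num_outputs": 1,
    "output_format": "webp",
    "num_inference_steps": 75,
    "num_inference_steps_prior": 25,
}
# Parameters that have no documented default but are still accepted.
OPTIONAL = {"seed", "negative_prompt"}


def prepare_input(**overrides):
    """Merge overrides into the defaults and check the documented integer ranges."""
    unknown = set(overrides) - set(DEFAULTS) - OPTIONAL
    if unknown:
        raise ValueError(f"unknown parameters: {sorted(unknown)}")
    params = {**DEFAULTS, **overrides}
    for name, (low, high) in INT_RANGES.items():
        value = params[name]
        if not isinstance(value, int) or not low <= value <= high:
            raise ValueError(
                f"{name} must be an integer in [{low}, {high}], got {value!r}"
            )
    return params
```

For example, `prepare_input(num_outputs=2, seed=4697)` returns the defaults with those two values applied, while `prepare_input(num_outputs=5)` raises `ValueError`.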
Output Schema

Output

Type: array, Items Type: string, Items Format: uri
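Because the output is an array of URI strings, a caller typically downloads each item. The sketch below assumes plain string URIs as in the schema; some client libraries may wrap them in their own file objects instead. The function names and the `output_<index>` naming scheme are illustrative choices, not something the model page prescribes.

```python
from pathlib import Path
from urllib.parse import urlparse
from urllib.request import urlopen


def local_name(uri, index, default_ext=".webp"):
    """Pick a local filename for one output URI, keeping its extension if present."""
    ext = Path(urlparse(uri).path).suffix or default_ext
    return f"output_{index}{ext}"


def save_outputs(uris, directory="."):
    """Download every URI in the output array (needs network access)."""
    saved = []
    for index, uri in enumerate(uris):
        target = Path(directory) / local_name(uri, index)
        with urlopen(uri) as response:
            target.write_bytes(response.read())
        saved.append(target)
    return saved
```

The fallback extension is `.webp` because that is the documented default for `output_format`.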

Example Execution Logs
Using seed: 4697
  0%|          | 0/25 [00:00<?, ?it/s]
 20%|██        | 5/25 [00:00<00:00, 40.13it/s]
 40%|████      | 10/25 [00:00<00:00, 40.19it/s]
 60%|██████    | 15/25 [00:00<00:00, 40.05it/s]
 80%|████████  | 20/25 [00:00<00:00, 39.66it/s]
100%|██████████| 25/25 [00:00<00:00, 39.93it/s]
100%|██████████| 25/25 [00:00<00:00, 39.90it/s]
The following part of your input was truncated because CLIP can only handle sequences up to 77 tokens: ['gross proportions, malformed limbs, missing arms, missing legs, extra arms, extra legs, fused fingers, too many fingers, long neck, username, watermark, signature']
  0%|          | 0/25 [00:00<?, ?it/s]
 16%|█▌        | 4/25 [00:00<00:00, 37.91it/s]
 36%|███▌      | 9/25 [00:00<00:00, 39.60it/s]
 56%|█████▌    | 14/25 [00:00<00:00, 39.87it/s]
 76%|███████▌  | 19/25 [00:00<00:00, 40.26it/s]
 96%|█████████▌| 24/25 [00:00<00:00, 40.42it/s]
100%|██████████| 25/25 [00:00<00:00, 40.12it/s]
  0%|          | 0/75 [00:00<?, ?it/s]
  1%|▏         | 1/75 [00:00<00:14,  5.06it/s]
  4%|▍         | 3/75 [00:00<00:08,  8.75it/s]
  7%|▋         | 5/75 [00:00<00:06, 10.26it/s]
  9%|▉         | 7/75 [00:00<00:06, 11.00it/s]
 12%|█▏        | 9/75 [00:00<00:05, 11.45it/s]
 15%|█▍        | 11/75 [00:01<00:05, 11.72it/s]
 17%|█▋        | 13/75 [00:01<00:05, 11.89it/s]
 20%|██        | 15/75 [00:01<00:04, 12.01it/s]
 23%|██▎       | 17/75 [00:01<00:04, 12.09it/s]
 25%|██▌       | 19/75 [00:01<00:04, 12.12it/s]
 28%|██▊       | 21/75 [00:01<00:04, 12.16it/s]
 31%|███       | 23/75 [00:02<00:04, 12.19it/s]
 33%|███▎      | 25/75 [00:02<00:04, 12.21it/s]
 36%|███▌      | 27/75 [00:02<00:03, 12.20it/s]
 39%|███▊      | 29/75 [00:02<00:03, 12.21it/s]
 41%|████▏     | 31/75 [00:02<00:03, 12.12it/s]
 44%|████▍     | 33/75 [00:02<00:03, 12.11it/s]
 47%|████▋     | 35/75 [00:02<00:03, 12.04it/s]
 49%|████▉     | 37/75 [00:03<00:03, 11.90it/s]
 52%|█████▏    | 39/75 [00:03<00:03, 11.98it/s]
 55%|█████▍    | 41/75 [00:03<00:02, 12.04it/s]
 57%|█████▋    | 43/75 [00:03<00:02, 12.08it/s]
 60%|██████    | 45/75 [00:03<00:02, 12.12it/s]
 63%|██████▎   | 47/75 [00:03<00:02, 12.09it/s]
 65%|██████▌   | 49/75 [00:04<00:02, 12.06it/s]
 68%|██████▊   | 51/75 [00:04<00:01, 12.10it/s]
 71%|███████   | 53/75 [00:04<00:01, 12.14it/s]
 73%|███████▎  | 55/75 [00:04<00:01, 12.14it/s]
 76%|███████▌  | 57/75 [00:04<00:01, 12.16it/s]
 79%|███████▊  | 59/75 [00:04<00:01, 12.18it/s]
 81%|████████▏ | 61/75 [00:05<00:01, 12.11it/s]
 84%|████████▍ | 63/75 [00:05<00:00, 12.15it/s]
 87%|████████▋ | 65/75 [00:05<00:00, 12.14it/s]
 89%|████████▉ | 67/75 [00:05<00:00, 12.14it/s]
 92%|█████████▏| 69/75 [00:05<00:00, 12.16it/s]
 95%|█████████▍| 71/75 [00:05<00:00, 12.19it/s]
 97%|█████████▋| 73/75 [00:06<00:00, 12.17it/s]
100%|██████████| 75/75 [00:06<00:00, 12.18it/s]
100%|██████████| 75/75 [00:06<00:00, 11.92it/s]
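The progress bars above give a rough breakdown of where the prediction time goes. As a sketch, each stage's duration can be estimated as steps divided by the average rate on its final progress line. The two 25-step bars match the `num_inference_steps_prior` default and the 75-step bar matches `num_inference_steps`, although the log itself does not label the stages; `stage_seconds` is a hypothetical helper.

```python
import re

# Final progress line of each stage, copied from the log above.
FINAL_LINES = [
    "100%|██████████| 25/25 [00:00<00:00, 39.90it/s]",
    "100%|██████████| 25/25 [00:00<00:00, 40.12it/s]",
    "100%|██████████| 75/75 [00:06<00:00, 11.92it/s]",
]

# Captures completed steps, total steps, and the average iterations per second.
PATTERN = re.compile(r"(\d+)/(\d+) \[.*?, *([\d.]+)it/s\]")


def stage_seconds(line):
    """Estimate a stage's duration as completed steps divided by the average rate."""
    done, _total, rate = PATTERN.search(line).groups()
    return int(done) / float(rate)


durations = [stage_seconds(line) for line in FINAL_LINES]
for line, seconds in zip(FINAL_LINES, durations):
    print(f"{seconds:5.2f}s  {line.split('|')[-1].strip()}")
print(f"{sum(durations):5.2f}s  total across progress bars")
```

The three stages add up to about 7.5 seconds, so the remainder of the 9.03s prediction time is spent outside the denoising loops.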
Version Details
Version ID
ad9d7879fbffa2874e1d909d1d37d9bc682889cc65b31f7bb00d2362619f194a
Version Created
April 10, 2024