afiaka87/glid-3-xl 🖼️🔢❓📝✓ → 🖼️
About
CompVis `latent-diffusion text2im` finetuned for inpainting.
Example Output
Prompt:
"pikachu rendered in pixar"
Output
Performance Metrics
- Prediction Time: 10.70s
- Total Time: 10.88s
All Input Parameters
{
  "seed": -1,
  "steps": 100,
  "width": 256,
  "height": 256,
  "prompt": "pikachu rendered in pixar",
  "batch_size": 1,
  "guidance_scale": 5,
  "aesthetic_rating": 9,
  "aesthetic_weight": 0.5
}
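As a rough sketch of how these parameters could be submitted programmatically, the snippet below wraps them in a request for Replicate's HTTP predictions endpoint. The endpoint URL, the `Bearer` authorization header, and the `{"version": ..., "input": ...}` body shape are assumptions about the public Replicate API rather than anything stated on this page, and `build_prediction_request` and `YOUR_API_TOKEN` are hypothetical names. The version ID is the one listed under Version Details. The request is only constructed, not sent, so the sketch runs offline.

```python
import json
import urllib.request

# Assumed endpoint of the Replicate HTTP API (not stated on this page).
API_URL = "https://api.replicate.com/v1/predictions"
# Version ID as listed under "Version Details".
VERSION_ID = "d74db2a276065cf0d42fe9e2917219112ddf8c698f5d9acbe1cc353b58097dab"


def build_prediction_request(inputs: dict, api_token: str) -> urllib.request.Request:
    """Hypothetical helper: wrap the input parameters in a POST request."""
    body = json.dumps({"version": VERSION_ID, "input": inputs}).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_token}",  # assumed auth scheme
            "Content-Type": "application/json",
        },
    )


# The example inputs shown above, unchanged.
example_inputs = {
    "seed": -1,
    "steps": 100,
    "width": 256,
    "height": 256,
    "prompt": "pikachu rendered in pixar",
    "batch_size": 1,
    "guidance_scale": 5,
    "aesthetic_rating": 9,
    "aesthetic_weight": 0.5,
}

request = build_prediction_request(example_inputs, api_token="YOUR_API_TOKEN")
# Sending is deliberately left out; urllib.request.urlopen(request) would submit it.
```

Keeping request construction separate from sending makes the payload easy to inspect before any API credit is spent.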
Input Parameters
- mask
- A mask image for inpainting an init_image. White pixels are kept; black pixels are discarded. The mask is resized to width = image width / 8 and height = image height / 8.
- seed
- Seed for random number generator. If -1, a random seed will be chosen.
- steps
- Number of diffusion steps to run. Due to PLMS sampling, using more than 100 steps is unnecessary and may simply produce the exact same output.
- width
- Target width
- height
- Target height
- prompt
- Your text prompt.
- negative
- (optional) Text to steer away from. The model's prediction for this text is subtracted from its prediction for the target text.
- batch_size
- Batch size. (higher = slower)
- init_image
- (optional) Initial image to use for the model's prediction. If provided alongside a mask, the image will be inpainted instead.
- guidance_scale
- Classifier-free guidance scale. Higher values push the output closer to the caption, with diminishing returns. Try values between 1.0 and 40.0. In general, values above 5.0 introduce some artifacts.
- aesthetic_rating
- Aesthetic rating (1-9). Selects which aesthetic embedding to use.
- aesthetic_weight
- Aesthetic weight (0-1). How much to guide towards the aesthetic embed vs the prompt embed.
- init_skip_fraction
- Fraction of sampling steps to skip when using an init image. Defaults to 0.0 if init_image is not specified and 0.5 if init_image is specified.
- intermediate_outputs
- Whether to return intermediate outputs. Enable to visualize the diffusion process and/or debug the model. May slow down inference.
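A few of the rules in the parameter list above can be written out as a minimal sketch. All function names here are hypothetical and are not part of the model's code. The mask size, the `init_skip_fraction` defaults, and the rating and weight ranges come straight from the descriptions; the guidance formula is the standard classifier-free guidance combination and is only an assumption about how this model applies `guidance_scale` (with a `negative` prompt presumably taking the place of the unconditional prediction).

```python
from typing import List, Sequence, Tuple


def latent_mask_size(width: int, height: int) -> Tuple[int, int]:
    """Mask resolution after resizing to 1/8 of the image size."""
    return width // 8, height // 8


def default_init_skip_fraction(has_init_image: bool) -> float:
    """Documented default: 0.5 with an init image, 0.0 without."""
    return 0.5 if has_init_image else 0.0


def check_aesthetic(rating: int, weight: float) -> None:
    """Reject values outside the documented ranges (1-9 and 0-1)."""
    if not 1 <= rating <= 9:
        raise ValueError("aesthetic_rating must be between 1 and 9")
    if not 0.0 <= weight <= 1.0:
        raise ValueError("aesthetic_weight must be between 0 and 1")


def guided_prediction(
    cond: Sequence[float], uncond: Sequence[float], guidance_scale: float
) -> List[float]:
    """Standard classifier-free guidance (assumed, not confirmed for this model).

    Moves from the unconditional prediction toward the conditional one,
    scaled by guidance_scale; a scale of 1.0 returns the conditional values.
    """
    return [u + guidance_scale * (c - u) for c, u in zip(cond, uncond)]


mask_w, mask_h = latent_mask_size(256, 256)  # the example's 256x256 target
check_aesthetic(rating=9, weight=0.5)        # the example's aesthetic settings
```

Under this formula, raising `guidance_scale` amplifies the difference between the two predictions, which is consistent with the note that large values tend to introduce artifacts.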
Output Schema
Output
Example Execution Logs
Using seed 2882092835
Running simulation for pikachu rendered in pixar
Encoding text embeddings with pikachu rendered in pixar dimensions
Using aesthetic embedding 9 with weight 0.5
Running diffusion...
0%| | 0/100 [00:00<?, ?it/s]
1%| | 1/100 [00:00<00:41, 2.37it/s]
2%|▏ | 2/100 [00:00<00:37, 2.64it/s]
3%|▎ | 3/100 [00:01<00:35, 2.74it/s]
5%|▌ | 5/100 [00:01<00:19, 4.79it/s]
7%|▋ | 7/100 [00:01<00:14, 6.47it/s]
9%|▉ | 9/100 [00:01<00:11, 7.79it/s]
11%|█ | 11/100 [00:01<00:10, 8.80it/s]
13%|█▎ | 13/100 [00:01<00:09, 9.56it/s]
15%|█▌ | 15/100 [00:02<00:08, 10.08it/s]
17%|█▋ | 17/100 [00:02<00:07, 10.42it/s]
19%|█▉ | 19/100 [00:02<00:07, 10.70it/s]
21%|██ | 21/100 [00:02<00:07, 10.84it/s]
23%|██▎ | 23/100 [00:02<00:06, 11.04it/s]
25%|██▌ | 25/100 [00:03<00:06, 11.11it/s]
27%|██▋ | 27/100 [00:03<00:06, 11.19it/s]
29%|██▉ | 29/100 [00:03<00:06, 11.25it/s]
31%|███ | 31/100 [00:03<00:06, 11.29it/s]
33%|███▎ | 33/100 [00:03<00:05, 11.31it/s]
35%|███▌ | 35/100 [00:03<00:05, 11.36it/s]
37%|███▋ | 37/100 [00:04<00:05, 11.34it/s]
39%|███▉ | 39/100 [00:04<00:05, 11.33it/s]
41%|████ | 41/100 [00:04<00:05, 11.32it/s]
43%|████▎ | 43/100 [00:04<00:05, 11.34it/s]
45%|████▌ | 45/100 [00:04<00:04, 11.32it/s]
47%|████▋ | 47/100 [00:04<00:04, 11.38it/s]
49%|████▉ | 49/100 [00:05<00:04, 11.40it/s]
51%|█████ | 51/100 [00:05<00:04, 11.36it/s]
53%|█████▎ | 53/100 [00:05<00:04, 11.37it/s]
55%|█████▌ | 55/100 [00:05<00:03, 11.39it/s]
57%|█████▋ | 57/100 [00:05<00:03, 11.37it/s]
59%|█████▉ | 59/100 [00:06<00:03, 11.36it/s]
61%|██████ | 61/100 [00:06<00:03, 11.39it/s]
63%|██████▎ | 63/100 [00:06<00:03, 11.36it/s]
65%|██████▌ | 65/100 [00:06<00:03, 11.38it/s]
67%|██████▋ | 67/100 [00:06<00:02, 11.36it/s]
69%|██████▉ | 69/100 [00:06<00:02, 11.35it/s]
71%|███████ | 71/100 [00:07<00:02, 11.33it/s]
73%|███████▎ | 73/100 [00:07<00:02, 11.32it/s]
75%|███████▌ | 75/100 [00:07<00:02, 11.35it/s]
77%|███████▋ | 77/100 [00:07<00:02, 11.31it/s]
79%|███████▉ | 79/100 [00:07<00:01, 11.30it/s]
81%|████████ | 81/100 [00:07<00:01, 11.30it/s]
83%|████████▎ | 83/100 [00:08<00:01, 11.30it/s]
85%|████████▌ | 85/100 [00:08<00:01, 11.28it/s]
87%|████████▋ | 87/100 [00:08<00:01, 11.31it/s]
89%|████████▉ | 89/100 [00:08<00:00, 11.28it/s]
91%|█████████ | 91/100 [00:08<00:00, 11.30it/s]
93%|█████████▎| 93/100 [00:09<00:00, 11.31it/s]
95%|█████████▌| 95/100 [00:09<00:00, 11.32it/s]
97%|█████████▋| 97/100 [00:09<00:00, 11.33it/s]
99%|█████████▉| 99/100 [00:09<00:00, 11.30it/s]
100%|██████████| 100/100 [00:09<00:00, 10.34it/s]
Saving final sample/s
Version Details
- Version ID
- d74db2a276065cf0d42fe9e2917219112ddf8c698f5d9acbe1cc353b58097dab
- Version Created
- August 5, 2022