stability-ai/stable-diffusion 🔢❓📝 → 🖼️
About
A latent text-to-image diffusion model capable of generating photo-realistic images given any text input
Example Output
Prompt:
"an astronaut riding a horse on mars, hd, dramatic lighting"
Output
Performance Metrics
1.89s
Prediction Time
2.04s
Total Time
All Input Parameters
{
"prompt": "an astronaut riding a horse on mars, hd, dramatic lighting",
"scheduler": "K_EULER",
"num_outputs": 1,
"guidance_scale": 7.5,
"image_dimensions": "512x512",
"num_inference_steps": 50
}
Input Parameters
- seed
- Random seed. Leave blank to randomize the seed
- width
- Width of generated image in pixels. Needs to be a multiple of 64
- height
- Height of generated image in pixels. Needs to be a multiple of 64
- prompt
- Input prompt
- scheduler
- Choose a scheduler.
- num_outputs
- Number of images to generate.
- guidance_scale
- Scale for classifier-free guidance
- negative_prompt
- Specify things to not see in the output
- num_inference_steps
- Number of denoising steps
Output Schema
Output
Example Execution Logs
Using seed: 52443 input_shape: torch.Size([1, 77]) 0%| | 0/50 [00:00<?, ?it/s] 10%|█ | 5/50 [00:00<00:01, 41.06it/s] 20%|██ | 10/50 [00:00<00:00, 41.32it/s] 30%|███ | 15/50 [00:00<00:00, 43.77it/s] 42%|████▏ | 21/50 [00:00<00:00, 46.80it/s] 54%|█████▍ | 27/50 [00:00<00:00, 48.53it/s] 66%|██████▌ | 33/50 [00:00<00:00, 49.63it/s] 78%|███████▊ | 39/50 [00:00<00:00, 50.45it/s] 90%|█████████ | 45/50 [00:00<00:00, 50.90it/s] 100%|██████████| 50/50 [00:01<00:00, 48.51it/s]
Version Details
- Version ID
ac732df83cea7fff18b8472768c88ad041fa750ff7682a21affe81863cbe77e4- Version Created
- July 10, 2023