lucataco/open-dalle-v1.1 🖼️🔢📝❓✓ → 🖼️

▶️ 132.7K runs 📅 Dec 2023 ⚙️ Cog 0.8.6 🔗 GitHub ⚖️ License
image-inpainting image-to-image text-to-image

About

A unique fusion that showcases exceptional prompt adherence and semantic understanding, it seems to be a step above base SDXL and a step closer to DALLE-3 in terms of prompt comprehension

Example Output

Prompt:

"black fluffy gorgeous dangerous cat animal creature, large orange eyes, big fluffy ears, piercing gaze, full moon, dark ambiance, best quality, extremely detailed"

Output

Example output

Performance Metrics

14.36s Prediction Time
127.05s Total Time
All Input Parameters
{
  "width": 1024,
  "height": 1024,
  "prompt": "black fluffy gorgeous dangerous cat animal creature, large orange eyes, big fluffy ears, piercing gaze, full moon, dark ambiance, best quality, extremely detailed",
  "scheduler": "KarrasDPM",
  "num_outputs": 1,
  "guidance_scale": 7.5,
  "apply_watermark": true,
  "negative_prompt": "worst quality, low quality",
  "prompt_strength": 0.8,
  "num_inference_steps": 60
}
Input Parameters
mask Type: string
Input mask for inpaint mode. Black areas will be preserved, white areas will be inpainted.
seed Type: integer
Random seed. Leave blank to randomize the seed
image Type: string
Input image for img2img or inpaint mode
width Type: integerDefault: 1024
Width of output image
height Type: integerDefault: 1024
Height of output image
prompt Type: stringDefault: black fluffy gorgeous dangerous cat animal creature, large orange eyes, big fluffy ears, piercing gaze, full moon, dark ambiance, best quality, extremely detailed
Input prompt
scheduler Default: KarrasDPM
scheduler
num_outputs Type: integerDefault: 1Range: 1 - 4
Number of images to output.
guidance_scale Type: numberDefault: 7.5Range: 1 - 50
Scale for classifier-free guidance
apply_watermark Type: booleanDefault: true
Applies a watermark to enable determining if an image is generated in downstream applications. If you have other provisions for generating or deploying images safely, you can use this to disable watermarking.
negative_prompt Type: stringDefault: worst quality, low quality
Negative Input prompt
prompt_strength Type: numberDefault: 0.8Range: 0 - 1
Prompt strength when using img2img / inpaint. 1.0 corresponds to full destruction of information in image
num_inference_steps Type: integerDefault: 60Range: 1 - 100
Number of denoising steps 60-70 for best detail, 35 for fast
disable_safety_checker Type: booleanDefault: false
Disable safety checker for generated images. This feature is only available through the API. See https://replicate.com/docs/how-does-replicate-work#safety
Output Schema

Output

Type: arrayItems Type: stringItems Format: uri

Example Execution Logs
Using seed: 2034103420
Prompt: black fluffy gorgeous dangerous cat animal creature, large orange eyes, big fluffy ears, piercing gaze, full moon, dark ambiance, best quality, extremely detailed
txt2img mode
  0%|          | 0/60 [00:00<?, ?it/s]
  2%|▏         | 1/60 [00:00<00:18,  3.14it/s]
  5%|▌         | 3/60 [00:00<00:11,  5.17it/s]
  7%|▋         | 4/60 [00:00<00:11,  5.09it/s]
  8%|▊         | 5/60 [00:01<00:10,  5.03it/s]
 10%|█         | 6/60 [00:01<00:10,  4.99it/s]
 12%|█▏        | 7/60 [00:01<00:10,  4.96it/s]
 13%|█▎        | 8/60 [00:01<00:10,  4.94it/s]
 15%|█▌        | 9/60 [00:01<00:10,  4.93it/s]
 17%|█▋        | 10/60 [00:02<00:10,  4.92it/s]
 18%|█▊        | 11/60 [00:02<00:09,  4.91it/s]
 20%|██        | 12/60 [00:02<00:09,  4.91it/s]
 22%|██▏       | 13/60 [00:02<00:09,  4.90it/s]
 23%|██▎       | 14/60 [00:02<00:09,  4.90it/s]
 25%|██▌       | 15/60 [00:03<00:09,  4.90it/s]
 27%|██▋       | 16/60 [00:03<00:08,  4.90it/s]
 28%|██▊       | 17/60 [00:03<00:08,  4.83it/s]
 30%|███       | 18/60 [00:03<00:08,  4.83it/s]
 32%|███▏      | 19/60 [00:03<00:08,  4.85it/s]
 33%|███▎      | 20/60 [00:04<00:08,  4.88it/s]
 35%|███▌      | 21/60 [00:04<00:07,  4.90it/s]
 37%|███▋      | 22/60 [00:04<00:07,  4.90it/s]
 38%|███▊      | 23/60 [00:04<00:07,  4.91it/s]
 40%|████      | 24/60 [00:04<00:07,  4.91it/s]
 42%|████▏     | 25/60 [00:05<00:07,  4.92it/s]
 43%|████▎     | 26/60 [00:05<00:06,  4.92it/s]
 45%|████▌     | 27/60 [00:05<00:06,  4.92it/s]
 47%|████▋     | 28/60 [00:05<00:06,  4.92it/s]
 48%|████▊     | 29/60 [00:05<00:06,  4.92it/s]
 50%|█████     | 30/60 [00:06<00:06,  4.92it/s]
 52%|█████▏    | 31/60 [00:06<00:05,  4.92it/s]
 53%|█████▎    | 32/60 [00:06<00:05,  4.92it/s]
 55%|█████▌    | 33/60 [00:06<00:05,  4.92it/s]
 57%|█████▋    | 34/60 [00:06<00:05,  4.92it/s]
 58%|█████▊    | 35/60 [00:07<00:05,  4.92it/s]
 60%|██████    | 36/60 [00:07<00:04,  4.92it/s]
 62%|██████▏   | 37/60 [00:07<00:04,  4.92it/s]
 63%|██████▎   | 38/60 [00:07<00:04,  4.92it/s]
 65%|██████▌   | 39/60 [00:07<00:04,  4.92it/s]
 67%|██████▋   | 40/60 [00:08<00:04,  4.92it/s]
 68%|██████▊   | 41/60 [00:08<00:03,  4.91it/s]
 70%|███████   | 42/60 [00:08<00:03,  4.91it/s]
 72%|███████▏  | 43/60 [00:08<00:03,  4.91it/s]
 73%|███████▎  | 44/60 [00:08<00:03,  4.91it/s]
 75%|███████▌  | 45/60 [00:09<00:03,  4.91it/s]
 77%|███████▋  | 46/60 [00:09<00:02,  4.91it/s]
 78%|███████▊  | 47/60 [00:09<00:02,  4.91it/s]
 80%|████████  | 48/60 [00:09<00:02,  4.91it/s]
 82%|████████▏ | 49/60 [00:09<00:02,  4.90it/s]
 83%|████████▎ | 50/60 [00:10<00:02,  4.91it/s]
 85%|████████▌ | 51/60 [00:10<00:01,  4.91it/s]
 87%|████████▋ | 52/60 [00:10<00:01,  4.91it/s]
 88%|████████▊ | 53/60 [00:10<00:01,  4.90it/s]
 90%|█████████ | 54/60 [00:11<00:01,  4.90it/s]
 92%|█████████▏| 55/60 [00:11<00:01,  4.90it/s]
 93%|█████████▎| 56/60 [00:11<00:00,  4.90it/s]
 95%|█████████▌| 57/60 [00:11<00:00,  4.91it/s]
 97%|█████████▋| 58/60 [00:11<00:00,  4.90it/s]
 98%|█████████▊| 59/60 [00:12<00:00,  4.90it/s]
100%|██████████| 60/60 [00:12<00:00,  4.91it/s]
100%|██████████| 60/60 [00:12<00:00,  4.90it/s]
Version Details
Version ID
1c7d4c8dec39c7306df7794b28419078cb9d18b9213ab1c21fdc46a1deca0144
Version Created
December 27, 2023
Run on Replicate →