charlesmccarthy/animagine-xl 🖼️🔢📝❓✓ → 🖼️

▶️ 9.3K runs 📅 Dec 2023 ⚙️ Cog 0.8.6
anime image-to-image text-to-image

About

Animagine XL 2.0 is an advanced latent text-to-image diffusion model designed to create high-resolution, detailed anime images.

Example Output

Prompt:

"black fluffy gorgeous dangerous cat animal creature, large orange eyes, big fluffy ears, piercing gaze, full moon, dark ambiance, best quality, extremely detailed"

Output

Example output

Performance Metrics

13.66s Prediction Time
252.21s Total Time
All Input Parameters
{
  "width": 1024,
  "height": 1024,
  "prompt": "black fluffy gorgeous dangerous cat animal creature, large orange eyes, big fluffy ears, piercing gaze, full moon, dark ambiance, best quality, extremely detailed",
  "scheduler": "KarrasDPM",
  "num_outputs": 1,
  "guidance_scale": 7.5,
  "apply_watermark": true,
  "negative_prompt": "worst quality, low quality",
  "prompt_strength": 0.8,
  "num_inference_steps": 60
}
Input Parameters
mask Type: string
Input mask for inpaint mode. Black areas will be preserved, white areas will be inpainted.
seed Type: integer
Random seed. Leave blank to randomize the seed
image Type: string
Input image for img2img or inpaint mode
width Type: integerDefault: 1024
Width of output image
height Type: integerDefault: 1024
Height of output image
prompt Type: stringDefault: black fluffy gorgeous dangerous cat animal creature, large orange eyes, big fluffy ears, piercing gaze, full moon, dark ambiance, best quality, extremely detailed
Input prompt
scheduler Default: KarrasDPM
scheduler
num_outputs Type: integerDefault: 1Range: 1 - 4
Number of images to output.
guidance_scale Type: numberDefault: 7.5Range: 1 - 50
Scale for classifier-free guidance
apply_watermark Type: booleanDefault: true
Applies a watermark to enable determining if an image is generated in downstream applications. If you have other provisions for generating or deploying images safely, you can use this to disable watermarking.
negative_prompt Type: stringDefault: worst quality, low quality
Negative Input prompt
prompt_strength Type: numberDefault: 0.8Range: 0 - 1
Prompt strength when using img2img / inpaint. 1.0 corresponds to full destruction of information in image
num_inference_steps Type: integerDefault: 60Range: 1 - 100
Number of denoising steps 60-70 for best detail, 35 for fast
disable_safety_checker Type: booleanDefault: false
Disable safety checker for generated images. This feature is only available through the API. See https://replicate.com/docs/how-does-replicate-work#safety
Output Schema

Output

Type: arrayItems Type: stringItems Format: uri

Example Execution Logs
Using seed: 4117415544
Prompt: black fluffy gorgeous dangerous cat animal creature, large orange eyes, big fluffy ears, piercing gaze, full moon, dark ambiance, best quality, extremely detailed
txt2img mode
  0%|          | 0/60 [00:00<?, ?it/s]
  2%|▏         | 1/60 [00:00<00:18,  3.15it/s]
  5%|▌         | 3/60 [00:00<00:10,  5.25it/s]
  7%|▋         | 4/60 [00:00<00:10,  5.17it/s]
  8%|▊         | 5/60 [00:01<00:10,  5.12it/s]
 10%|█         | 6/60 [00:01<00:10,  5.08it/s]
 12%|█▏        | 7/60 [00:01<00:10,  5.04it/s]
 13%|█▎        | 8/60 [00:01<00:10,  5.03it/s]
 15%|█▌        | 9/60 [00:01<00:10,  5.02it/s]
 17%|█▋        | 10/60 [00:02<00:09,  5.02it/s]
 18%|█▊        | 11/60 [00:02<00:09,  5.01it/s]
 20%|██        | 12/60 [00:02<00:09,  5.00it/s]
 22%|██▏       | 13/60 [00:02<00:09,  4.99it/s]
 23%|██▎       | 14/60 [00:02<00:09,  4.99it/s]
 25%|██▌       | 15/60 [00:03<00:09,  4.99it/s]
 27%|██▋       | 16/60 [00:03<00:08,  4.99it/s]
 28%|██▊       | 17/60 [00:03<00:08,  4.98it/s]
 30%|███       | 18/60 [00:03<00:08,  4.99it/s]
 32%|███▏      | 19/60 [00:03<00:08,  4.99it/s]
 33%|███▎      | 20/60 [00:04<00:08,  4.99it/s]
 35%|███▌      | 21/60 [00:04<00:07,  4.98it/s]
 37%|███▋      | 22/60 [00:04<00:07,  4.98it/s]
 38%|███▊      | 23/60 [00:04<00:07,  4.98it/s]
 40%|████      | 24/60 [00:04<00:07,  4.98it/s]
 42%|████▏     | 25/60 [00:05<00:07,  4.98it/s]
 43%|████▎     | 26/60 [00:05<00:06,  4.97it/s]
 45%|████▌     | 27/60 [00:05<00:06,  4.97it/s]
 47%|████▋     | 28/60 [00:05<00:06,  4.97it/s]
 48%|████▊     | 29/60 [00:05<00:06,  4.97it/s]
 50%|█████     | 30/60 [00:06<00:06,  4.97it/s]
 52%|█████▏    | 31/60 [00:06<00:05,  4.96it/s]
 53%|█████▎    | 32/60 [00:06<00:05,  4.96it/s]
 55%|█████▌    | 33/60 [00:06<00:05,  4.96it/s]
 57%|█████▋    | 34/60 [00:06<00:05,  4.96it/s]
 58%|█████▊    | 35/60 [00:07<00:05,  4.96it/s]
 60%|██████    | 36/60 [00:07<00:04,  4.96it/s]
 62%|██████▏   | 37/60 [00:07<00:04,  4.95it/s]
 63%|██████▎   | 38/60 [00:07<00:04,  4.95it/s]
 65%|██████▌   | 39/60 [00:07<00:04,  4.96it/s]
 67%|██████▋   | 40/60 [00:08<00:04,  4.96it/s]
 68%|██████▊   | 41/60 [00:08<00:03,  4.96it/s]
 70%|███████   | 42/60 [00:08<00:03,  4.95it/s]
 72%|███████▏  | 43/60 [00:08<00:03,  4.95it/s]
 73%|███████▎  | 44/60 [00:08<00:03,  4.95it/s]
 75%|███████▌  | 45/60 [00:09<00:03,  4.95it/s]
 77%|███████▋  | 46/60 [00:09<00:02,  4.95it/s]
 78%|███████▊  | 47/60 [00:09<00:02,  4.95it/s]
 80%|████████  | 48/60 [00:09<00:02,  4.95it/s]
 82%|████████▏ | 49/60 [00:09<00:02,  4.95it/s]
 83%|████████▎ | 50/60 [00:10<00:02,  4.95it/s]
 85%|████████▌ | 51/60 [00:10<00:01,  4.95it/s]
 87%|████████▋ | 52/60 [00:10<00:01,  4.95it/s]
 88%|████████▊ | 53/60 [00:10<00:01,  4.94it/s]
 90%|█████████ | 54/60 [00:10<00:01,  4.94it/s]
 92%|█████████▏| 55/60 [00:11<00:01,  4.94it/s]
 93%|█████████▎| 56/60 [00:11<00:00,  4.94it/s]
 95%|█████████▌| 57/60 [00:11<00:00,  4.94it/s]
 97%|█████████▋| 58/60 [00:11<00:00,  4.94it/s]
 98%|█████████▊| 59/60 [00:11<00:00,  4.94it/s]
100%|██████████| 60/60 [00:12<00:00,  4.94it/s]
100%|██████████| 60/60 [00:12<00:00,  4.96it/s]
Version Details
Version ID
db29f76d40ecf86335295ca5b24ed95e6b1eca4e29239c47cfefa68f408cbf5e
Version Created
December 27, 2023
Run on Replicate →