philz1337x/clarity-upscaler 🖼️🔢📝❓✓ → 🖼️

▶️ 21.8M runs 📅 Mar 2024 ⚙️ Cog 0.8.0-beta11 🔗 GitHub 📄 Paper ⚖️ License
image-restoration image-upscaling

About

High resolution image Upscaler and Enhancer. Use at ClarityAI.co. A free Magnific alternative. Twitter/X: @philz1337x

Example Output

Prompt:

"masterpiece, best quality, highres, lora:more_details:0.5 lora:SDXLrender_v2.0:1"

Output

Example output

Performance Metrics

12.81s Prediction Time
12.83s Total Time
All Input Parameters
{
  "seed": 1337,
  "image": "https://replicate.delivery/pbxt/KiDB5iqtTcxiTI17WASotG1Ei0TNJCztdU6J02pnMYAd8B1X/13_before-4.png",
  "prompt": "masterpiece, best quality, highres, <lora:more_details:0.5> <lora:SDXLrender_v2.0:1>",
  "dynamic": 6,
  "sd_model": "juggernaut_reborn.safetensors [338b85bc4f]",
  "scheduler": "DPM++ 3M SDE Karras",
  "creativity": 0.35,
  "lora_links": "",
  "downscaling": false,
  "resemblance": 0.6,
  "scale_factor": 2,
  "tiling_width": 112,
  "tiling_height": 144,
  "custom_sd_model": "",
  "negative_prompt": "(worst quality, low quality, normal quality:2) JuggernautNegative-neg",
  "num_inference_steps": 18,
  "downscaling_resolution": 768
}
Input Parameters
mask Type: string
Mask image to mark areas that should be preserved during upscaling
seed Type: integerDefault: 1337
Random seed. Leave blank to randomize the seed
image (required) Type: string
input image
prompt Type: stringDefault: masterpiece, best quality, highres, <lora:more_details:0.5> <lora:SDXLrender_v2.0:1>
Prompt
dynamic Type: numberDefault: 6Range: 1 - 50
HDR, try from 3 - 9
handfix Default: disabled
Use clarity to fix hands in the image
pattern Type: booleanDefault: false
Upscale a pattern with seamless tiling
sharpen Type: numberDefault: 0Range: 0 - 10
Sharpen the image after upscaling. The higher the value, the more sharpening is applied. 0 for no sharpening
sd_model Default: juggernaut_reborn.safetensors [338b85bc4f]
Stable Diffusion model checkpoint
scheduler Default: DPM++ 3M SDE Karras
scheduler
creativity Type: numberDefault: 0.35Range: 0 - 1
Creativity, try from 0.3 - 0.9
lora_links Type: stringDefault:
Link to a lora file you want to use in your upscaling. Multiple links possible, seperated by comma
downscaling Type: booleanDefault: false
Downscale the image before upscaling. Can improve quality and speed for images with high resolution but lower quality
resemblance Type: numberDefault: 0.6Range: 0 - 3
Resemblance, try from 0.3 - 1.6
scale_factor Type: numberDefault: 2
Scale factor
tiling_width Default: 112
Fractality, set lower tile width for a high Fractality
output_format Default: png
Format of the output images
tiling_height Default: 144
Fractality, set lower tile height for a high Fractality
custom_sd_model Type: stringDefault:
negative_prompt Type: stringDefault: (worst quality, low quality, normal quality:2) JuggernautNegative-neg
Negative Prompt
num_inference_steps Type: integerDefault: 18Range: 1 - 100
Number of denoising steps
downscaling_resolution Type: integerDefault: 768
Downscaling resolution
Output Schema

Output

Type: arrayItems Type: stringItems Format: uri

Example Execution Logs
Running prediction
[Tiled Diffusion] upscaling image with 4x-UltraSharp...
[Tiled Diffusion] ControlNet found, support is enabled.
2024-04-08 15:34:42,215 - ControlNet - INFO - unit_separate = False, style_align = False
2024-04-08 15:34:42,215 - ControlNet - INFO - Loading model from cache: control_v11f1e_sd15_tile
2024-04-08 15:34:42,234 - ControlNet - INFO - Using preprocessor: tile_resample
2024-04-08 15:34:42,234 - ControlNet - INFO - preprocessor resolution = 1536
2024-04-08 15:34:42,310 - ControlNet - INFO - ControlNet Hooked - Time = 0.0993812084197998
MultiDiffusion hooked into 'DPM++ 3M SDE Karras' sampler, Tile size: 144x112, Tile count: 4, Batch size: 4, Tile batches: 1 (ext: ContrlNet)
[Tiled VAE]: the input size is tiny and unnecessary to tile.
MultiDiffusion Sampling:   0%|          | 0/10 [00:00<?, ?it/s]
  0%|          | 0/7 [00:00<?, ?it/s]
Total progress:   0%|          | 0/7 [00:00<?, ?it/s]
 14%|█▍        | 1/7 [00:00<00:05,  1.10it/s]
Total progress:  29%|██▊       | 2/7 [00:00<00:02,  2.22it/s]
 29%|██▊       | 2/7 [00:01<00:04,  1.11it/s]
Total progress:  43%|████▎     | 3/7 [00:01<00:02,  1.57it/s]
 43%|████▎     | 3/7 [00:02<00:03,  1.11it/s]
Total progress:  57%|█████▋    | 4/7 [00:02<00:02,  1.36it/s]
 57%|█████▋    | 4/7 [00:03<00:02,  1.11it/s]
Total progress:  71%|███████▏  | 5/7 [00:03<00:01,  1.26it/s]
 71%|███████▏  | 5/7 [00:04<00:01,  1.11it/s]
Total progress:  86%|████████▌ | 6/7 [00:04<00:00,  1.21it/s]
 86%|████████▌ | 6/7 [00:05<00:00,  1.11it/s]
100%|██████████| 7/7 [00:06<00:00,  1.11it/s]
100%|██████████| 7/7 [00:06<00:00,  1.11it/s]
Total progress: 100%|██████████| 7/7 [00:05<00:00,  1.18it/s][Tiled VAE]: the input size is tiny and unnecessary to tile.
Total progress: 100%|██████████| 7/7 [00:06<00:00,  1.18it/s]
Total progress: 100%|██████████| 7/7 [00:06<00:00,  1.11it/s]
Prediction took 11.742665767669678 seconds
Version Details
Version ID
dfad41707589d68ecdccd1dfa600d55a208f9310748e44bfe35b4a6291453d5e
Version Created
June 24, 2024
Run on Replicate →