zsyoaoa/invsr 🔢🖼️❓ → 🖼️

▶️ 5.1K runs 📅 Dec 2024 ⚙️ Cog 0.13.6 🔗 GitHub 📄 Paper ⚖️ License
image-upscaling

About

Arbitrary-steps Image Super-resolution via Diffusion Inversion

Example Output

Output

Example output

Performance Metrics

96.04s Prediction Time
113.52s Total Time
All Input Parameters
{
  "seed": 12345,
  "in_path": "https://replicate.delivery/pbxt/M8qhJrY5aD7tG40HumHd3gIIR3LXjMKThkOCNB1oSfGrimcu/32.jpg",
  "num_steps": 1,
  "chopping_size": 128
}
Input Parameters
seed Type: integerDefault: 12345
Random seed. Leave blank to randomize the seed.
in_path (required) Type: string
Input low-quality image
num_steps Default: 1
Number of sampling steps.
chopping_size Default: 128
Chopping resolution
Output Schema

Output

Type: stringFormat: uri

Example Execution Logs
Setting timesteps for inference: [200]
Downloading: "https://huggingface.co/OAOA/InvSR/resolve/main/noise_predictor_sd_turbo_v5.pth" to /src/weights/noise_predictor_sd_turbo_v5.pth
  0%|          | 0.00/129M [00:00<?, ?B/s]
  8%|▊         | 9.88M/129M [00:00<00:01, 102MB/s]
 26%|██▌       | 33.6M/129M [00:00<00:00, 188MB/s]
 40%|███▉      | 51.6M/129M [00:00<00:01, 45.9MB/s]
 48%|████▊     | 62.5M/129M [00:01<00:01, 44.9MB/s]
 55%|█████▍    | 70.8M/129M [00:01<00:01, 44.3MB/s]
 60%|██████    | 77.5M/129M [00:01<00:01, 44.0MB/s]
 65%|██████▍   | 83.4M/129M [00:01<00:01, 42.5MB/s]
 69%|██████▊   | 88.5M/129M [00:01<00:01, 42.3MB/s]
 72%|███████▏  | 93.2M/129M [00:02<00:00, 42.4MB/s]
 76%|███████▌  | 97.9M/129M [00:02<00:00, 43.0MB/s]
 79%|███████▉  | 102M/129M [00:02<00:00, 43.1MB/s] 
 83%|████████▎ | 107M/129M [00:02<00:00, 42.7MB/s]
 86%|████████▌ | 111M/129M [00:02<00:00, 42.8MB/s]
 89%|████████▉ | 115M/129M [00:02<00:00, 43.0MB/s]
 93%|█████████▎| 120M/129M [00:02<00:00, 42.8MB/s]
 96%|█████████▌| 124M/129M [00:02<00:00, 40.1MB/s]
 99%|█████████▉| 128M/129M [00:02<00:00, 40.8MB/s]
100%|██████████| 129M/129M [00:02<00:00, 46.3MB/s]
Fetching 12 files:   0%|          | 0/12 [00:00<?, ?it/s]
Fetching 12 files:  17%|█▋        | 2/12 [00:00<00:02,  4.21it/s]
Fetching 12 files:  33%|███▎      | 4/12 [00:32<01:17,  9.65s/it]
Fetching 12 files:  83%|████████▎ | 10/12 [01:23<00:17,  8.72s/it]
Fetching 12 files: 100%|██████████| 12/12 [01:23<00:00,  6.92s/it]
Loading pipeline components...:   0%|          | 0/5 [00:00<?, ?it/s]
Loading pipeline components...:  20%|██        | 1/5 [00:00<00:03,  1.19it/s]
Loading pipeline components...:  40%|████      | 2/5 [00:01<00:03,  1.01s/it]
Loading pipeline components...:  80%|████████  | 4/5 [00:02<00:00,  2.43it/s]
Loading pipeline components...: 100%|██████████| 5/5 [00:02<00:00,  2.96it/s]
Loading pipeline components...: 100%|██████████| 5/5 [00:02<00:00,  2.22it/s]
You have disabled the safety checker for <class 'diffusers.pipelines.stable_diffusion.pipeline_stable_diffusion.StableDiffusionPipeline'> by passing `safety_checker=None`. Ensure that you abide to the conditions of the Stable Diffusion license and do not expose unfiltered results in services or applications open to the public. Both the diffusers team and Hugging Face strongly recommend to keep the safety filter enabled in all public facing circumstances, disabling it only for use-cases that involve analyzing network behavior or auditing its results. For more information, please have a look at https://github.com/huggingface/diffusers/pull/254 .
You have disabled the safety checker for <class 'diffusers.pipelines.stable_diffusion.pipeline_stable_diffusion_inversion_sr.StableDiffusionInvEnhancePipeline'> by passing `safety_checker=None`. Ensure that you abide to the conditions of the Stable Diffusion license and do not expose unfiltered results in services or applications open to the public. Both the diffusers team and Hugging Face strongly recommend to keep the safety filter enabled in all public facing circumstances, disabling it only for use-cases that involve analyzing network behavior or auditing its results. For more information, please have a look at https://github.com/huggingface/diffusers/pull/254 .
Activating gradient checkpoing for vae...
Loading started model from ./weights/noise_predictor_sd_turbo_v5.pth...
/src/sampler_invsr.py:101: FutureWarning: You are using `torch.load` with `weights_only=False` (the current default value), which uses the default pickle module implicitly. It is possible to construct malicious pickle data which will execute arbitrary code during unpickling (See https://github.com/pytorch/pytorch/blob/main/SECURITY.md#untrusted-models for more details). In a future release, the default value for `weights_only` will be flipped to `True`. This limits the functions that could be executed during unpickling. Arbitrary objects will no longer be allowed to be loaded via this mode unless they are explicitly allowlisted by the user via `torch.serialization.add_safe_globals`. We recommend you start setting `weights_only=True` for any use case where you don't have full control of the loaded file. Please open an issue on GitHub for any issues related to this experimental feature.
state = torch.load(ckpt_path, map_location=f"cuda")
Loading Done
/root/.pyenv/versions/3.10.15/lib/python3.10/site-packages/diffusers/configuration_utils.py:140: FutureWarning: Accessing config attribute `vae_latent_channels` directly via 'VaeImageProcessor' object attribute is deprecated. Please access 'vae_latent_channels' over 'VaeImageProcessor's config object instead, e.g. 'scheduler.config.vae_latent_channels'.
deprecate("direct config name access", "1.0.0", deprecation_message, standard_warn=False)
  0%|          | 0/1 [00:00<?, ?it/s]
100%|██████████| 1/1 [00:00<00:00,  4.53it/s]
100%|██████████| 1/1 [00:00<00:00,  4.52it/s]
  0%|          | 0/1 [00:00<?, ?it/s]
100%|██████████| 1/1 [00:00<00:00, 21.85it/s]
  0%|          | 0/1 [00:00<?, ?it/s]
100%|██████████| 1/1 [00:00<00:00, 10.38it/s]
Processing done, enjoy the results in invsr_output
Version Details
Version ID
37eebabfb6cdc4be2892b884b96b361d6fedc9f6a934d2fa3c1a2f85f004b0f0
Version Created
December 14, 2024
Run on Replicate →