nateraw/audio-super-resolution 🔢🖼️ → 🖼️

▶️ 62.6K runs 📅 Sep 2023 ⚙️ Cog 0.8.6 🔗 GitHub 📄 Paper ⚖️ License
audio-restoration audio-super-resolution audio-to-audio

About

AudioSR: Versatile Audio Super-resolution at Scale

Example Output

Output

Example output

Performance Metrics

32.91s Prediction Time
121.48s Total Time
All Input Parameters
{
  "seed": 42,
  "ddim_steps": 50,
  "input_file": "https://replicate.delivery/pbxt/JYv70XQsiZBbSmknfMhGoEb4QYbuyJ9hJkfgjyzCvh4TzPmT/music.wav",
  "guidance_scale": 3.5
}
Input Parameters
seed Type: integer
Random seed. Leave blank to randomize the seed
ddim_steps Type: integerDefault: 50Range: 10 - 500
Number of inference steps
input_file (required) Type: string
Audio to upsample
guidance_scale Type: numberDefault: 3.5Range: 1 - 20
Scale for classifier free guidance
Output Schema

Output

Type: stringFormat: uri

Example Execution Logs
 Warning: audio is longer than 10.24 seconds, may degrade the model performance. It's recommand to truncate your audio to 5.12 seconds before input to AudioSR to get the best performance.
/src/audiosr/utils.py:109: FutureWarning: Pass sr=48000, n_fft=2048, n_mels=256, fmin=20, fmax=24000 as keyword args. From version 0.10 passing these as positional arguments will result in an error
mel = librosa_mel_fn(sampling_rate, filter_length, n_mel, mel_fmin, mel_fmax)
Running DDIM Sampling with 50 timesteps
DDIM Sampler:   0%|          | 0/50 [00:00<?, ?it/s]
DDIM Sampler:   2%|▏         | 1/50 [00:06<05:20,  6.54s/it]
DDIM Sampler:   4%|▍         | 2/50 [00:06<02:14,  2.80s/it]
DDIM Sampler:   6%|▌         | 3/50 [00:06<01:15,  1.60s/it]
DDIM Sampler:   8%|▊         | 4/50 [00:07<00:47,  1.04s/it]
DDIM Sampler:  10%|█         | 5/50 [00:07<00:32,  1.37it/s]
DDIM Sampler:  12%|█▏        | 6/50 [00:07<00:23,  1.84it/s]
DDIM Sampler:  14%|█▍        | 7/50 [00:07<00:18,  2.36it/s]
DDIM Sampler:  16%|█▌        | 8/50 [00:07<00:14,  2.89it/s]
DDIM Sampler:  18%|█▊        | 9/50 [00:07<00:12,  3.40it/s]
DDIM Sampler:  20%|██        | 10/50 [00:08<00:10,  3.86it/s]
DDIM Sampler:  22%|██▏       | 11/50 [00:08<00:09,  4.26it/s]
DDIM Sampler:  24%|██▍       | 12/50 [00:08<00:08,  4.59it/s]
DDIM Sampler:  26%|██▌       | 13/50 [00:08<00:07,  4.85it/s]
DDIM Sampler:  28%|██▊       | 14/50 [00:08<00:07,  5.04it/s]
DDIM Sampler:  30%|███       | 15/50 [00:09<00:06,  5.19it/s]
DDIM Sampler:  32%|███▏      | 16/50 [00:09<00:06,  5.29it/s]
DDIM Sampler:  34%|███▍      | 17/50 [00:09<00:06,  5.37it/s]
DDIM Sampler:  36%|███▌      | 18/50 [00:09<00:05,  5.42it/s]
DDIM Sampler:  38%|███▊      | 19/50 [00:09<00:05,  5.46it/s]
DDIM Sampler:  40%|████      | 20/50 [00:09<00:05,  5.49it/s]
DDIM Sampler:  42%|████▏     | 21/50 [00:10<00:05,  5.51it/s]
DDIM Sampler:  44%|████▍     | 22/50 [00:10<00:05,  5.53it/s]
DDIM Sampler:  46%|████▌     | 23/50 [00:10<00:04,  5.54it/s]
DDIM Sampler:  48%|████▊     | 24/50 [00:10<00:04,  5.55it/s]
DDIM Sampler:  50%|█████     | 25/50 [00:10<00:04,  5.55it/s]
DDIM Sampler:  52%|█████▏    | 26/50 [00:11<00:04,  5.56it/s]
DDIM Sampler:  54%|█████▍    | 27/50 [00:11<00:04,  5.56it/s]
DDIM Sampler:  56%|█████▌    | 28/50 [00:11<00:03,  5.56it/s]
DDIM Sampler:  58%|█████▊    | 29/50 [00:11<00:03,  5.56it/s]
DDIM Sampler:  60%|██████    | 30/50 [00:11<00:03,  5.56it/s]
DDIM Sampler:  62%|██████▏   | 31/50 [00:11<00:03,  5.56it/s]
DDIM Sampler:  64%|██████▍   | 32/50 [00:12<00:03,  5.48it/s]
DDIM Sampler:  66%|██████▌   | 33/50 [00:12<00:03,  5.50it/s]
DDIM Sampler:  68%|██████▊   | 34/50 [00:12<00:02,  5.44it/s]
DDIM Sampler:  70%|███████   | 35/50 [00:12<00:02,  5.23it/s]
DDIM Sampler:  72%|███████▏  | 36/50 [00:12<00:02,  5.08it/s]
DDIM Sampler:  74%|███████▍  | 37/50 [00:13<00:02,  4.99it/s]
DDIM Sampler:  76%|███████▌  | 38/50 [00:13<00:02,  4.85it/s]
DDIM Sampler:  78%|███████▊  | 39/50 [00:13<00:02,  4.74it/s]
DDIM Sampler:  80%|████████  | 40/50 [00:13<00:02,  4.71it/s]
DDIM Sampler:  82%|████████▏ | 41/50 [00:13<00:01,  4.69it/s]
DDIM Sampler:  84%|████████▍ | 42/50 [00:14<00:01,  4.68it/s]
DDIM Sampler:  86%|████████▌ | 43/50 [00:14<00:01,  4.68it/s]
DDIM Sampler:  88%|████████▊ | 44/50 [00:14<00:01,  4.81it/s]
DDIM Sampler:  90%|█████████ | 45/50 [00:14<00:00,  5.01it/s]
DDIM Sampler:  92%|█████████▏| 46/50 [00:14<00:00,  5.16it/s]
DDIM Sampler:  94%|█████████▍| 47/50 [00:15<00:00,  5.28it/s]
DDIM Sampler:  96%|█████████▌| 48/50 [00:15<00:00,  5.36it/s]
DDIM Sampler:  98%|█████████▊| 49/50 [00:15<00:00,  5.42it/s]
DDIM Sampler: 100%|██████████| 50/50 [00:15<00:00,  5.46it/s]
DDIM Sampler: 100%|██████████| 50/50 [00:15<00:00,  3.19it/s]
Version Details
Version ID
0e453d5e4c2e0ef4f8d38a6167053dda09cf3c8dbca2355cde61dca55a915bc5
Version Created
September 20, 2023
Run on Replicate →