mtg/effnet-discogs 📝🖼️🔢❓ → 🖼️

▶️ 320.9K runs 📅 Nov 2021 ⚙️ Cog 0.7.2 🔗 GitHub ⚖️ License
audio-classification music-understanding

About

An EfficientNet for music style classification by 400 styles from the Discogs taxonomy

Example Output

Output

Example output

Performance Metrics

65.63s Prediction Time
99.55s Total Time
All Input Parameters
{
  "url": "https://www.youtube.com/watch?v=CHekNnySAfM",
  "top_n": "10",
  "output_format": "Visualization"
}
Input Parameters
url Type: string
YouTube URL to process (overrides audio input)
audio Type: string
Audio file to process
top_n Type: integerDefault: 10
Top n music styles to show
output_format Default: Visualization
Output either a bar chart visualization or a JSON blob
Output Schema

Output

Type: stringFormat: uri

Example Execution Logs
[youtube] CHekNnySAfM: Downloading webpage
[youtube] CHekNnySAfM: Downloading player bd67d609
[download] Destination: /tmp/tmp5hp8fkyh/audio.webm

[download]   0.0% of 3.82MiB at  7.33KiB/s ETA 08:53
[download]   0.1% of 3.82MiB at 21.97KiB/s ETA 02:57
[download]   0.2% of 3.82MiB at 51.24KiB/s ETA 01:16
[download]   0.4% of 3.82MiB at 109.68KiB/s ETA 00:35
[download]   0.8% of 3.82MiB at 88.27KiB/s ETA 00:43
[download]   1.6% of 3.82MiB at 76.81KiB/s ETA 00:50
[download]   3.2% of 3.82MiB at 73.03KiB/s ETA 00:51
[download]   5.0% of 3.82MiB at 68.01KiB/s ETA 00:54
[download]   6.6% of 3.82MiB at 67.48KiB/s ETA 00:54
[download]   8.3% of 3.82MiB at 68.29KiB/s ETA 00:52
[download]  10.1% of 3.82MiB at 69.88KiB/s ETA 00:50
[download]  12.1% of 3.82MiB at 69.50KiB/s ETA 00:49
[download]  13.8% of 3.82MiB at 70.00KiB/s ETA 00:48
[download]  15.7% of 3.82MiB at 69.33KiB/s ETA 00:47
[download]  17.3% of 3.82MiB at 69.45KiB/s ETA 00:46
[download]  19.2% of 3.82MiB at 70.08KiB/s ETA 00:45
[download]  21.1% of 3.82MiB at 69.75KiB/s ETA 00:44
[download]  22.8% of 3.82MiB at 69.95KiB/s ETA 00:43
[download]  24.7% of 3.82MiB at 69.37KiB/s ETA 00:42
[download]  26.3% of 3.82MiB at 69.24KiB/s ETA 00:41
[download]  28.0% of 3.82MiB at 69.47KiB/s ETA 00:40
[download]  29.9% of 3.82MiB at 69.07KiB/s ETA 00:39
[download]  31.5% of 3.82MiB at 69.07KiB/s ETA 00:38
[download]  33.3% of 3.82MiB at 69.38KiB/s ETA 00:37
[download]  35.2% of 3.82MiB at 69.21KiB/s ETA 00:36
[download]  36.9% of 3.82MiB at 69.35KiB/s ETA 00:35
[download]  38.7% of 3.82MiB at 69.73KiB/s ETA 00:34
[download]  40.7% of 3.82MiB at 69.65KiB/s ETA 00:33
[download]  42.5% of 3.82MiB at 69.82KiB/s ETA 00:32
[download]  44.4% of 3.82MiB at 69.56KiB/s ETA 00:31
[download]  46.0% of 3.82MiB at 69.57KiB/s ETA 00:30
[download]  47.8% of 3.82MiB at 69.80KiB/s ETA 00:29
[download]  49.8% of 3.82MiB at 69.65KiB/s ETA 00:28
[download]  51.4% of 3.82MiB at 69.72KiB/s ETA 00:27
[download]  53.3% of 3.82MiB at 69.48KiB/s ETA 00:26
[download]  54.9% of 3.82MiB at 69.43KiB/s ETA 00:25
[download]  56.6% of 3.82MiB at 69.56KiB/s ETA 00:24
[download]  58.5% of 3.82MiB at 69.36KiB/s ETA 00:23
[download]  60.2% of 3.82MiB at 69.37KiB/s ETA 00:22
[download]  61.9% of 3.82MiB at 69.54KiB/s ETA 00:21
[download]  63.9% of 3.82MiB at 69.42KiB/s ETA 00:20
[download]  65.6% of 3.82MiB at 69.48KiB/s ETA 00:19
[download]  67.4% of 3.82MiB at 69.68KiB/s ETA 00:18
[download]  69.4% of 3.82MiB at 69.62KiB/s ETA 00:17
[download]  71.1% of 3.82MiB at 69.71KiB/s ETA 00:16
[download]  73.0% of 3.82MiB at 69.54KiB/s ETA 00:15
[download]  74.6% of 3.82MiB at 69.53KiB/s ETA 00:14
[download]  76.4% of 3.82MiB at 69.66KiB/s ETA 00:13
[download]  78.3% of 3.82MiB at 69.58KiB/s ETA 00:12
[download]  80.0% of 3.82MiB at 69.64KiB/s ETA 00:11
[download]  81.9% of 3.82MiB at 69.44KiB/s ETA 00:10
[download]  83.4% of 3.82MiB at 69.40KiB/s ETA 00:09
[download]  85.2% of 3.82MiB at 69.48KiB/s ETA 00:08
[download]  87.0% of 3.82MiB at 69.67KiB/s ETA 00:07
[download]  89.1% of 3.82MiB at 69.66KiB/s ETA 00:06
[download]  90.9% of 3.82MiB at 69.45KiB/s ETA 00:05
[download]  92.4% of 3.82MiB at 69.69KiB/s ETA 00:04
[download]  94.6% of 3.82MiB at 69.55KiB/s ETA 00:03
[download]  96.3% of 3.82MiB at 69.56KiB/s ETA 00:02
[download]  98.1% of 3.82MiB at 69.67KiB/s ETA 00:01
[download] 100.0% of 3.82MiB at 69.70KiB/s ETA 00:00
[download] 100% of 3.82MiB in 00:56
[ffmpeg] Destination: /tmp/tmp5hp8fkyh/audio.wav
Deleting original file /tmp/tmp5hp8fkyh/audio.webm (pass -k to keep)
running the inference network...
2022-03-16 13:55:58.101663: I tensorflow/core/platform/profile_utils/cpu_utils.cc:114] CPU Frequency: 2299995000 Hz
plotting...
done!
Version Details
Version ID
1532dd069fb4f0e27c6833e28815f6b8c194dfec76fd9cd73460540fd720ffe1
Version Created
May 25, 2023
Run on Replicate →