lucataco/whisperspeech-small
About
An open-source text-to-speech system built by inverting Whisper.

Example Output
Prompt:
"This is the first demo of Whisper Speech, a fully open source text-to-speech model trained by Collabora and Lion on the Juwels supercomputer"
Performance Metrics
- Prediction Time: 24.50s
- Total Time: 145.56s
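The two timings above imply that most of the wall-clock time was spent outside the prediction itself. The short sketch below works out that gap from the reported numbers. The page does not say what the remainder consists of (queueing or model startup are plausible guesses, not confirmed), and the variable names are illustrative.

```python
# Timings copied from the Performance Metrics above, in seconds.
prediction_s = 24.50
total_s = 145.56

# Time not accounted for by the prediction itself.
# What this remainder covers is an assumption; the page does not state it.
overhead_s = round(total_s - prediction_s, 2)
overhead_share = overhead_s / total_s

print(f"non-prediction time: {overhead_s:.2f}s ({overhead_share:.0%} of total)")
```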
All Input Parameters
{ "prompt": "This is the first demo of Whisper Speech, a fully open source text-to-speech model trained by Collabora and Lion on the Juwels supercomputer", "speaker": "", "language": "en" }
Input Parameters
- prompt: Text to synthesize
- speaker: URL for zero-shot voice cloning (e.g. https://upload.wikimedia.org/wikipedia/commons/7/75/Winston_Churchill_-_Be_Ye_Men_of_Valour.ogg)
- language: Language to synthesize
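As a rough illustration of how these three parameters fit together, here is a minimal sketch that builds the input payload and passes it to the Replicate Python client. It assumes the `replicate` package is installed and `REPLICATE_API_TOKEN` is set. The helper names `build_input` and `synthesize` and the empty-prompt check are hypothetical additions, not part of the model. The page does not document the output format, so the return value is passed through unchanged.

```python
MODEL = "lucataco/whisperspeech-small"
# Version ID as listed under Version Details on this page.
VERSION = "70789b0c0bfa6d81964a43545867f34a8f8175572c429e7c3c2869fb6fa5ff95"


def build_input(prompt, speaker="", language="en"):
    """Assemble the input dict using the parameter names listed above.

    The empty-prompt check is an assumption added for illustration;
    the model page does not describe any validation.
    """
    if not prompt.strip():
        raise ValueError("prompt must not be empty")
    return {"prompt": prompt, "speaker": speaker, "language": language}


def synthesize(prompt, speaker="", language="en"):
    """Run the model on Replicate and return whatever the client returns."""
    # Imported lazily so the payload helper works without the client installed.
    import replicate  # third-party; reads REPLICATE_API_TOKEN from the environment

    return replicate.run(
        f"{MODEL}:{VERSION}",
        input=build_input(prompt, speaker, language),
    )
```

Leaving `speaker` as an empty string mirrors the example input on this page. Passing an audio URL there would request zero-shot voice cloning, as described in the parameter list.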
Output Schema
Output
Example Execution Logs
|----------------------------------------| 0.00% [0/749 00:00<?]
...
|█████████████---------------------------| 32.58% [244/749 00:03<00:07]
|----------------------------------------| 0.00% [0/752 00:00<?]
...
|████████████████████████████████████████| 100.00% [752/752 00:19<00:00]
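Progress lines in the format shown in the log above can be parsed to estimate throughput. The sketch below is one way to do that with a regular expression. The helper name `parse_progress` is hypothetical, and the sample string uses shortened bars with counters taken from the log.

```python
import re

# Matches the "NN.NN% [done/total MM:SS<" portion of each progress update.
PROGRESS = re.compile(r"\d+\.\d+%\s+\[(\d+)/(\d+) (\d+):(\d+)<")


def parse_progress(log):
    """Return (steps_done, steps_total, elapsed_seconds) for each update."""
    updates = []
    for done, total, minutes, seconds in PROGRESS.findall(log):
        updates.append((int(done), int(total), int(minutes) * 60 + int(seconds)))
    return updates


# Shortened bars for readability; the counters come from the log above.
sample = (
    "|----| 0.00% [0/752 00:00<?] "
    "|██--| 50.00% [376/752 00:09<00:09] "
    "|████| 100.00% [752/752 00:19<00:00]"
)

updates = parse_progress(sample)
done, total, elapsed = updates[-1]
rate = done / elapsed  # average steps per second over the whole run
print(f"{done}/{total} steps in {elapsed}s, about {rate:.1f} steps/s")
```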
Version Details
- Version ID: 70789b0c0bfa6d81964a43545867f34a8f8175572c429e7c3c2869fb6fa5ff95
- Version Created: January 18, 2024