ttsds/whisperspeech 📝❓🖼️ → 🖼️

▶️ 1.3K runs 📅 Feb 2025 ⚙️ Cog 0.13.6
text-to-speech voice-cloning

About

Example Output

Output

Example output

Performance Metrics

27.83s Prediction Time
216.35s Total Time
All Input Parameters
{
  "text": "With tenure, Suzie'd have all the more leisure for yachting, but her publications are no good.",
  "version": "small",
  "language": "en",
  "speaker_reference": "https://replicate.delivery/pbxt/MNFXdPaUPOwYCZjZM4azsymbzE2TCV2WJXfGpeV2DrFWaSq8/example_en.wav"
}
Input Parameters
text (required) Type: string
Text to synthesize
version Default: small
Version of the model to use
language Default: en
Language of the text
speaker_reference (required) Type: string
Reference audio file
Output Schema

Output

Type: stringFormat: uri

Example Execution Logs
Received request for small
Generating speech
█
|----------------------------------------| 0.00% [0/748 00:00<?]
|----------------------------------------| 0.13% [1/748 00:00<00:11]
|----------------------------------------| 0.27% [2/748 00:00<00:11]
|----------------------------------------| 0.40% [3/748 00:00<00:11]
|----------------------------------------| 0.53% [4/748 00:00<00:11]
|----------------------------------------| 0.67% [5/748 00:00<00:11]
|----------------------------------------| 2.41% [18/748 00:00<00:10]
|█---------------------------------------| 4.14% [31/748 00:00<00:10]
|██--------------------------------------| 5.88% [44/748 00:00<00:10]
|███-------------------------------------| 7.62% [57/748 00:00<00:09]
|███-------------------------------------| 9.36% [70/748 00:01<00:09]
|████------------------------------------| 11.10% [83/748 00:01<00:09]
|█████-----------------------------------| 12.83% [96/748 00:01<00:09]
|█████-----------------------------------| 14.57% [109/748 00:01<00:09]
|██████----------------------------------| 16.31% [122/748 00:01<00:09]
|███████---------------------------------| 18.05% [135/748 00:01<00:08]
|███████---------------------------------| 19.79% [148/748 00:02<00:08]
█
|----------------------------------------| 0.00% [0/475 00:00<?]
|----------------------------------------| 0.21% [1/475 00:00<00:21]
|----------------------------------------| 0.42% [2/475 00:00<00:21]
|----------------------------------------| 0.63% [3/475 00:00<00:21]
|----------------------------------------| 0.84% [4/475 00:00<00:21]
|----------------------------------------| 1.05% [5/475 00:00<00:21]
|----------------------------------------| 1.89% [9/475 00:00<00:21]
|█---------------------------------------| 2.74% [13/475 00:00<00:21]
|█---------------------------------------| 3.58% [17/475 00:00<00:20]
|█---------------------------------------| 4.42% [21/475 00:00<00:20]
|██--------------------------------------| 5.26% [25/475 00:01<00:20]
|██--------------------------------------| 6.11% [29/475 00:01<00:20]
|██--------------------------------------| 6.95% [33/475 00:01<00:20]
|███-------------------------------------| 7.79% [37/475 00:01<00:20]
|███-------------------------------------| 8.63% [41/475 00:01<00:19]
|███-------------------------------------| 9.47% [45/475 00:02<00:19]
|████------------------------------------| 10.32% [49/475 00:02<00:19]
|████------------------------------------| 11.16% [53/475 00:02<00:19]
|████------------------------------------| 12.00% [57/475 00:02<00:19]
|█████-----------------------------------| 12.84% [61/475 00:02<00:18]
|█████-----------------------------------| 13.68% [65/475 00:02<00:18]
|█████-----------------------------------| 14.53% [69/475 00:03<00:18]
|██████----------------------------------| 15.37% [73/475 00:03<00:18]
|██████----------------------------------| 16.21% [77/475 00:03<00:18]
|██████----------------------------------| 17.05% [81/475 00:03<00:18]
|███████---------------------------------| 17.89% [85/475 00:03<00:17]
|███████---------------------------------| 18.74% [89/475 00:04<00:17]
|███████---------------------------------| 19.58% [93/475 00:04<00:17]
|████████--------------------------------| 20.42% [97/475 00:04<00:17]
|████████--------------------------------| 21.26% [101/475 00:04<00:17]
|████████--------------------------------| 22.11% [105/475 00:04<00:16]
|█████████-------------------------------| 22.95% [109/475 00:04<00:16]
|█████████-------------------------------| 23.79% [113/475 00:05<00:16]
|█████████-------------------------------| 24.63% [117/475 00:05<00:16]
|██████████------------------------------| 25.47% [121/475 00:05<00:16]
|██████████------------------------------| 26.32% [125/475 00:05<00:16]
|██████████------------------------------| 27.16% [129/475 00:05<00:15]
|███████████-----------------------------| 28.00% [133/475 00:06<00:15]
|███████████-----------------------------| 28.84% [137/475 00:06<00:15]
|███████████-----------------------------| 29.68% [141/475 00:06<00:15]
|████████████----------------------------| 30.53% [145/475 00:06<00:15]
|████████████----------------------------| 31.37% [149/475 00:06<00:14]
|████████████----------------------------| 32.21% [153/475 00:07<00:14]
|█████████████---------------------------| 33.05% [157/475 00:07<00:14]
|█████████████---------------------------| 33.89% [161/475 00:07<00:14]
|█████████████---------------------------| 34.74% [165/475 00:07<00:14]
|██████████████--------------------------| 35.58% [169/475 00:07<00:14]
|██████████████--------------------------| 36.42% [173/475 00:07<00:13]
|██████████████--------------------------| 37.26% [177/475 00:08<00:13]
|███████████████-------------------------| 38.11% [181/475 00:08<00:13]
|███████████████-------------------------| 38.95% [185/475 00:08<00:13]
|███████████████-------------------------| 39.79% [189/475 00:08<00:13]
|████████████████------------------------| 40.63% [193/475 00:08<00:12]
|████████████████------------------------| 41.47% [197/475 00:09<00:12]
|████████████████------------------------| 42.32% [201/475 00:09<00:12]
|█████████████████-----------------------| 43.16% [205/475 00:09<00:12]
|█████████████████-----------------------| 44.00% [209/475 00:09<00:12]
|█████████████████-----------------------| 44.84% [213/475 00:09<00:12]
|██████████████████----------------------| 45.68% [217/475 00:09<00:11]
|██████████████████----------------------| 46.53% [221/475 00:10<00:11]
|██████████████████----------------------| 47.37% [225/475 00:10<00:11]
|███████████████████---------------------| 48.21% [229/475 00:10<00:11]
|███████████████████---------------------| 49.05% [233/475 00:10<00:11]
|███████████████████---------------------| 49.89% [237/475 00:10<00:10]
|████████████████████--------------------| 50.74% [241/475 00:11<00:10]
|████████████████████--------------------| 51.58% [245/475 00:11<00:10]
|████████████████████--------------------| 52.42% [249/475 00:11<00:10]
|█████████████████████-------------------| 53.26% [253/475 00:11<00:10]
|█████████████████████-------------------| 54.11% [257/475 00:11<00:10]
|█████████████████████-------------------| 54.95% [261/475 00:11<00:09]
|██████████████████████------------------| 55.79% [265/475 00:12<00:09]
|██████████████████████------------------| 56.63% [269/475 00:12<00:09]
|██████████████████████------------------| 57.47% [273/475 00:12<00:09]
|███████████████████████-----------------| 58.32% [277/475 00:12<00:09]
|███████████████████████-----------------| 59.16% [281/475 00:12<00:08]
|████████████████████████----------------| 60.00% [285/475 00:13<00:08]
|████████████████████████----------------| 60.84% [289/475 00:13<00:08]
|████████████████████████----------------| 61.68% [293/475 00:13<00:08]
|█████████████████████████---------------| 62.53% [297/475 00:13<00:08]
|█████████████████████████---------------| 63.37% [301/475 00:13<00:08]
|█████████████████████████---------------| 64.21% [305/475 00:14<00:07]
|██████████████████████████--------------| 65.05% [309/475 00:14<00:07]
|██████████████████████████--------------| 65.89% [313/475 00:14<00:07]
|██████████████████████████--------------| 66.74% [317/475 00:14<00:07]
|███████████████████████████-------------| 67.58% [321/475 00:14<00:07]
|███████████████████████████-------------| 68.42% [325/475 00:14<00:06]
|███████████████████████████-------------| 69.26% [329/475 00:15<00:06]
|████████████████████████████------------| 70.11% [333/475 00:15<00:06]
|████████████████████████████------------| 70.95% [337/475 00:15<00:06]
|████████████████████████████------------| 71.79% [341/475 00:15<00:06]
|█████████████████████████████-----------| 72.63% [345/475 00:15<00:05]
|█████████████████████████████-----------| 73.47% [349/475 00:16<00:05]
|█████████████████████████████-----------| 74.32% [353/475 00:16<00:05]
|██████████████████████████████----------| 75.16% [357/475 00:16<00:05]
|██████████████████████████████----------| 76.00% [361/475 00:16<00:05]
|██████████████████████████████----------| 76.84% [365/475 00:16<00:05]
|███████████████████████████████---------| 77.68% [369/475 00:16<00:04]
|███████████████████████████████---------| 78.53% [373/475 00:17<00:04]
|███████████████████████████████---------| 79.37% [377/475 00:17<00:04]
|████████████████████████████████--------| 80.21% [381/475 00:17<00:04]
|████████████████████████████████--------| 81.05% [385/475 00:17<00:04]
|████████████████████████████████--------| 81.89% [389/475 00:17<00:03]
|█████████████████████████████████-------| 82.74% [393/475 00:18<00:03]
|█████████████████████████████████-------| 83.58% [397/475 00:18<00:03]
|█████████████████████████████████-------| 84.42% [401/475 00:18<00:03]
|██████████████████████████████████------| 85.26% [405/475 00:18<00:03]
|██████████████████████████████████------| 86.11% [409/475 00:18<00:03]
|██████████████████████████████████------| 86.95% [413/475 00:18<00:02]
|███████████████████████████████████-----| 87.79% [417/475 00:19<00:02]
|███████████████████████████████████-----| 88.63% [421/475 00:19<00:02]
|███████████████████████████████████-----| 89.47% [425/475 00:19<00:02]
|████████████████████████████████████----| 90.32% [429/475 00:19<00:02]
|████████████████████████████████████----| 91.16% [433/475 00:19<00:01]
|████████████████████████████████████----| 92.00% [437/475 00:20<00:01]
|█████████████████████████████████████---| 92.84% [441/475 00:20<00:01]
|█████████████████████████████████████---| 93.68% [445/475 00:20<00:01]
|█████████████████████████████████████---| 94.53% [449/475 00:20<00:01]
|██████████████████████████████████████--| 95.37% [453/475 00:20<00:01]
|██████████████████████████████████████--| 96.21% [457/475 00:20<00:00]
|██████████████████████████████████████--| 97.05% [461/475 00:21<00:00]
|███████████████████████████████████████-| 97.89% [465/475 00:21<00:00]
|███████████████████████████████████████-| 98.74% [469/475 00:21<00:00]
|███████████████████████████████████████-| 99.58% [473/475 00:21<00:00]
|████████████████████████████████████████| 100.00% [475/475 00:21<00:00]
Version Details
Version ID
9328cf0b16ea5fff4c986377ba9def07f5106e779e04fbf5754afa7cfff4c53c
Version Created
March 26, 2025
Run on Replicate →