ttsds/gptsovits_1

245 runs · Jan 2025 · Cog 0.13.6
multilingual text-to-speech voice-cloning

Example Output

Performance Metrics

4.41s Prediction Time
24.93s Total Time
All Input Parameters
{
  "text": "With tenure, Suzie'd have all the more leisure for yachting, but her publications are no good.",
  "language": "en",
  "text_reference": "and keeping eternity before the eyes, though much.",
  "speaker_reference": "https://replicate.delivery/pbxt/MNFXdPaUPOwYCZjZM4azsymbzE2TCV2WJXfGpeV2DrFWaSq8/example_en.wav"
}
Input Parameters
text (required) Type: string
language (required)
text_reference (required) Type: string
speaker_reference (required) Type: string
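
The four required inputs above can be assembled into a prediction request. The following is a minimal sketch, not the model author's client code: it uses only the Python standard library, takes the version ID from the Version Details section of this page, and assumes Replicate's public HTTP endpoint `https://api.replicate.com/v1/predictions` with a bearer token. The helper names `build_input` and `build_request` are hypothetical, and treating `language` as a short string code such as "en" is an assumption based on the example input, since the page lists no type for it.

```python
import json
import urllib.request

# Version ID copied from the "Version Details" section of this page.
VERSION_ID = "33e629c53dd218f539d7014e87feb1b1186e3384e0581eefd4b42e341276b225"
REQUIRED_FIELDS = ("text", "language", "text_reference", "speaker_reference")


def build_input(text, language, text_reference, speaker_reference):
    """Return the input dict, rejecting empty or non-string values."""
    payload = {
        "text": text,
        "language": language,  # assumed to be a string code such as "en"
        "text_reference": text_reference,
        "speaker_reference": speaker_reference,
    }
    for name in REQUIRED_FIELDS:
        value = payload[name]
        if not isinstance(value, str) or not value.strip():
            raise ValueError(f"{name} is required and must be a non-empty string")
    return payload


def build_request(model_input, api_token):
    """Build (but do not send) a POST request for a new prediction."""
    body = json.dumps({"version": VERSION_ID, "input": model_input}).encode("utf-8")
    return urllib.request.Request(
        "https://api.replicate.com/v1/predictions",
        data=body,
        headers={
            "Authorization": f"Bearer {api_token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

Passing the resulting request to `urllib.request.urlopen` would submit it. The sketch stops short of that so it can be inspected without a token or network access.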
Output Schema

Output

Type: string
Format: uri
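
Because the output is a single URI string, a caller usually needs to validate it and choose a local filename before downloading. The sketch below shows one way to do that with the standard library. The helper `output_filename` is hypothetical, and the `output.wav` fallback is an assumption: this page does not state the audio format of the result.

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse


def output_filename(output_uri, default="output.wav"):
    """Derive a local filename from the output URI, or fall back to a default."""
    parsed = urlparse(output_uri)
    # The schema only promises "uri"; this sketch accepts http(s) URIs only.
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"not an http(s) URI: {output_uri!r}")
    name = PurePosixPath(parsed.path).name
    return name or default
```

The returned name can then be used as the target path for `urllib.request.urlretrieve(output_uri, name)` or any other download helper.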

Example Execution Logs
[nltk_data] Downloading package averaged_perceptron_tagger to
[nltk_data]     /root/nltk_data...
[nltk_data]   Unzipping taggers/averaged_perceptron_tagger.zip.
[nltk_data] Downloading package cmudict to /root/nltk_data...
[nltk_data]   Unzipping corpora/cmudict.zip.
  0%|          | 0/1500 [00:00<?, ?it/s]
  0%|          | 6/1500 [00:00<00:25, 59.70it/s]
  3%|▎         | 42/1500 [00:00<00:06, 235.06it/s]
  5%|▌         | 77/1500 [00:00<00:04, 285.12it/s]
T2S Decoding EOS [92 -> 231]
  7%|▋         | 112/1500 [00:00<00:04, 308.94it/s]
  9%|▉         | 138/1500 [00:00<00:04, 286.24it/s]
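
Logs in this shape can be summarized programmatically, for example to see how far the progress counter advanced before the run ended. The following sketch extracts the last progress reading and the two numbers from the `T2S Decoding EOS` line. It is only a parser: the page does not document what the bracketed numbers mean, so the sketch returns them without interpretation. The function name `summarize_log` is hypothetical.

```python
import re

# Progress-bar line, e.g. "  7%|▋         | 112/1500 [00:00<00:04, 308.94it/s]"
PROGRESS_RE = re.compile(r"\|\s*(\d+)/(\d+)\s*\[")
# e.g. "T2S Decoding EOS [92 -> 231]"
EOS_RE = re.compile(r"T2S Decoding EOS \[(\d+) -> (\d+)\]")


def summarize_log(log_text):
    """Return the last progress reading and the EOS pair found in the log."""
    last_step, total_steps, eos = None, None, None
    for line in log_text.splitlines():
        match = EOS_RE.search(line)
        if match:
            eos = (int(match.group(1)), int(match.group(2)))
            continue
        match = PROGRESS_RE.search(line)
        if match:
            last_step, total_steps = int(match.group(1)), int(match.group(2))
    return {"last_step": last_step, "total_steps": total_steps, "eos": eos}
```

Lines that match neither pattern, such as the `[nltk_data]` download messages, are ignored.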
Version Details
Version ID
33e629c53dd218f539d7014e87feb1b1186e3384e0581eefd4b42e341276b225
Version Created
March 24, 2025