ttsds/e2 📝🖼️ → 🖼️

▶️ 273 runs 📅 Jan 2025 ⚙️ Cog 0.13.6

text-to-speech voice-cloning

Performance

17.2sTypical run time

~216sCold start (first call)

273Total runs

About

Example Output

Output

Performance Metrics

17.17s Prediction Time

216.46s Total Time

All Input Parameters

{
  "text": "With tenure, Suzie'd have all the more leisure for yachting, but her publications are no good.",
  "text_reference": "and keeping eternity before the eyes, though much",
  "speaker_reference": "https://replicate.delivery/pbxt/MNFXdPaUPOwYCZjZM4azsymbzE2TCV2WJXfGpeV2DrFWaSq8/example_en.wav"
}

Input Parameters

text (required) Type: string
text_reference (required) Type: string
speaker_reference (required) Type: string

Output Schema

Output

Type: string • Format: uri

Example Execution Logs

/tmp/tmpfrof9jz_example_en.wav
Converting audio...
Using custom reference text...
gen_text 0 With tenure, Suzie'd have all the more leisure for yachting, but her publications are no good.
Generating audio in 1 batches...
0%|          | 0/1 [00:00<?, ?it/s]Building prefix dict from the default dictionary ...
Dumping model to file cache /tmp/jieba.cache
Loading model cost 0.928 seconds.
Prefix dict has been built successfully.
100%|██████████| 1/1 [00:16<00:00, 16.43s/it]
100%|██████████| 1/1 [00:16<00:00, 16.43s/it]

Version Details

Version ID: 5ab0d513ddfb7cd35904877554773eabf0cc6c49a6a7a70f63aaae322d8767dc
Version Created: January 27, 2025

Run on Replicate →