ttsds/e2 📝🖼️ → 🖼️

▶️ 270 runs 📅 Jan 2025 ⚙️ Cog 0.13.6
text-to-speech voice-cloning

About

Example Output

Output

Example output

Performance Metrics

17.17s Prediction Time
216.46s Total Time
All Input Parameters
{
  "text": "With tenure, Suzie'd have all the more leisure for yachting, but her publications are no good.",
  "text_reference": "and keeping eternity before the eyes, though much",
  "speaker_reference": "https://replicate.delivery/pbxt/MNFXdPaUPOwYCZjZM4azsymbzE2TCV2WJXfGpeV2DrFWaSq8/example_en.wav"
}
Input Parameters
text (required) Type: string
text_reference (required) Type: string
speaker_reference (required) Type: string
Output Schema

Output

Type: stringFormat: uri

Example Execution Logs
/tmp/tmpfrof9jz_example_en.wav
Converting audio...
Using custom reference text...
gen_text 0 With tenure, Suzie'd have all the more leisure for yachting, but her publications are no good.
Generating audio in 1 batches...
0%|          | 0/1 [00:00<?, ?it/s]Building prefix dict from the default dictionary ...
Dumping model to file cache /tmp/jieba.cache
Loading model cost 0.928 seconds.
Prefix dict has been built successfully.
100%|██████████| 1/1 [00:16<00:00, 16.43s/it]
100%|██████████| 1/1 [00:16<00:00, 16.43s/it]
Version Details
Version ID
5ab0d513ddfb7cd35904877554773eabf0cc6c49a6a7a70f63aaae322d8767dc
Version Created
January 27, 2025
Run on Replicate →