ttsds/f5 📝🖼️ → 🖼️

▶️ 2.7K runs 📅 Jan 2025 ⚙️ Cog 0.13.6
speech-style-transfer text-to-speech voice-cloning

About

Example Output

Output

Example output

Performance Metrics

3.01s Prediction Time
49.16s Total Time
All Input Parameters
{
  "text": "With tenure, Suzie'd have all the more leisure for yachting, but her publications are no good.",
  "text_reference": "and keeping eternity before the eyes, though much.",
  "speaker_reference": "https://replicate.delivery/pbxt/MNFXdPaUPOwYCZjZM4azsymbzE2TCV2WJXfGpeV2DrFWaSq8/example_en.wav"
}
Input Parameters
text (required) Type: string
text_reference (required) Type: string
speaker_reference (required) Type: string
Output Schema

Output

Type: stringFormat: uri

Example Execution Logs
/tmp/tmp3nvq3ri2example_en.wav
Converting audio...
Using custom reference text...
gen_text 0With tenure, Suzie'd have all the more leisure for yachting, but her publications are no good.
Generating audio in 1 batches...
0%|          | 0/1 [00:00<?, ?it/s]Building prefix dict from the default dictionary ...
Dumping model to file cache /tmp/jieba.cache
Loading model cost 0.413 seconds.
Prefix dict has been built successfully.
100%|██████████| 1/1 [00:02<00:00,  2.75s/it]
100%|██████████| 1/1 [00:02<00:00,  2.75s/it]
Version Details
Version ID
8ed3cd9ee4f9a15bb24ce3752d9df17746ae0a62f105fae42603f102bd9af1d4
Version Created
January 30, 2025
Run on Replicate →