ttsds/f5 📝🖼️ → 🖼️

▶️ 2.7K runs 📅 Jan 2025 ⚙️ Cog 0.13.6

speech-style-transfer text-to-speech voice-cloning

Performance

3.0sTypical run time

~49sCold start (first call)

2.7KTotal runs

About

Example Output

Output

Performance Metrics

3.01s Prediction Time

49.16s Total Time

All Input Parameters

{
  "text": "With tenure, Suzie'd have all the more leisure for yachting, but her publications are no good.",
  "text_reference": "and keeping eternity before the eyes, though much.",
  "speaker_reference": "https://replicate.delivery/pbxt/MNFXdPaUPOwYCZjZM4azsymbzE2TCV2WJXfGpeV2DrFWaSq8/example_en.wav"
}

Input Parameters

text (required) Type: string
text_reference (required) Type: string
speaker_reference (required) Type: string

Output Schema

Output

Type: string • Format: uri

Example Execution Logs

/tmp/tmp3nvq3ri2example_en.wav
Converting audio...
Using custom reference text...
gen_text 0With tenure, Suzie'd have all the more leisure for yachting, but her publications are no good.
Generating audio in 1 batches...
0%|          | 0/1 [00:00<?, ?it/s]Building prefix dict from the default dictionary ...
Dumping model to file cache /tmp/jieba.cache
Loading model cost 0.413 seconds.
Prefix dict has been built successfully.
100%|██████████| 1/1 [00:02<00:00,  2.75s/it]
100%|██████████| 1/1 [00:02<00:00,  2.75s/it]

Version Details

Version ID: 8ed3cd9ee4f9a15bb24ce3752d9df17746ae0a62f105fae42603f102bd9af1d4
Version Created: January 30, 2025

Run on Replicate →