ttsds/tortoise 📝🖼️ → 🖼️

▶️ 1.7K runs 📅 Feb 2025 ⚙️ Cog 0.13.6
text-to-speech voice-cloning

About

Example Output

Output

Example output

Performance Metrics

25.28s Prediction Time
142.33s Total Time
All Input Parameters
{
  "text": "With tenure, Suzie'd have all the more leisure for yachting, but her publications are no good.",
  "speaker_reference": "https://replicate.delivery/pbxt/MNFXdPaUPOwYCZjZM4azsymbzE2TCV2WJXfGpeV2DrFWaSq8/example_en.wav"
}
Input Parameters
text (required) Type: string
speaker_reference Type: string
Output Schema

Output

Type: stringFormat: uri

Example Execution Logs
Generating autoregressive samples..
  0%|          | 0/6 [00:00<?, ?it/s]
 17%|█▋        | 1/6 [00:03<00:19,  3.92s/it]
 33%|███▎      | 2/6 [00:06<00:12,  3.13s/it]
 50%|█████     | 3/6 [00:09<00:08,  2.95s/it]
 67%|██████▋   | 4/6 [00:11<00:05,  2.77s/it]
 83%|████████▎ | 5/6 [00:14<00:02,  2.69s/it]
100%|██████████| 6/6 [00:16<00:00,  2.62s/it]
100%|██████████| 6/6 [00:16<00:00,  2.79s/it]
Computing best candidates using CLVP
  0%|          | 0/6 [00:00<?, ?it/s]
 33%|███▎      | 2/6 [00:00<00:00,  7.47it/s]
 50%|█████     | 3/6 [00:00<00:00,  6.24it/s]
 67%|██████▋   | 4/6 [00:00<00:00,  5.72it/s]
 83%|████████▎ | 5/6 [00:00<00:00,  5.48it/s]
100%|██████████| 6/6 [00:01<00:00,  5.34it/s]
100%|██████████| 6/6 [00:01<00:00,  5.67it/s]
Transforming autoregressive outputs into audio..
  0%|          | 0/80 [00:00<?, ?it/s]
  4%|▍         | 3/80 [00:00<00:03, 25.39it/s]
  8%|▊         | 6/80 [00:00<00:02, 26.74it/s]
 11%|█▏        | 9/80 [00:00<00:02, 27.23it/s]
 15%|█▌        | 12/80 [00:00<00:02, 27.38it/s]
 19%|█▉        | 15/80 [00:00<00:02, 27.48it/s]
 22%|██▎       | 18/80 [00:00<00:02, 27.57it/s]
 26%|██▋       | 21/80 [00:00<00:02, 27.72it/s]
 30%|███       | 24/80 [00:00<00:02, 27.48it/s]
 34%|███▍      | 27/80 [00:00<00:01, 27.65it/s]
 38%|███▊      | 30/80 [00:01<00:01, 27.70it/s]
 41%|████▏     | 33/80 [00:01<00:01, 27.79it/s]
 45%|████▌     | 36/80 [00:01<00:01, 27.90it/s]
 49%|████▉     | 39/80 [00:01<00:01, 27.96it/s]
 52%|█████▎    | 42/80 [00:01<00:01, 27.85it/s]
 56%|█████▋    | 45/80 [00:01<00:01, 27.81it/s]
 60%|██████    | 48/80 [00:01<00:01, 27.81it/s]
 64%|██████▍   | 51/80 [00:01<00:01, 27.82it/s]
 68%|██████▊   | 54/80 [00:01<00:00, 27.81it/s]
 71%|███████▏  | 57/80 [00:02<00:00, 27.85it/s]
 75%|███████▌  | 60/80 [00:02<00:00, 27.89it/s]
 79%|███████▉  | 63/80 [00:02<00:00, 27.90it/s]
 82%|████████▎ | 66/80 [00:02<00:00, 27.88it/s]
 86%|████████▋ | 69/80 [00:02<00:00, 27.65it/s]
 90%|█████████ | 72/80 [00:02<00:00, 27.74it/s]
 94%|█████████▍| 75/80 [00:02<00:00, 27.86it/s]
 98%|█████████▊| 78/80 [00:02<00:00, 27.92it/s]
100%|██████████| 80/80 [00:02<00:00, 27.72it/s]
Version Details
Version ID
274f7fd0812bba7717154110a0ffd2930db847135ab234e23a9d7c35924b8a09
Version Created
February 24, 2025
Run on Replicate →