ttsds/amphion_naturalspeech2

244 runs · Jan 2025 · Cog 0.13.6
text-to-speech voice-cloning

About

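The NaturalSpeech2 text-to-speech model from Amphion. It takes the `text` to speak and a `speaker_reference` audio URL, and returns a URI pointing to the generated output.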

Example Output

Performance Metrics

3.53s Prediction Time
91.84s Total Time
All Input Parameters
{
  "text": "With tenure, Suzie'd have all the more leisure for yachting, but her publications are no good.",
  "speaker_reference": "https://replicate.delivery/pbxt/MN9oVKrayGMTWkp7zLYiFI4f2MxcvUNXdLPZKNm2XF6pfFCd/example.wav"
}
Input Parameters
text (required) Type: string
speaker_reference (required) Type: string
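
The two required inputs above can be passed to the model through the Replicate Python client. The following is a minimal sketch, not the model author's own example: it assumes the third-party `replicate` package is installed and a `REPLICATE_API_TOKEN` environment variable is set. The helper names `build_request` and `synthesize` are hypothetical and exist only for illustration.

```python
MODEL = "ttsds/amphion_naturalspeech2"
# Version ID copied from the "Version Details" section of this page.
VERSION = "ef5a325879add6bf5f161568d9ebad53824fa3b6b7f0eacb7f300e8720e861f7"


def build_request(text: str, speaker_reference: str) -> tuple[str, dict]:
    """Return the model reference and input payload (hypothetical helper)."""
    if not text or not speaker_reference:
        raise ValueError("both 'text' and 'speaker_reference' are required")
    payload = {"text": text, "speaker_reference": speaker_reference}
    return f"{MODEL}:{VERSION}", payload


def synthesize(text: str, speaker_reference: str):
    """Run a prediction; needs network access and an API token."""
    import replicate  # imported lazily so build_request works without it

    ref, payload = build_request(text, speaker_reference)
    # The output schema is a URI string; depending on the client version,
    # the returned object may be a plain URL string or a file-like wrapper.
    return replicate.run(ref, input=payload)


if __name__ == "__main__":
    ref, payload = build_request(
        "With tenure, Suzie'd have all the more leisure for yachting, "
        "but her publications are no good.",
        "https://replicate.delivery/pbxt/MN9oVKrayGMTWkp7zLYiFI4f2MxcvUNXdLPZKNm2XF6pfFCd/example.wav",
    )
    print(ref)
    print(payload)
```

Pinning the version ID keeps results reproducible if a newer version of the model is published later.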
Output Schema

Output

Type: string, Format: uri

Example Execution Logs
W IH0 DH T EH1 N Y ER0 sp S UW1 Z IY0 D HH AE1 V AO1 L DH AH0 M AO1 R L EH1 ZH ER0 F AO1 R Y AA1 T IH0 NG sp B AH1 T HH ER0 P AH2 B L IH0 K EY1 SH AH0 N Z AA1 R N OW1 G IH0 D
tensor([[80, 44, 27, 69, 30, 55, 81, 33, 84, 67, 77, 82, 48, 26, 42,  6, 79, 14,
53, 27,  9, 54, 14, 66, 53, 30, 83, 33, 40, 14, 66, 81,  2, 69, 44, 56,
84, 24, 10, 69, 42, 33, 65, 11, 24, 53, 44, 52, 38, 68,  9, 55, 82,  2,
66, 55, 59, 41, 44, 26]], device='cuda:0')
tensor([[ 3.9186,  2.9686,  4.9976,  5.7980,  4.0413,  3.9953,  3.9211,  3.9709,
5.0094, 10.3756,  8.8338,  5.9407,  4.8495,  3.8872,  0.9731,  6.5990,
5.0840,  8.3756,  5.0295,  4.0766,  2.9993,  6.2080,  5.3888,  5.3165,
5.1823,  6.1473,  6.7604,  7.5368,  8.0171,  2.1517,  3.9014,  7.5853,
9.0661,  4.1307,  4.0265,  5.0240,  5.1804,  5.9589,  4.9108,  3.0568,
0.9921,  5.6249,  7.6311,  4.8512,  3.9711,  2.0713,  3.0145,  7.2740,
7.4062,  7.9518,  3.0312,  3.8800,  3.9691,  4.0073,  3.0389,  6.5661,
9.4134,  4.9636,  5.0012,  6.6313]], device='cuda:0')
tensor([[ 4,  3,  5,  6,  4,  4,  4,  4,  5, 10,  9,  6,  5,  4,  1,  7,  5,  8,
5,  4,  3,  6,  5,  5,  5,  6,  7,  8,  8,  2,  4,  8,  9,  4,  4,  5,
5,  6,  5,  3,  1,  6,  8,  5,  4,  2,  3,  7,  7,  8,  3,  4,  4,  4,
3,  7,  9,  5,  5,  7]], device='cuda:0')
tensor(313, device='cuda:0')
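
The page does not label the log lines. One plausible reading, offered as an assumption: the first line is the phoneme sequence (60 tokens), the first tensor holds the corresponding token IDs, the second holds a predicted duration per token, the third holds those durations rounded to whole frames, and the final scalar is their sum. The arithmetic can be checked in plain Python without PyTorch:

```python
# Predicted per-token durations, copied from the second tensor in the log.
# Treating them as frame counts is an assumption, not stated on the page.
predicted = [
    3.9186, 2.9686, 4.9976, 5.7980, 4.0413, 3.9953, 3.9211, 3.9709,
    5.0094, 10.3756, 8.8338, 5.9407, 4.8495, 3.8872, 0.9731, 6.5990,
    5.0840, 8.3756, 5.0295, 4.0766, 2.9993, 6.2080, 5.3888, 5.3165,
    5.1823, 6.1473, 6.7604, 7.5368, 8.0171, 2.1517, 3.9014, 7.5853,
    9.0661, 4.1307, 4.0265, 5.0240, 5.1804, 5.9589, 4.9108, 3.0568,
    0.9921, 5.6249, 7.6311, 4.8512, 3.9711, 2.0713, 3.0145, 7.2740,
    7.4062, 7.9518, 3.0312, 3.8800, 3.9691, 4.0073, 3.0389, 6.5661,
    9.4134, 4.9636, 5.0012, 6.6313,
]

# Round each duration to the nearest integer (no value here is an exact .5 tie).
frames = [round(d) for d in predicted]
total = sum(frames)

print(len(frames))  # 60, one duration per phoneme token
print(total)        # 313, matching the final scalar in the log
```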
Version Details
Version ID
ef5a325879add6bf5f161568d9ebad53824fa3b6b7f0eacb7f300e8720e861f7
Version Created
January 23, 2025