ttsds/amphion_naturalspeech2 📝🖼️ → 🖼️
About
The NaturalSpeech2 model by Amphion.
Example Output
Output
Performance Metrics
3.53s
Prediction Time
91.84s
Total Time
All Input Parameters
{ "text": "With tenure, Suzie'd have all the more leisure for yachting, but her publications are no good.", "speaker_reference": "https://replicate.delivery/pbxt/MN9oVKrayGMTWkp7zLYiFI4f2MxcvUNXdLPZKNm2XF6pfFCd/example.wav" }
Input Parameters
- text (required)
- speaker_reference (required)
Output Schema
Output
Example Execution Logs
W IH0 DH T EH1 N Y ER0 sp S UW1 Z IY0 D HH AE1 V AO1 L DH AH0 M AO1 R L EH1 ZH ER0 F AO1 R Y AA1 T IH0 NG sp B AH1 T HH ER0 P AH2 B L IH0 K EY1 SH AH0 N Z AA1 R N OW1 G IH0 D tensor([[80, 44, 27, 69, 30, 55, 81, 33, 84, 67, 77, 82, 48, 26, 42, 6, 79, 14, 53, 27, 9, 54, 14, 66, 53, 30, 83, 33, 40, 14, 66, 81, 2, 69, 44, 56, 84, 24, 10, 69, 42, 33, 65, 11, 24, 53, 44, 52, 38, 68, 9, 55, 82, 2, 66, 55, 59, 41, 44, 26]], device='cuda:0') tensor([[ 3.9186, 2.9686, 4.9976, 5.7980, 4.0413, 3.9953, 3.9211, 3.9709, 5.0094, 10.3756, 8.8338, 5.9407, 4.8495, 3.8872, 0.9731, 6.5990, 5.0840, 8.3756, 5.0295, 4.0766, 2.9993, 6.2080, 5.3888, 5.3165, 5.1823, 6.1473, 6.7604, 7.5368, 8.0171, 2.1517, 3.9014, 7.5853, 9.0661, 4.1307, 4.0265, 5.0240, 5.1804, 5.9589, 4.9108, 3.0568, 0.9921, 5.6249, 7.6311, 4.8512, 3.9711, 2.0713, 3.0145, 7.2740, 7.4062, 7.9518, 3.0312, 3.8800, 3.9691, 4.0073, 3.0389, 6.5661, 9.4134, 4.9636, 5.0012, 6.6313]], device='cuda:0') tensor([[ 4, 3, 5, 6, 4, 4, 4, 4, 5, 10, 9, 6, 5, 4, 1, 7, 5, 8, 5, 4, 3, 6, 5, 5, 5, 6, 7, 8, 8, 2, 4, 8, 9, 4, 4, 5, 5, 6, 5, 3, 1, 6, 8, 5, 4, 2, 3, 7, 7, 8, 3, 4, 4, 4, 3, 7, 9, 5, 5, 7]], device='cuda:0') tensor(313, device='cuda:0')
Version Details
- Version ID
ef5a325879add6bf5f161568d9ebad53824fa3b6b7f0eacb7f300e8720e861f7
- Version Created
- January 23, 2025