chenxwh/openvoice

▶️ 91.7K runs 📅 Jan 2024 ⚙️ Cog v0.9.6+dev

multilingual multilingual-tts speech-style-transfer text-to-speech voice-cloning

Performance

Typical run time: 5.8s

Total runs: 91.7K

About

Updated to OpenVoice v2: Versatile Instant Voice Cloning

Example Output

Output

Performance Metrics

5.78s Prediction Time

5.84s Total Time

All Input Parameters

{
  "text": "Did you ever hear a folk tale about a giant turtle?",
  "audio": "https://replicate.delivery/pbxt/KpK4hkLwhVAJE9K0DAbZP3YfwLzJyLl09kuPnc4MvCYLcX8m/example_reference.mp3",
  "speed": 1,
  "language": "EN_NEWEST"
}

Input Parameters

text (Type: string, Default: "Did you ever hear a folk tale about a giant turtle?"): Input text
audio (required, Type: string): Input reference audio
speed (Type: number, Default: 1): Speed scale of the output audio
language (Default: EN_NEWEST): Language of the audio to be generated
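
As a rough illustration of how these parameters fit together, the sketch below assembles the input payload in Python and passes it to the Replicate Python client. The helper names `build_input` and `run_openvoice` are hypothetical and not part of the model; the model reference is composed from the owner/name and version ID listed on this page. It assumes the third-party `replicate` package is installed and a `REPLICATE_API_TOKEN` is set in the environment.

```python
from typing import Any, Dict

# Owner/name and version ID as listed on this page.
MODEL_REF = (
    "chenxwh/openvoice:"
    "d548923c9d7fc9330a3b7c7f9e2f91b2ee90c83311a351dfcd32af353799223d"
)
DEFAULT_TEXT = "Did you ever hear a folk tale about a giant turtle?"


def build_input(
    audio: str,
    text: str = DEFAULT_TEXT,
    speed: float = 1,
    language: str = "EN_NEWEST",
) -> Dict[str, Any]:
    """Assemble the input payload (hypothetical helper).

    Only `audio` is marked as required on the page; the other
    parameters fall back to their documented defaults.
    """
    if not audio:
        raise ValueError("audio (reference audio URL) is required")
    return {"text": text, "audio": audio, "speed": speed, "language": language}


def run_openvoice(payload: Dict[str, Any]) -> Any:
    """Send the payload to Replicate (hypothetical helper).

    The import is deferred so the payload helper above can be used
    without the `replicate` package installed.
    """
    import replicate  # third-party: pip install replicate

    return replicate.run(MODEL_REF, input=payload)
```

A call would then look like `run_openvoice(build_input("https://.../reference.mp3"))`, with the reference audio URL replaced by your own file.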

Output Schema

Output

Type: string • Format: uri
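
Since the schema describes the output as a URI string, one way to use the result is to download it to a local file. The following is a minimal sketch using only the Python standard library. The function names and the `output.wav` fallback filename are assumptions made for illustration (the page does not state the audio format), and some client library versions may return a file-like object instead of a plain string.

```python
import os
import urllib.request
from urllib.parse import urlparse


def is_uri(value: str) -> bool:
    """Return True if the value looks like an http(s) URI."""
    parts = urlparse(value)
    return parts.scheme in ("http", "https") and bool(parts.netloc)


def local_name(uri: str, fallback: str = "output.wav") -> str:
    """Derive a local filename from the URI path.

    The fallback name is an assumption, used when the path has no
    final component.
    """
    name = os.path.basename(urlparse(uri).path)
    return name or fallback


def save_output(uri: str, directory: str = ".") -> str:
    """Download the output URI into `directory` and return the file path."""
    if not is_uri(uri):
        raise ValueError(f"not an http(s) URI: {uri!r}")
    dest = os.path.join(directory, local_name(uri))
    urllib.request.urlretrieve(uri, dest)
    return dest
```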

Example Execution Logs

OpenVoice version: v2
> Text split to sentences.
Did you ever hear a folk tale about a giant turtle?
> ===========================
  0%|          | 0/1 [00:00<?, ?it/s]
100%|██████████| 1/1 [00:00<00:00, 15.59it/s]

Version Details

Version ID: d548923c9d7fc9330a3b7c7f9e2f91b2ee90c83311a351dfcd32af353799223d
Version Created: May 18, 2024
