lucataco/seamless_communication

1.1K runs, Dec 2023, Cog 0.8.6

speech-to-text speech-translation text-to-speech

Performance

11.3s Typical run time

~137s Cold start (first call)

1.1K Total runs

About

FacebookResearch/SeamlessM4T v2 - Massively Multilingual & Multimodal Machine Translation

Example Output

Output

{"text_output":"Mon animal préféré, c'est l'éléphant.","audio_output":"https://replicate.delivery/pbxt/6dPxbnvciRKmNBVlfHGnuelSwPA6dQSAKf05fjxB0696e5egE/out.wav"}

Performance Metrics

11.32s Prediction Time

136.59s Total Time

All Input Parameters

{
  "task_name": "S2ST (Speech to Speech translation)",
  "input_audio": "https://replicate.delivery/pbxt/K4oyjNRg7zgO3bfT9LKI9of4A6w9reAlXkyWzZeZONrz2mVY/demo-speech.mp3",
  "input_text_language": "English",
  "max_input_audio_length": 60,
  "target_language_text_only": "French",
  "target_language_with_speech": "French"
}

Input Parameters

task_name Default: S2ST (Speech to Speech translation): Choose a task
input_text Type: string: Provide input for tasks with text: T2ST and T2TT
input_audio Type: string: Provide input file for tasks with speech input: S2ST, S2TT and ASR
input_text_language Default: English: Specify language of the input_text for T2ST and T2TT
max_input_audio_length Type: number, Default: 60: Set maximum input audio length.
target_language_text_only Default: French: Set target language for tasks with text output only: S2TT, T2TT and ASR.
target_language_with_speech Default: French: Set target language for tasks with speech output: S2ST or T2ST. Fewer languages are available for speech output than for text output.
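The parameter descriptions above imply that each task reads only some of the fields: speech-input tasks use input_audio, text-input tasks use input_text, and the target-language field depends on whether the task produces speech. The following is a minimal Python sketch of that selection logic, not the model's own code. `build_input` and `run_prediction` are hypothetical helper names, and the sketch assumes every task_name value starts with its abbreviation, as the default "S2ST (Speech to Speech translation)" does. The call to `replicate.run` assumes the official `replicate` Python client is installed and that REPLICATE_API_TOKEN is set; the version ID is the one listed on this page.

```python
# Task groups, taken from the parameter descriptions on this page.
SPEECH_INPUT_TASKS = {"S2ST", "S2TT", "ASR"}
TEXT_INPUT_TASKS = {"T2ST", "T2TT"}
SPEECH_OUTPUT_TASKS = {"S2ST", "T2ST"}

# Model reference built from the version ID listed on this page.
MODEL_VERSION = (
    "lucataco/seamless_communication:"
    "b61de43a89a30bb31baa14ba81647303accb8220975ea91268a447650f013298"
)


def build_input(task_name, source, target_language="French",
                input_text_language="English", max_input_audio_length=60):
    """Build an input dict for one task (hypothetical helper).

    `source` is an audio URL for speech-input tasks and plain text for
    text-input tasks.
    """
    # Assumption: task_name begins with its abbreviation, e.g. "S2ST (...)".
    parts = task_name.split()
    abbrev = parts[0] if parts else ""
    if abbrev not in SPEECH_INPUT_TASKS | TEXT_INPUT_TASKS:
        raise ValueError(f"unrecognised task: {task_name!r}")

    payload = {"task_name": task_name}

    # Pick the input field that matches the task's input modality.
    if abbrev in SPEECH_INPUT_TASKS:
        payload["input_audio"] = source
        payload["max_input_audio_length"] = max_input_audio_length
    else:
        payload["input_text"] = source
        payload["input_text_language"] = input_text_language

    # Pick the target-language field that matches the output modality.
    if abbrev in SPEECH_OUTPUT_TASKS:
        payload["target_language_with_speech"] = target_language
    else:
        payload["target_language_text_only"] = target_language
    return payload


def run_prediction(payload):
    """Send the payload to Replicate (needs network access and an API token)."""
    import replicate  # imported lazily so build_input works without the client
    return replicate.run(MODEL_VERSION, input=payload)
```

Calling `build_input("S2ST (Speech to Speech translation)", audio_url)` yields a payload with the same task, audio, and speech target-language fields as the example input shown earlier, without the text-only fields that S2ST does not read.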

Output Schema

text_output Type: string: Text Output
audio_output Type: string, Format: uri: Audio Output
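As a rough illustration of consuming this schema, the Python sketch below reads the two fields from the example output shown on this page. `parse_output` and `download_audio` are hypothetical helper names. The sketch accepts either a JSON string or an already-parsed dict, since the form a client library returns is an assumption here, and the audio URL may no longer resolve.

```python
import json
import urllib.request

# Example output copied from the "Example Output" section of this page.
EXAMPLE_OUTPUT = """{"text_output":"Mon animal préféré, c'est l'éléphant.","audio_output":"https://replicate.delivery/pbxt/6dPxbnvciRKmNBVlfHGnuelSwPA6dQSAKf05fjxB0696e5egE/out.wav"}"""


def parse_output(result):
    """Return (text_output, audio_output) from a JSON string or a dict."""
    data = json.loads(result) if isinstance(result, str) else result
    # Either field may be missing, so fall back to None.
    return data.get("text_output"), data.get("audio_output")


def download_audio(url, path="out.wav"):
    """Save the audio at `url` to `path` (needs network access)."""
    urllib.request.urlretrieve(url, path)
    return path


text, audio_url = parse_output(EXAMPLE_OUTPUT)
print(text)
print(audio_url)
```

`download_audio` is deliberately not called here, so the parsing step can be tried offline.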

Version Details

Version ID: b61de43a89a30bb31baa14ba81647303accb8220975ea91268a447650f013298
Version Created: December 19, 2023
