lucataco/seamless_communication ❓📝🖼️🔢 → ❓
About
FacebookResearch/SeamlessM4T v2 - Massively Multilingual & Multimodal Machine Translation

Example Output
Output
{"text_output":"Mon animal préféré, c'est l'éléphant.","audio_output":"https://replicate.delivery/pbxt/6dPxbnvciRKmNBVlfHGnuelSwPA6dQSAKf05fjxB0696e5egE/out.wav"}
Performance Metrics
11.32s
Prediction Time
136.59s
Total Time
All Input Parameters
{ "task_name": "S2ST (Speech to Speech translation)", "input_audio": "https://replicate.delivery/pbxt/K4oyjNRg7zgO3bfT9LKI9of4A6w9reAlXkyWzZeZONrz2mVY/demo-speech.mp3", "input_text_language": "English", "max_input_audio_length": 60, "target_language_text_only": "French", "target_language_with_speech": "French" }
Input Parameters
- task_name
- Choose a task
- input_text
- Provide input for tasks with text: T2ST and T2TT
- input_audio
- Provide input file for tasks with speech input: S2ST, S2TT and ASR
- input_text_language
- Specify language of the input_text for T2ST and T2TT
- max_input_audio_length
- Set maximum input audio length.
- target_language_text_only
- Set target language for tasks with text output only: S2TT, T2TT and ASR.
- target_language_with_speech
- Set target language for tasks with speech output: S2ST or T2ST. Less languages are available for speech compared to text output.
Output Schema
- text_output
- Text Output
- audio_output
- Audio Output
Version Details
- Version ID
b61de43a89a30bb31baa14ba81647303accb8220975ea91268a447650f013298
- Version Created
- December 19, 2023