zsxkib/hibiki 🖼️🔢 → 🖼️

▶️ 17 runs 📅 Feb 2025 ⚙️ Cog 0.13.7 🔗 GitHub 📄 Paper ⚖️ License
audio-to-audio speech-translation video-to-audio

About

Hibiki: High-Fidelity Simultaneous Speech-To-Speech Translation

Example Output

Performance Metrics

22.22s Prediction Time
65.95s Total Time
All Input Parameters
{
  "audio_input": "https://replicate.delivery/pbxt/MTZNHE3PVfheTWoSVFVA3vvKxUA0is9pgtRFYnfRnRPgQz9K/sample_fr_hibiki_monologue_otis.mp3",
  "max_duration": 0,
  "cut_start_seconds": 2,
  "volume_reduction_db": 30
}
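The example input above can be assembled and range-checked locally before it is submitted. The sketch below is one hedged way to do that: `build_hibiki_input` is a hypothetical helper (not part of the model or of any client library), and the bounds it checks are the ones listed under Input Parameters on this page. Whether the model itself rejects out-of-range values is not stated here.

```python
from typing import Any, Dict, Optional


def build_hibiki_input(
    audio_input: Optional[str] = None,
    video_input: Optional[str] = None,
    max_duration: int = 0,
    cut_start_seconds: float = 2,
    volume_reduction_db: int = 30,
) -> Dict[str, Any]:
    """Hypothetical helper: build an input dict using this page's defaults and ranges."""
    # Ranges copied from the Input Parameters listing on this page.
    if max_duration < 0:
        raise ValueError("max_duration must be >= 0 (0 means no limit)")
    if not 0 <= cut_start_seconds <= 4:
        raise ValueError("cut_start_seconds must be between 0 and 4")
    if not 0 <= volume_reduction_db <= 60:
        raise ValueError("volume_reduction_db must be between 0 and 60")

    payload: Dict[str, Any] = {
        "max_duration": max_duration,
        "cut_start_seconds": cut_start_seconds,
        "volume_reduction_db": volume_reduction_db,
    }
    # Only include the media fields that were actually provided.
    if audio_input is not None:
        payload["audio_input"] = audio_input
    if video_input is not None:
        payload["video_input"] = video_input
    return payload
```

The resulting dict could then be passed as the `input` argument of the Replicate Python client's `replicate.run`, together with the version ID listed under Version Details.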
Input Parameters
audio_input Type: string
Input audio file to translate
video_input Type: string
Optional input video file
max_duration Type: integer, Default: 0, Range: 0 - ∞
Maximum duration in seconds (0 for no limit)
cut_start_seconds Type: number, Default: 2, Range: 0 - 4
Seconds to trim from start of translated audio
volume_reduction_db Type: integer, Default: 30, Range: 0 - 60
Volume reduction for original audio (dB)
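To give a feel for what a `volume_reduction_db` value means, the sketch below converts a reduction in decibels to a linear amplitude factor using the standard relation gain = 10^(-dB/20). How the model actually applies the reduction when mixing is not documented on this page, so treat this as an illustration of the unit, not of the implementation; `db_to_gain` is a hypothetical name.

```python
def db_to_gain(reduction_db: float) -> float:
    """Convert a volume reduction in dB to a linear amplitude multiplier."""
    # Standard amplitude relation: every 20 dB of reduction divides amplitude by 10.
    return 10 ** (-reduction_db / 20)


if __name__ == "__main__":
    # Default (30 dB), plus both ends of the documented 0 - 60 range.
    for db in (0, 30, 60):
        print(f"{db} dB reduction -> amplitude x {db_to_gain(db):.4f}")
```

Under this relation the default of 30 dB scales the original audio's amplitude to roughly 3% of its level, and the maximum of 60 dB to about 0.1%.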
Output Schema

Output

Type: string, Format: uri

Example Execution Logs
[Info] retrieving checkpoint
[Info] loading mimi
[Info] mimi loaded
[Info] loading moshi
[Info] moshi loaded
[Info] loading input file /tmp/tmpd90y11yasample_fr_hibiki_monologue_otis.mp3
Info: starting the inference loop
Info: processed 795 steps in 17s, 21.72ms/step
<s> You know, I don't think there are any good or bad situations. For me, if I had to summarize my life today with you, I would say that it is first of all encounters, people who have reached out to me, perhaps at a time when I could not, where I was alone at home. And it is quite curious to think that chance encounters forge a destiny, because when we have the taste of the thing, when we have the taste of the thing done well, the beautiful gesture, sometimes we do not find the interlocutor in front of us. I would say the mirror that helps you move forward. So this is not my case, as I said there. Since I, on the contrary, I was able, and I say thank you to life, I say thank you, I sing life, I dance life, I am only love. And finally, when many people today tell me, but how do it to have this humanity? Well, I answered them very simply. I told them that this taste of love, this taste that pushed me today to undertake a mechanical construction with both hands, who knows? Maybe simply to put myself at the service of the community, to make the gift, the gift of this. That's it I'm</s>
[Info] writing /tmp/tmpd6e64nai/out_en-0 with duration 61.9 sec.
[Info] writing /tmp/tmpd6e64nai/out_en-1 with duration 63.6 sec.
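The inference-loop line in the logs above carries enough information to estimate throughput. The sketch below parses that line and derives the compute time and a real-time factor. It assumes one step corresponds to one codec frame at 12.5 Hz (the frame rate published for the Mimi codec); that mapping is an assumption, not something this page states, although 795 steps at 12.5 Hz gives 63.6 s, which is consistent with the duration logged for `out_en-1`. The function names are hypothetical.

```python
import re
from typing import Dict, Optional

# Matches lines such as: "Info: processed 795 steps in 17s, 21.72ms/step"
STEP_RE = re.compile(
    r"processed (\d+) steps in (\d+(?:\.\d+)?)s, (\d+(?:\.\d+)?)ms/step"
)

# Assumption: one inference step per codec frame, at Mimi's 12.5 Hz frame rate.
FRAME_RATE_HZ = 12.5


def parse_step_line(line: str) -> Optional[Dict[str, float]]:
    """Extract step count, rounded wall time, and ms/step from a log line."""
    m = STEP_RE.search(line)
    if m is None:
        return None
    return {
        "steps": int(m.group(1)),
        "seconds": float(m.group(2)),
        "ms_per_step": float(m.group(3)),
    }


def summarize(stats: Dict[str, float]) -> Dict[str, float]:
    """Derive audio length, compute time, and real-time factor."""
    audio_seconds = stats["steps"] / FRAME_RATE_HZ
    # Use ms/step rather than the logged whole-second total, which is rounded.
    compute_seconds = stats["steps"] * stats["ms_per_step"] / 1000
    return {
        "audio_seconds": audio_seconds,
        "compute_seconds": compute_seconds,
        "realtime_factor": audio_seconds / compute_seconds,
    }


if __name__ == "__main__":
    line = "Info: processed 795 steps in 17s, 21.72ms/step"
    stats = parse_step_line(line)
    if stats is not None:
        print(summarize(stats))
```

For the logged run this gives about 17.27 s of compute for 63.6 s of audio, so under the stated assumption the inference loop ran faster than real time.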
Version Details
Version ID
c72042d58e88a0dbcf77942f58b3b033b23f0d02d78cad22c4960afc3b8cce86
Version Created
February 10, 2025