stayallive/whisper-subtitles ❓🖼️✓ → ❓

▶️ 5.3K runs 📅 May 2023 ⚙️ Cog 0.7.1 🔗 GitHub ⚖️ License
speech-to-text subtitle-generation

About

Generate subtitles (.srt and .vtt) from audio files using OpenAI's Whisper models.

Example Output

Output

{"preview":"We, the people of the United States, in order to form a more perfect union, establish justice, ensure domestic tranquility, provide for the common defense, promote the general welfare, and secure the blessings of liberty to ourselves and our posterity to ordain and establish this Constitution for the United States of America.","srt_file":"https://replicate.delivery/pbxt/n28D9jiq2N4INhPM0GwFUtf2PVQrC9eW26TJOOB9MmOsUu9QA/tmplcado2_xpreamble.en.srt","vtt_file":"https://replicate.delivery/pbxt/de4ylTrUeNoGeoUtjYZOmOYe9l9XjoMoD2RToDyAsQWxS52DB/tmplcado2_xpreamble.en.vtt"}

Performance Metrics

7.97s Prediction Time
412.12s Total Time
All Input Parameters
{
  "language": "en",
  "audio_path": "https://replicate.delivery/pbxt/IrETbKtxjksIYsBNRXynBGKpMxCYQzvSjsgsk3XMqp9NkvWc/preamble.wav",
  "model_name": "small.en"
}
Input Parameters
language Default: en
Language of the audio.
audio_path (required) Type: string
Audio file to generate subtitles for.
model_name Default: small
Name of the Whisper model to use.
vad_filter Type: booleanDefault: true
Enable the voice activity detection (VAD) to filter out parts of the audio without speech.
Output Schema

Output

Example Execution Logs
Transcribe with small.en model for the english language...
00:00.000 --> 00:06.100  We, the people of the United States, in order to form a more perfect union, establish justice,
00:06.100 --> 00:11.160  ensure domestic tranquility, provide for the common defense, promote the general welfare,
00:11.160 --> 00:15.840  and secure the blessings of liberty to ourselves and our posterity to ordain and establish
00:15.840 --> 00:18.520  this Constitution for the United States of America.
Version Details
Version ID
b97ba81004e7132181864c885a76cae0e56bc61caa4190a395f6d8ba45b7a969
Version Created
May 20, 2023
Run on Replicate →