stayallive/whisper-subtitles
About
Generate subtitles (.srt and .vtt) from audio files using OpenAI's Whisper models.

Example Output
{
  "preview": "We, the people of the United States, in order to form a more perfect union, establish justice, ensure domestic tranquility, provide for the common defense, promote the general welfare, and secure the blessings of liberty to ourselves and our posterity to ordain and establish this Constitution for the United States of America.",
  "srt_file": "https://replicate.delivery/pbxt/n28D9jiq2N4INhPM0GwFUtf2PVQrC9eW26TJOOB9MmOsUu9QA/tmplcado2_xpreamble.en.srt",
  "vtt_file": "https://replicate.delivery/pbxt/de4ylTrUeNoGeoUtjYZOmOYe9l9XjoMoD2RToDyAsQWxS52DB/tmplcado2_xpreamble.en.vtt"
}
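As a rough illustration of consuming this output, the sketch below parses the JSON and derives a local file name for each subtitle URL. The keys `srt_file` and `vtt_file` come from the example above; the helper name `subtitle_files` and the `example.com` URLs are hypothetical placeholders, not part of the model.

```python
import json
from pathlib import PurePosixPath
from urllib.parse import urlparse


def subtitle_files(output_json: str) -> dict[str, str]:
    """Map each subtitle URL in the output to the file name at the end of its path."""
    output = json.loads(output_json)
    files = {}
    for key in ("srt_file", "vtt_file"):
        url = output.get(key)
        if url:  # skip keys that are missing or empty
            files[url] = PurePosixPath(urlparse(url).path).name
    return files


# Placeholder output with the same keys as the example; the URLs are made up.
example = json.dumps({
    "preview": "We, the people of the United States, ...",
    "srt_file": "https://example.com/out/preamble.en.srt",
    "vtt_file": "https://example.com/out/preamble.en.vtt",
})
print(subtitle_files(example))
```

The returned mapping can then be handed to any HTTP client to download each file under its derived name.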
Performance Metrics
- Prediction Time: 7.97s
- Total Time: 412.12s
All Input Parameters
{ "language": "en", "audio_path": "https://replicate.delivery/pbxt/IrETbKtxjksIYsBNRXynBGKpMxCYQzvSjsgsk3XMqp9NkvWc/preamble.wav", "model_name": "small.en" }
Input Parameters
- language: Language of the audio.
- audio_path (required): Audio file to generate subtitles for.
- model_name: Name of the Whisper model to use.
- vad_filter: Enable voice activity detection (VAD) to filter out parts of the audio without speech.
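The parameters above can be assembled into an input payload along these lines. This is a minimal sketch: the helper `build_input` is hypothetical, and treating `vad_filter` as a boolean is an assumption based on its description, since the page does not state parameter types or defaults.

```python
from typing import Optional


def build_input(
    audio_path: str,
    language: Optional[str] = None,
    model_name: Optional[str] = None,
    vad_filter: Optional[bool] = None,  # assumed to be a boolean flag
) -> dict:
    """Build the input payload, sending only the parameters that were set."""
    if not audio_path:
        raise ValueError("audio_path is required")
    payload = {"audio_path": audio_path}
    optional = {
        "language": language,
        "model_name": model_name,
        "vad_filter": vad_filter,
    }
    # Omit unset options so the model falls back to its own defaults.
    payload.update({k: v for k, v in optional.items() if v is not None})
    return payload


# Placeholder audio URL; language and model name match the example run.
print(build_input("https://example.com/preamble.wav", language="en", model_name="small.en"))
```

With the Replicate Python client installed and an API token configured, a dictionary like this would typically be passed as the `input` argument of `replicate.run`, together with the model name and version ID.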
Output Schema
Output
Example Execution Logs
Transcribe with small.en model for the english language...
00:00.000 --> 00:06.100 We, the people of the United States, in order to form a more perfect union, establish justice,
00:06.100 --> 00:11.160 ensure domestic tranquility, provide for the common defense, promote the general welfare,
00:11.160 --> 00:15.840 and secure the blessings of liberty to ourselves and our posterity to ordain and establish
00:15.840 --> 00:18.520 this Constitution for the United States of America.
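To show how timed segments like those in the log relate to the .srt format, here is a small sketch that extracts the cues from log text and renders them as SRT blocks (index, `HH:MM:SS,mmm` time range, text). It is an illustration only and not necessarily how the model writes its own subtitle files; the names `to_srt_time` and `log_to_srt` are hypothetical.

```python
import re

# A timestamp as printed in the log: MM:SS.mmm, with an optional HH: prefix.
TS = r"(?:\d{2}:)?\d{2}:\d{2}\.\d{3}"
# One cue: start, end, then text up to the next cue or the end of the log.
CUE = re.compile(rf"({TS}) --> ({TS})\s+(.*?)(?=\s*{TS} -->|\Z)", re.S)


def to_srt_time(stamp: str) -> str:
    """Convert MM:SS.mmm or HH:MM:SS.mmm to the SRT form HH:MM:SS,mmm."""
    parts = stamp.split(":")
    if len(parts) == 2:  # no hours field in the log timestamp
        parts.insert(0, "00")
    return ":".join(parts).replace(".", ",")


def log_to_srt(log: str) -> str:
    """Render every cue found in the log text as a numbered SRT block."""
    blocks = []
    for index, match in enumerate(CUE.finditer(log), start=1):
        start, end, text = match.groups()
        blocks.append(
            f"{index}\n{to_srt_time(start)} --> {to_srt_time(end)}\n{text.strip()}"
        )
    return "\n\n".join(blocks) + "\n" if blocks else ""


log = (
    "Transcribe with small.en model for the english language...\n"
    "00:00.000 --> 00:06.100 We, the people of the United States,\n"
    "00:06.100 --> 00:11.160 ensure domestic tranquility,\n"
)
print(log_to_srt(log))
```

The pattern works on the log whether each cue is on its own line or all cues run together on one line, because a cue's text ends wherever the next timestamp pair begins.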
Version Details
- Version ID: b97ba81004e7132181864c885a76cae0e56bc61caa4190a395f6d8ba45b7a969
- Version Created: May 20, 2023