hnesk/whisper-wordtimestamps
Transcribe audio to text with per-word timestamps. Outputs the full transcript, time-coded segments, detected language,...
Found 72 models (showing 41-60)
Transcribe speech from an input audio file into text. Use WhisperX for ASR with optional alignment to produce word-level...
Generate synchronized SRT subtitles from an audio input. Transcribe with WhisperX (faster-whisper-large-v3) and align wo...
Transcribe speech to text from an audio input. Uses OpenAI Whisper Large-v2 implemented in JAX for up to 15x faster infe...
Transcribe Spanish audio to text with optional speaker diarization and timestamps. Accepts an audio input and returns ei...
Transcribe or translate speech from audio into text with word-level timestamps and confidence scores. Support multilingu...
Transcribe multilingual audio to text and subtitles. Accepts an audio file and returns a transcription, timestamped segm...
Transcribe or translate audio to text with word-level timestamps and optional speaker diarization. Accept audio input an...
Transcribe speech to text from audio input. Accepts an audio file and optionally a source language, returns a transcript...
Transcribe audio into text with word-level timestamps and optional speaker diarization. Supports multilingual speech-to-...
Generate subtitles (.srt and .vtt) from audio files. Transcribe speech with Whisper via faster-whisper (CTranslate2) and...
Transcribe and optionally translate speech from audio to text at high speed. Leverage Whisper Large v3 via Hugging Face...
Generate subtitles from audio. Accepts an audio file and returns a transcript with detected language, optional English t...
Transcribe speech from audio into text with Whisper large-v3, supporting multilingual transcription, automatic language...
Transcribe audio to text with optional translation, word-level timestamps, and speaker diarization. Accept an audio inpu...
Transcribe speech to text with optional word-level timestamps. Accepts an audio input and returns a transcript plus dete...
Transcribe English speech from an audio input into text. Uses OpenAI Whisper medium.en and parallel batching (configurab...
Transcribe audio or video to text. Accepts an audio or video input and returns a JSON transcript or ASS subtitles, lever...
Transcribe speech from audio to text. Leverage Distil-Whisper variants (distil-large-v2, distil-medium.en) that run up t...
Transcribe audio to text with optional speaker diarization. Uses WhisperX (Whisper Large V2) for transcription and Pyann...