erium/whisperx
Transcribe audio into text with word-level timestamps and optional speaker diarization. Supports multilingual speech-to-...
Found 27 models (showing 21-27)
Transcribe audio into text with word-level timestamps and optional speaker diarization. Supports multilingual speech-to-...
Transcribe and optionally translate speech from audio to text at high speed. Leverage Whisper Large v3 via Hugging Face...
Transcribe audio to text with optional translation, word-level timestamps, and speaker diarization. Accept an audio inpu...
Transcribe audio to text with optional speaker diarization. Uses WhisperX (Whisper Large V2) for transcription and Pyann...
Transcribe speech to text from an audio input. Optionally translate to English, perform speaker diarization, and bias re...
Transcribe multilingual audio with speaker diarization and channel separation. Accepts an audio file and outputs text tr...
Identify and segment speakers in an audio recording. Accepts an audio file and outputs JSON with time-stamped segments (...