
konieshadow/speaker-diarization
Separate speakers in audio into labeled time segments. Accepts an audio file and optional constraints on the number of s...
Found 24 models (showing 1-20)
Separate speakers in audio into labeled time segments. Accepts an audio file and optional constraints on the number of s...
Transcribe audio to text at very high speed, with optional translation and speaker diarization. Accepts an audio input a...
Transcribe audio with speaker diarization. Accepts audio files or base64 input and returns a structured transcript with...
Transcribe speech audio to text with word-level timestamps and optional speaker diarization. Takes an audio file and out...
Transcribe and structure doctorβpatient conversations from audio input. Return a JSON string with turn-by-turn text, spe...
Transcribe audio to text. Accepts an audio input and outputs either plain text or segmented transcripts with start/end t...
Transcribe speech to text from audio or video inputs. Auto-detect language or specify one, and optionally translate the...
Transcribe and diarize noisy multi-speaker audio. Accept audio files or base64 and output structured segments with text,...
Transcribe long-form audio from multiple chunks into timestamped text. Accepts an array of audio chunks with total durat...
Identify and segment speakers in an audio recording. Takes an audio file as input and outputs structured diarization wit...
Cluster speech segments by speaker in an audio recording. Takes an audio input and a JSON list of segment records (start...
Identify and segment speakers in audio recordings. Takes an audio file as input and returns JSON with speaker-labeled ti...
Transcribe two-speaker phone calls from separate operator and customer audio tracks into a time-stamped, speaker-labeled...
Transcribe English audio and separate speakers, returning a timestamped transcript with speaker labels. Accepts an audio...
Transcribe audio to text with speaker diarization and word-level timestamps. Takes an audio file as input and returns a...
Transcribe or translate speech from video links or audio files into text with optional word- or chunk-level timestamps....
Transcribe hours-long audio to text with WhisperX large-v3, generating segment timestamps and optional word-level alignm...
Generate synchronized SRT subtitles from audio. Transcribe speech with WhisperX (faster-whisper-large-v3), align words t...
Transcribe Spanish audio to text with optional speaker diarization and timestamps. Accepts an audio input and returns ei...
Transcribe or translate audio to text with word-level timestamps and optional speaker diarization. Accept audio input an...