speaker-diarization AI Models

konieshadow/speaker-diarization

Segment speakers in an audio file and return time-stamped speaker labels (diarization). Accepts audio plus optional num_...

speaker-diarization • 33 runs

🤖 Model

vaibhavs10/incredibly-fast-whisper

Transcribe or translate speech to text from audio input. Run Whisper Large v3 with batched inference and Flash Attention...

speech-to-text • speaker-diarization • 18.5M runs

🤖 Model

thomasmol/whisper-diarization

Transcribe audio with speaker diarization. Takes an audio input and returns a text transcript with per-speaker labels, s...

speech-to-text • speaker-diarization • 3.4M runs

🤖 Model

victor-upmeet/whisperx

Transcribe audio to text with word-level timestamps and optional speaker diarization. Accepts an audio file with optiona...

speech-to-text • speaker-diarization • 4.6M runs

🤖 Model

aihilums/sehatsanjha

Transcribe and structure spoken conversations from an audio input. Accept an audio file with optional session context (u...

speech-to-text • speaker-diarization • conversation-structuring • 37.7K runs

🤖 Model

daanelson/whisperx

Transcribe audio to text with fast, batched speech recognition. Accept an audio file as input and return a transcript wi...

speech-to-text • 89.6K runs

🤖 Model

jigsawstack/speech-to-text

Transcribe speech from audio or video into text. Outputs a full transcript with optional per-segment timestamps and spea...

speech-to-text • speaker-diarization • 6 runs

🤖 Model

rafaelgalle/whisper-diarization-advanced

Transcribe and diarize noisy multi-speaker audio. Accept audio files or base64 and output structured segments with text,...

speech-to-text • speaker-diarization • 107.5K runs

🤖 Model

romanfurman6/whisperx-multi-chunk

Transcribe long-form audio from multiple chunks into timestamped text. Accepts an array of audio chunks with total durat...

speech-to-text • speaker-diarization • 10 runs

🤖 Model 🔊

lucataco/speaker-diarization

Identify who spoke when in an audio file. Takes a single audio recording as input and returns a diarization JSON with sp...

🔊 • speaker-diarization • audio-analysis • 12.5K runs

🤖 Model

eaa/diarisation

Separate speakers in audio recordings. Accept an audio file and a JSON list of time segments (start, duration), and clus...

speaker-diarization • 34 runs

🤖 Model 🔊

meronym/speaker-diarization

Segment speakers in audio recordings. Take an audio file and return time-stamped speech segments labeled by speaker, the...

🔊 • speaker-diarization • audio-embedding • 778.2K runs

🤖 Model

skripnik/call-transcriber

Transcribe two-speaker phone calls with timestamps and speaker labels. Accepts two audio tracks (operator and customer)...

speech-to-text • call-transcription • 15 runs

🤖 Model 🔊

meronym/speaker-transcription

Transcribe English speech from an audio input and label speakers with diarization. Return structured JSON with timestamp...

🔊 • speech-to-text • speaker-diarization • audio-embedding • 28.3K runs

🤖 Model

sparkdoaz/www

Transcribe audio to text with speaker diarization and word-level timestamps. Takes an audio file as input and returns a...

speech-to-text • speaker-diarization • 171 runs

🤖 Model 🎥

turian/insanely-fast-whisper-with-video

Transcribe or translate speech from audio files and videos to text. Accept audio or video input and return a transcript...

🎥 • speech-to-text • video-to-text • speaker-diarization • 8.6M runs

🤖 Model

victor-upmeet/whisperx-a40-large

Transcribe hours-long audio to text with WhisperX large-v3, generating segment timestamps and optional word-level alignm...

speech-to-text • speaker-diarization • 710.2K runs

🤖 Model

dashed/whisperx-subtitles-replicate

Generate synchronized SRT subtitles from an audio input. Transcribe with WhisperX (faster-whisper-large-v3) and align wo...

speech-to-text • speaker-diarization • subtitle-generation • 23.2K runs

🤖 Model

mercurio005/whisperx-spanish

Transcribe Spanish audio to text with optional speaker diarization and timestamps. Accepts an audio input and returns ei...

speech-to-text • speaker-diarization • spanish • 44.8K runs

🤖 Model

awerks/whisperx

Transcribe or translate audio to text with word-level timestamps and optional speaker diarization. Accept audio input an...

speech-to-text • speaker-diarization • word-level-timestamps • 14.7K runs