🤖 Model 🔊
quinten-kamphuis/forced-alignment
Align text to audio with exact word-level timestamps. Takes an audio clip and its transcript as input and outputs a list...
Found 2 models (showing 1-2)
Align text to audio with exact word-level timestamps. Takes an audio clip and its transcript as input and outputs a list...
Transcribe audio to text with per-word timestamps. Outputs the full transcript, time-coded segments, detected language,...