daanelson/whisperx 🖼️✓🔢 → ❓
About
Accelerated transcription of audio using WhisperX

Example Output
Output
[object Object][object Object]
Performance Metrics
2.72s
Prediction Time
2.70s
Total Time
All Input Parameters
{ "audio": "https://replicate.delivery/pbxt/J5r78wKSymorzW9idAbbbJ7iXQl9GddZTwfdX5OlLJW2hLR2/OSR_uk_000_0050_8k.wav", "batch_size": 32 }
Input Parameters
- audio (required)
- Audio file
- debug
- Print out memory usage information.
- only_text
- Set if you only want to return text; otherwise, segment metadata will be returned as well.
- batch_size
- Parallelization of input audio transcription
- align_output
- Use if you need word-level timing and not just batched transcription. Only works for English atm
Output Schema
Output
Version Details
- Version ID
9aa6ecadd30610b81119fc1b6807302fd18ca6cbb39b3216f430dcf23618cedd
- Version Created
- June 30, 2023