nvidia/parakeet-rnnt-1.1b 🔊 → 📝
About
🗣️ Speech-to-text (ASR) model from NVIDIA and Suno.ai, built for accurate and efficient transcription 📝

Example Output
Output
well i don't wish to see it any more observed phoebe turning away her eyes it is certainly very like the old portrait
Performance Metrics
- Prediction Time: 1.43s
- Total Time: 100.03s
Input Parameters
- audio_file (required): Input audio file to be transcribed by the ASR model
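As a minimal sketch of how a caller might prepare this input, the snippet below validates a local path and builds a mapping with the single documented key, audio_file. The helper name build_input is hypothetical, and the page does not say how the mapping is submitted to the hosting service, so that step is left out.

```python
from pathlib import Path


def build_input(audio_path):
    """Return the input mapping for one transcription request.

    Hypothetical helper: only the key name "audio_file" comes from the
    parameter list above; everything else is an assumption.
    """
    path = Path(audio_path)
    if not path.is_file():
        # audio_file is required, so fail early on a missing file.
        raise FileNotFoundError(f"audio file not found: {path}")
    return {"audio_file": str(path)}
```

Failing before any request is made keeps a bad path from costing a full cold-start wait.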
Output Schema
Output
Example Execution Logs
[!] Audio file /tmp/tmp5b0ke4ee2086-149220-0033.wav is mono: True
[!] Audio file /tmp/tmp5b0ke4ee2086-149220-0033.wav is already mono
Transcribing: 0%| | 0/1 [00:00<?, ?it/s]
Transcribing: 100%|██████████| 1/1 [00:00<00:00, 1.23it/s]
Transcribing: 100%|██████████| 1/1 [00:00<00:00, 1.23it/s]
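The log lines suggest that the service checks whether the uploaded audio is mono before transcribing. The service's own code is not shown on this page, so the following is only a sketch of such a check using Python's standard wave module. The function names is_mono and to_mono are hypothetical, and the downmix handles 16-bit PCM WAV files only.

```python
import array
import sys
import wave


def is_mono(path):
    """Return True if the WAV file at `path` has exactly one channel."""
    with wave.open(path, "rb") as wav:
        return wav.getnchannels() == 1


def to_mono(src, dst):
    """Write a mono copy of a 16-bit PCM WAV file by averaging its channels."""
    with wave.open(src, "rb") as wav:
        channels = wav.getnchannels()
        framerate = wav.getframerate()
        if wav.getsampwidth() != 2:
            raise ValueError("this sketch only handles 16-bit PCM audio")
        samples = array.array("h")
        samples.frombytes(wav.readframes(wav.getnframes()))
    if sys.byteorder == "big":
        samples.byteswap()  # WAV sample data is little-endian

    # Each frame holds one sample per channel; average them into one sample.
    mono = array.array(
        "h",
        (sum(samples[i:i + channels]) // channels
         for i in range(0, len(samples), channels)),
    )
    if sys.byteorder == "big":
        mono.byteswap()

    with wave.open(dst, "wb") as out:
        out.setnchannels(1)
        out.setsampwidth(2)
        out.setframerate(framerate)
        out.writeframes(mono.tobytes())
```

A caller could run is_mono first and call to_mono only when it returns False, which mirrors the "is already mono" message in the logs.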
Version Details
- Version ID: 73ddbebaef172a47c8dfdd79381f110bfdc7691bcc7a4edde82f0a39e380ce50
- Version Created: January 4, 2024