suno-ai/bark 📝🔢✓❓🖼️ → ❓
About
🔊 Text-Prompted Generative Audio Model

Example Output
Prompt:
"Hello, my name is Suno. And, uh — and I like pizza. [laughs] But I also have other interests such as playing tic tac toe."
Output
Performance Metrics
44.95s
Prediction Time
452.85s
Total Time
Input Parameters
- prompt
- Input prompt
- text_temp
- generation temperature (1.0 more diverse, 0.0 more conservative)
- output_full
- return full generation as a .npz file to be used as a history prompt
- waveform_temp
- generation temperature (1.0 more diverse, 0.0 more conservative)
- history_prompt
- history choice for audio cloning, choose from the list
- custom_history_prompt
- Provide your own .npz file with history choice for audio cloning, this will override the previous history_prompt setting
Output Schema
Output
Example Execution Logs
0%| | 0/100 [00:00<?, ?it/s] 1%| | 1/100 [00:00<00:27, 3.65it/s] 3%|▎ | 3/100 [00:00<00:13, 7.12it/s] 5%|▌ | 5/100 [00:00<00:10, 8.90it/s] 7%|▋ | 7/100 [00:00<00:09, 9.98it/s] 9%|▉ | 9/100 [00:00<00:08, 10.54it/s] 11%|█ | 11/100 [00:01<00:08, 11.09it/s] 13%|█▎ | 13/100 [00:01<00:07, 11.16it/s] 15%|█▌ | 15/100 [00:01<00:07, 11.32it/s] 17%|█▋ | 17/100 [00:01<00:07, 11.66it/s] 19%|█▉ | 19/100 [00:01<00:06, 11.69it/s] 21%|██ | 21/100 [00:01<00:06, 12.05it/s] 23%|██▎ | 23/100 [00:02<00:06, 12.17it/s] 25%|██▌ | 25/100 [00:02<00:06, 11.89it/s] 27%|██▋ | 27/100 [00:02<00:06, 11.64it/s] 29%|██▉ | 29/100 [00:02<00:06, 11.51it/s] 31%|███ | 31/100 [00:02<00:05, 11.61it/s] 33%|███▎ | 33/100 [00:02<00:05, 11.88it/s] 35%|███▌ | 35/100 [00:03<00:05, 11.82it/s] 37%|███▋ | 37/100 [00:03<00:05, 11.75it/s] 39%|███▉ | 39/100 [00:03<00:05, 11.63it/s] 41%|████ | 41/100 [00:03<00:05, 11.64it/s] 43%|████▎ | 43/100 [00:03<00:04, 11.79it/s] 45%|████▌ | 45/100 [00:04<00:04, 11.94it/s] 47%|████▋ | 47/100 [00:04<00:04, 11.83it/s] 49%|████▉ | 49/100 [00:04<00:04, 11.89it/s] 51%|█████ | 51/100 [00:04<00:04, 11.70it/s] 53%|█████▎ | 53/100 [00:04<00:04, 11.56it/s] 55%|█████▌ | 55/100 [00:04<00:03, 11.64it/s] 57%|█████▋ | 57/100 [00:05<00:03, 11.63it/s] 59%|█████▉ | 59/100 [00:05<00:03, 11.38it/s] 61%|██████ | 61/100 [00:05<00:03, 11.38it/s] 63%|██████▎ | 63/100 [00:05<00:03, 11.07it/s] 65%|██████▌ | 65/100 [00:05<00:03, 11.19it/s] 67%|██████▋ | 67/100 [00:05<00:02, 11.28it/s] 69%|██████▉ | 69/100 [00:06<00:02, 11.08it/s] 71%|███████ | 71/100 [00:06<00:02, 11.11it/s] 73%|███████▎ | 73/100 [00:06<00:02, 10.91it/s] 75%|███████▌ | 75/100 [00:06<00:02, 10.78it/s] 77%|███████▋ | 77/100 [00:06<00:02, 10.83it/s] 79%|███████▉ | 79/100 [00:07<00:01, 10.86it/s] 81%|████████ | 81/100 [00:07<00:01, 10.64it/s] 83%|████████▎ | 83/100 [00:07<00:01, 10.67it/s] 85%|████████▌ | 85/100 [00:07<00:01, 10.57it/s] 87%|████████▋ | 87/100 [00:07<00:01, 10.34it/s] 89%|████████▉ | 89/100 [00:08<00:01, 10.39it/s] 91%|█████████ | 91/100 [00:08<00:00, 10.10it/s] 93%|█████████▎| 93/100 [00:08<00:00, 10.06it/s] 100%|██████████| 100/100 [00:08<00:00, 20.32it/s] 100%|██████████| 100/100 [00:08<00:00, 11.69it/s] 0%| | 0/36 [00:00<?, ?it/s] 3%|▎ | 1/36 [00:00<00:23, 1.48it/s] 6%|▌ | 2/36 [00:01<00:22, 1.48it/s] 8%|▊ | 3/36 [00:02<00:22, 1.45it/s] 11%|█ | 4/36 [00:02<00:22, 1.43it/s] 14%|█▍ | 5/36 [00:03<00:21, 1.41it/s] 17%|█▋ | 6/36 [00:04<00:22, 1.36it/s] 19%|█▉ | 7/36 [00:05<00:21, 1.34it/s] 22%|██▏ | 8/36 [00:05<00:21, 1.30it/s] 25%|██▌ | 9/36 [00:06<00:21, 1.25it/s] 28%|██▊ | 10/36 [00:07<00:21, 1.23it/s] 31%|███ | 11/36 [00:08<00:21, 1.19it/s] 33%|███▎ | 12/36 [00:09<00:20, 1.15it/s] 36%|███▌ | 13/36 [00:10<00:20, 1.13it/s] 39%|███▉ | 14/36 [00:11<00:19, 1.12it/s] 42%|████▏ | 15/36 [00:12<00:18, 1.11it/s] 44%|████▍ | 16/36 [00:13<00:18, 1.10it/s] 47%|████▋ | 17/36 [00:14<00:17, 1.10it/s] 50%|█████ | 18/36 [00:14<00:16, 1.10it/s] 53%|█████▎ | 19/36 [00:15<00:15, 1.09it/s] 56%|█████▌ | 20/36 [00:16<00:14, 1.08it/s] 58%|█████▊ | 21/36 [00:17<00:13, 1.08it/s] 61%|██████ | 22/36 [00:18<00:12, 1.08it/s] 64%|██████▍ | 23/36 [00:19<00:11, 1.09it/s] 67%|██████▋ | 24/36 [00:20<00:11, 1.08it/s] 69%|██████▉ | 25/36 [00:21<00:10, 1.08it/s] 72%|███████▏ | 26/36 [00:22<00:09, 1.08it/s] 75%|███████▌ | 27/36 [00:23<00:08, 1.08it/s] 78%|███████▊ | 28/36 [00:24<00:07, 1.08it/s] 81%|████████ | 29/36 [00:25<00:06, 1.08it/s] 83%|████████▎ | 30/36 [00:26<00:05, 1.08it/s] 86%|████████▌ | 31/36 [00:26<00:04, 1.08it/s] 89%|████████▉ | 32/36 [00:27<00:03, 1.08it/s] 92%|█████████▏| 33/36 [00:28<00:02, 1.08it/s] 94%|█████████▍| 34/36 [00:29<00:01, 1.08it/s] 97%|█████████▋| 35/36 [00:30<00:00, 1.08it/s] 100%|██████████| 36/36 [00:31<00:00, 1.08it/s] 100%|██████████| 36/36 [00:31<00:00, 1.14it/s]
Version Details
- Version ID
b76242b40d67c76ab6742e987628a2a9ac019e11d56ab96c4e91ce03b79b2787
- Version Created
- April 27, 2023