suno-ai/bark 📝🔢✓❓🖼️ → ❓

▶️ 301.9K runs 📅 Apr 2023 ⚙️ Cog 0.7.0-beta19 🔗 GitHub ⚖️ License
music-generation sound-effect-generation text-to-speech

About

🔊 Text-Prompted Generative Audio Model

Example Output

Prompt:

"Hello, my name is Suno. And, uh — and I like pizza. [laughs] But I also have other interests such as playing tic tac toe."

Output

Example output

Performance Metrics

44.95s Prediction Time
452.85s Total Time
Input Parameters
prompt Type: stringDefault: Hello, my name is Suno. And, uh — and I like pizza. [laughs] But I also have other interests such as playing tic tac toe.
Input prompt
text_temp Type: numberDefault: 0.7
generation temperature (1.0 more diverse, 0.0 more conservative)
output_full Type: booleanDefault: false
return full generation as a .npz file to be used as a history prompt
waveform_temp Type: numberDefault: 0.7
generation temperature (1.0 more diverse, 0.0 more conservative)
history_prompt
history choice for audio cloning, choose from the list
custom_history_prompt Type: string
Provide your own .npz file with history choice for audio cloning, this will override the previous history_prompt setting
Output Schema

Output

Example Execution Logs
0%|          | 0/100 [00:00<?, ?it/s]
  1%|          | 1/100 [00:00<00:27,  3.65it/s]
  3%|▎         | 3/100 [00:00<00:13,  7.12it/s]
  5%|▌         | 5/100 [00:00<00:10,  8.90it/s]
  7%|▋         | 7/100 [00:00<00:09,  9.98it/s]
  9%|▉         | 9/100 [00:00<00:08, 10.54it/s]
 11%|█         | 11/100 [00:01<00:08, 11.09it/s]
 13%|█▎        | 13/100 [00:01<00:07, 11.16it/s]
 15%|█▌        | 15/100 [00:01<00:07, 11.32it/s]
 17%|█▋        | 17/100 [00:01<00:07, 11.66it/s]
 19%|█▉        | 19/100 [00:01<00:06, 11.69it/s]
 21%|██        | 21/100 [00:01<00:06, 12.05it/s]
 23%|██▎       | 23/100 [00:02<00:06, 12.17it/s]
 25%|██▌       | 25/100 [00:02<00:06, 11.89it/s]
 27%|██▋       | 27/100 [00:02<00:06, 11.64it/s]
 29%|██▉       | 29/100 [00:02<00:06, 11.51it/s]
 31%|███       | 31/100 [00:02<00:05, 11.61it/s]
 33%|███▎      | 33/100 [00:02<00:05, 11.88it/s]
 35%|███▌      | 35/100 [00:03<00:05, 11.82it/s]
 37%|███▋      | 37/100 [00:03<00:05, 11.75it/s]
 39%|███▉      | 39/100 [00:03<00:05, 11.63it/s]
 41%|████      | 41/100 [00:03<00:05, 11.64it/s]
 43%|████▎     | 43/100 [00:03<00:04, 11.79it/s]
 45%|████▌     | 45/100 [00:04<00:04, 11.94it/s]
 47%|████▋     | 47/100 [00:04<00:04, 11.83it/s]
 49%|████▉     | 49/100 [00:04<00:04, 11.89it/s]
 51%|█████     | 51/100 [00:04<00:04, 11.70it/s]
 53%|█████▎    | 53/100 [00:04<00:04, 11.56it/s]
 55%|█████▌    | 55/100 [00:04<00:03, 11.64it/s]
 57%|█████▋    | 57/100 [00:05<00:03, 11.63it/s]
 59%|█████▉    | 59/100 [00:05<00:03, 11.38it/s]
 61%|██████    | 61/100 [00:05<00:03, 11.38it/s]
 63%|██████▎   | 63/100 [00:05<00:03, 11.07it/s]
 65%|██████▌   | 65/100 [00:05<00:03, 11.19it/s]
 67%|██████▋   | 67/100 [00:05<00:02, 11.28it/s]
 69%|██████▉   | 69/100 [00:06<00:02, 11.08it/s]
 71%|███████   | 71/100 [00:06<00:02, 11.11it/s]
 73%|███████▎  | 73/100 [00:06<00:02, 10.91it/s]
 75%|███████▌  | 75/100 [00:06<00:02, 10.78it/s]
 77%|███████▋  | 77/100 [00:06<00:02, 10.83it/s]
 79%|███████▉  | 79/100 [00:07<00:01, 10.86it/s]
 81%|████████  | 81/100 [00:07<00:01, 10.64it/s]
 83%|████████▎ | 83/100 [00:07<00:01, 10.67it/s]
 85%|████████▌ | 85/100 [00:07<00:01, 10.57it/s]
 87%|████████▋ | 87/100 [00:07<00:01, 10.34it/s]
 89%|████████▉ | 89/100 [00:08<00:01, 10.39it/s]
 91%|█████████ | 91/100 [00:08<00:00, 10.10it/s]
 93%|█████████▎| 93/100 [00:08<00:00, 10.06it/s]
100%|██████████| 100/100 [00:08<00:00, 20.32it/s]
100%|██████████| 100/100 [00:08<00:00, 11.69it/s]
  0%|          | 0/36 [00:00<?, ?it/s]
  3%|▎         | 1/36 [00:00<00:23,  1.48it/s]
  6%|▌         | 2/36 [00:01<00:22,  1.48it/s]
  8%|▊         | 3/36 [00:02<00:22,  1.45it/s]
 11%|█         | 4/36 [00:02<00:22,  1.43it/s]
 14%|█▍        | 5/36 [00:03<00:21,  1.41it/s]
 17%|█▋        | 6/36 [00:04<00:22,  1.36it/s]
 19%|█▉        | 7/36 [00:05<00:21,  1.34it/s]
 22%|██▏       | 8/36 [00:05<00:21,  1.30it/s]
 25%|██▌       | 9/36 [00:06<00:21,  1.25it/s]
 28%|██▊       | 10/36 [00:07<00:21,  1.23it/s]
 31%|███       | 11/36 [00:08<00:21,  1.19it/s]
 33%|███▎      | 12/36 [00:09<00:20,  1.15it/s]
 36%|███▌      | 13/36 [00:10<00:20,  1.13it/s]
 39%|███▉      | 14/36 [00:11<00:19,  1.12it/s]
 42%|████▏     | 15/36 [00:12<00:18,  1.11it/s]
 44%|████▍     | 16/36 [00:13<00:18,  1.10it/s]
 47%|████▋     | 17/36 [00:14<00:17,  1.10it/s]
 50%|█████     | 18/36 [00:14<00:16,  1.10it/s]
 53%|█████▎    | 19/36 [00:15<00:15,  1.09it/s]
 56%|█████▌    | 20/36 [00:16<00:14,  1.08it/s]
 58%|█████▊    | 21/36 [00:17<00:13,  1.08it/s]
 61%|██████    | 22/36 [00:18<00:12,  1.08it/s]
 64%|██████▍   | 23/36 [00:19<00:11,  1.09it/s]
 67%|██████▋   | 24/36 [00:20<00:11,  1.08it/s]
 69%|██████▉   | 25/36 [00:21<00:10,  1.08it/s]
 72%|███████▏  | 26/36 [00:22<00:09,  1.08it/s]
 75%|███████▌  | 27/36 [00:23<00:08,  1.08it/s]
 78%|███████▊  | 28/36 [00:24<00:07,  1.08it/s]
 81%|████████  | 29/36 [00:25<00:06,  1.08it/s]
 83%|████████▎ | 30/36 [00:26<00:05,  1.08it/s]
 86%|████████▌ | 31/36 [00:26<00:04,  1.08it/s]
 89%|████████▉ | 32/36 [00:27<00:03,  1.08it/s]
 92%|█████████▏| 33/36 [00:28<00:02,  1.08it/s]
 94%|█████████▍| 34/36 [00:29<00:01,  1.08it/s]
 97%|█████████▋| 35/36 [00:30<00:00,  1.08it/s]
100%|██████████| 36/36 [00:31<00:00,  1.08it/s]
100%|██████████| 36/36 [00:31<00:00,  1.14it/s]
Version Details
Version ID
b76242b40d67c76ab6742e987628a2a9ac019e11d56ab96c4e91ce03b79b2787
Version Created
April 27, 2023
Run on Replicate →