lucataco/hermes-2-pro-llama-3-70b 🔢📝 → 📝

▶️ 360 runs 📅 Jun 2024 ⚙️ Cog 0.9.10
function-calling json-structured-output text-generation text-translation

About

Hermes 2 Pro is an updated and cleaned version of the OpenHermes 2.5 Dataset, as well as a newly introduced Function Calling and JSON Mode dataset developed in-house

Example Output

Prompt:

"Write a short story about Goku discovering kirby has teamed up with Majin Buu to destroy the world."

Output

Title: The Unlikely Alliance

Goku was soaring through the skies, enjoying the peace and tranquility that had lasted for a few years since his last battle. Suddenly, he sensed an immense power surging from a distant planet. His curiosity piqued, Goku used his instant transmission to travel to the source of the energy.

Upon arrival, Goku was shocked to find Kirby, the pink puffball he had once considered an ally, standing alongside Majin Buu, the powerful villain responsible for the destruction of entire planets. The two unlikely allies were laughing maniacally as they prepared to unleash their combined power upon the Earth.

Goku's eyes widened in disbelief. "Kirby, what are you doing with Majin Buu? You're supposed to be a hero, not a villain!"

Kirby turned to face Goku, his once innocent eyes now filled with malice. "Foolish Saiyan, you know nothing! I have grown tired of being seen as weak and innocent. Teaming up with Majin Buu will grant me the power and respect I deserve!"

Majin Buu chuckled, a sinister grin on his face. "Yes, Kirby and I have decided to join forces and destroy this pathetic planet. No one will stand in our way, not even you, Goku!"

Goku's fists clenched as he prepared for battle. "I can't let you do that. I have to protect the Earth and everyone on it!"

The battle commenced, with Goku using every ounce of his strength to fend off the combined might of Kirby and Majin Buu. Despite his incredible power, Goku found himself struggling against the duo. Kirby's ability to copy his opponents' powers, combined with Majin Buu's overwhelming strength, proved to be a formidable challenge.

As the battle raged on, Goku noticed a glimmer of hesitation in Kirby's eyes. Sensing an opportunity, Goku spoke to the pink puffball. "Kirby, I know you're not evil. You've always been a hero, and you don't have to do this. You can still choose the right path!"

Kirby faltered, his eyes wavering between his desire for power and his innate sense of justice. Goku pressed on, "Remember the times we fought together as allies? We protected the innocent and stood against evil. You don't want to be responsible for the destruction of the Earth, do you?"

Torn between his ambitions and his conscience, Kirby finally made his choice. He turned to Majin Buu

Performance Metrics

26.55s Prediction Time
229.90s Total Time
All Input Parameters
{
  "top_k": 50,
  "top_p": 0.9,
  "prompt": "Write a short story about Goku discovering kirby has teamed up with Majin Buu to destroy the world.",
  "max_tokens": 512,
  "min_tokens": 0,
  "temperature": 0.6,
  "system_prompt": "You are a helpful assistant.",
  "presence_penalty": 0,
  "frequency_penalty": 0
}
Input Parameters
top_k Type: integerDefault: 50
The number of highest probability tokens to consider for generating the output. If > 0, only keep the top k tokens with highest probability (top-k filtering).
top_p Type: numberDefault: 0.9
A probability threshold for generating the output. If < 1.0, only keep the top tokens with cumulative probability >= top_p (nucleus filtering). Nucleus filtering is described in Holtzman et al. (http://arxiv.org/abs/1904.09751).
prompt Type: stringDefault:
Prompt
max_tokens Type: integerDefault: 512
The maximum number of tokens the model should generate as output.
min_tokens Type: integerDefault: 0
The minimum number of tokens the model should generate as output.
temperature Type: numberDefault: 0.6
The value used to modulate the next token probabilities.
system_prompt Type: stringDefault: You are a helpful assistant.
System prompt to send to the model. This is prepended to the prompt and helps guide system behavior. Ignored for non-chat models.
stop_sequences Type: string
A comma-separated list of sequences to stop generation at. For example, '<end>,<stop>' will stop generation at the first instance of 'end' or '<stop>'.
presence_penalty Type: numberDefault: 0
Presence penalty
frequency_penalty Type: numberDefault: 0
Frequency penalty
Output Schema

Output

Type: arrayItems Type: string

Example Execution Logs
INFO 06-28 19:41:58 async_llm_engine.py:529] Received request 07aba362773940d7aa2154854e6b86c9: prompt: '<|begin_of_text|><|im_start|>system\nYou are a helpful assistant.<|im_end|>\n<|im_start|>user\nWrite a short story about Goku discovering kirby has teamed up with Majin Buu to destroy the world.<|im_end|>\n<|im_start|>assistant\n', sampling_params: SamplingParams(n=1, best_of=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.6, top_p=0.9, top_k=50, min_p=0.0, seed=None, use_beam_search=False, length_penalty=1.0, early_stopping=False, stop=[], stop_token_ids=[128003], include_stop_str_in_output=False, ignore_eos=False, max_tokens=512, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None), prompt_token_ids: None, lora_request: None.
 stdoutINFO 06-28 19:42:01 metrics.py:334] Avg prompt throughput: 8.5 tokens/s, Avg generation throughput: 14.9 tokens/s, Running: 1 reqs, Swapped: 0 reqs, Pending: 0 reqs, GPU KV cache usage: 1.5%, CPU KV cache usage: 0.0%
 stdoutINFO 06-28 19:42:07 metrics.py:334] Avg prompt throughput: 0.0 tokens/s, Avg generation throughput: 19.4 tokens/s, Running: 1 reqs, Swapped: 0 reqs, Pending: 0 reqs, GPU KV cache usage: 2.7%, CPU KV cache usage: 0.0%
 stdoutINFO 06-28 19:42:12 metrics.py:334] Avg prompt throughput: 0.0 tokens/s, Avg generation throughput: 19.3 tokens/s, Running: 1 reqs, Swapped: 0 reqs, Pending: 0 reqs, GPU KV cache usage: 3.9%, CPU KV cache usage: 0.0%
 stdoutINFO 06-28 19:42:17 metrics.py:334] Avg prompt throughput: 0.0 tokens/s, Avg generation throughput: 19.3 tokens/s, Running: 1 reqs, Swapped: 0 reqs, Pending: 0 reqs, GPU KV cache usage: 5.0%, CPU KV cache usage: 0.0%
 stdoutINFO 06-28 19:42:22 metrics.py:334] Avg prompt throughput: 0.0 tokens/s, Avg generation throughput: 19.3 tokens/s, Running: 1 reqs, Swapped: 0 reqs, Pending: 0 reqs, GPU KV cache usage: 6.2%, CPU KV cache usage: 0.0%
 stdoutGeneration took 1719601455.61sFormatted prompt: <|begin_of_text|><|im_start|>system
You are a helpful assistant.<|im_end|>
<|im_start|>user
Write a short story about Goku discovering kirby has teamed up with Majin Buu to destroy the world.<|im_end|>
<|im_start|>assistant
INFO 06-28 19:42:24 async_llm_engine.py:120] Finished request 07aba362773940d7aa2154854e6b86c9.
 stdout
Version Details
Version ID
fb2ed69e8d02c1ebe10d89842cf94b562aad4fe4480df7e40a90050328d8a99e
Version Created
June 28, 2024
Run on Replicate →