nateraw/goliath-120b 🔢📝 → 📝
About
An auto-regressive causal LM created by combining 2x finetuned Llama-2 70B into one.

Example Output
"write me a blogpost all about parmigiano reggiano. Include helpful tips, like what to do with the rinds, whether you should buy pre-grated vs non-grated, etc, etc. Grumble about poor quality american Parmesan. "
Output
Title: The King of Cheeses: All Hail Parmigiano Reggiano!
Introduction:
Welcome, cheese aficionados! Today, we're going to dive into the world of the one and only Parmigiano Reggiano. That's right, I'm talking about the real deal, not that pre-grated, mass-produced "Parmesan" you find in American grocery stores. We're here to pay homage to the real thing - the King of Cheeses!
What is Parmigiano Reggiano?
Parmigiano Reggiano (pronounced par-mee-JYAH-no reh-JYAH-no) is a hard, aged Italian cheese that hails from the Emilia-Romagna region of Italy. It has been made using traditional methods for over 800 years! This PDO (Protected Designation of Origin) product means that it can only be called Parmigiano Reggiano if it's made within the specified area and adheres to strict production guidelines.
Why is it better than American "Parmesan"?
American "Parmesan" cheese is often a pale imitation of the real thing. It's typically made from a blend of cheeses, including Cheddar and other cheaper varieties, and is aged for a much shorter period - usually just a few months compared to the minimum 12-24 months for Parmigiano Reggiano. The result? A cheese that lacks depth of flavor, complexity, and that unmistakable umami kick.
Tips for Buying Parmigiano Reggiano:
- Look for the mark: Authentic Parmigiano Reggiano will have the words "Parmigiano Reggiano Stagionato" (or similar, depending on the aging) stamped on the rind. The rind should also have the brand of the dairy and the month and year of production.
- Check the texture: The cheese should be firm but slightly elastic. Avoid any pieces with cracks or mold (other than the natural mold that forms on the rind).
- Smell the cheese: It should have a nutty, grassy aroma with a hint of pungency. If it smells sour or ammonia-like, pass it by.
Grating your own vs. pre-grated:
While pre-grated Parmigiano Reggiano may seem convenient, nothing beats grating your own cheese. Pre-grated cheese can contain anti-caking agents and preservatives to maintain its consistency in the bag. Grating your own ensures you get the freshest, most flavorful cheese possible. Plus, it's therapeutic!
Using Parmigiano Reggiano Rinds:
Don't throw away those precious rinds! Here are some creative ways to use them:
- Soup and stew booster: Add the rind to your favorite soup or stew during cooking to infuse it with rich, cheesy flavor. Remove before serving.
- Risotto enhancer: Stir a piece of rind into your risotto during cooking for an extra burst of umami goodness.
- Pesto elevator: Throw a rind into your food processor when making pesto for a subtle cheesy undertone. Discard before serving.
- Mac and cheese upgrade: Add a rind to your mac and cheese sauce while it's simmering for a cheesier taste without adding more cheese.
Conclusion:
In conclusion, friends, there's no denying that Parmigiano Reggiano is in a league of its own. Next time you're at the cheese counter, skip the imposters and treat yourself to a wedge of the real deal. Your taste buds (and your pasta dishes) will thank you! And remember, always #SayNoToAmericanParmesan and #EatRealCheese.
Performance Metrics
All Input Parameters
{ "top_k": 50, "top_p": 0.95, "prompt": "write me a blogpost all about parmigiano reggiano. Include helpful tips, like what to do with the rinds, whether you should buy pre-grated vs non-grated, etc, etc. Grumble about poor quality american Parmesan. ", "temperature": 0.8, "max_new_tokens": 2048, "prompt_template": "You are a helpful AI assistant.\n\nUSER: {prompt}\nASSISTANT:", "presence_penalty": 1, "frequency_penalty": 0 }
Input Parameters
- top_k
- The number of highest probability tokens to consider for generating the output. If > 0, only keep the top k tokens with highest probability (top-k filtering).
- top_p
- A probability threshold for generating the output. If < 1.0, only keep the top tokens with cumulative probability >= top_p (nucleus filtering). Nucleus filtering is described in Holtzman et al. (http://arxiv.org/abs/1904.09751).
- prompt (required)
- temperature
- The value used to modulate the next token probabilities.
- max_new_tokens
- The maximum number of tokens the model should generate as output.
- prompt_template
- The template used to format the prompt. The input prompt is inserted into the template using the `{prompt}` placeholder.
- presence_penalty
- Presence penalty
- frequency_penalty
- Frequency penalty
Output Schema
Output
Example Execution Logs
INFO 11-16 06:28:37 async_llm_engine.py:370] Received request 0: prompt: 'You are a helpful AI assistant.\n\nUSER: write me a blogpost all about parmigiano reggiano. Include helpful tips, like what to do with the rinds, whether you should buy pre-grated vs non-grated, etc, etc. Grumble about poor quality american Parmesan. \nASSISTANT:', sampling params: SamplingParams(n=1, best_of=1, presence_penalty=1.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.8, top_p=0.95, top_k=50, use_beam_search=False, length_penalty=1.0, early_stopping=False, stop=['</s>'], ignore_eos=False, max_tokens=2048, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True), prompt token ids: None. INFO 11-16 06:28:38 llm_engine.py:624] Avg prompt throughput: 0.0 tokens/s, Avg generation throughput: 0.0 tokens/s, Running: 1 reqs, Swapped: 0 reqs, Pending: 0 reqs, GPU KV cache usage: 0.5%, CPU KV cache usage: 0.0% INFO 11-16 06:28:43 llm_engine.py:624] Avg prompt throughput: 0.0 tokens/s, Avg generation throughput: 11.5 tokens/s, Running: 1 reqs, Swapped: 0 reqs, Pending: 0 reqs, GPU KV cache usage: 0.9%, CPU KV cache usage: 0.0% INFO 11-16 06:28:48 llm_engine.py:624] Avg prompt throughput: 0.0 tokens/s, Avg generation throughput: 11.5 tokens/s, Running: 1 reqs, Swapped: 0 reqs, Pending: 0 reqs, GPU KV cache usage: 1.3%, CPU KV cache usage: 0.0% INFO 11-16 06:28:53 llm_engine.py:624] Avg prompt throughput: 0.0 tokens/s, Avg generation throughput: 11.5 tokens/s, Running: 1 reqs, Swapped: 0 reqs, Pending: 0 reqs, GPU KV cache usage: 1.6%, CPU KV cache usage: 0.0% INFO 11-16 06:28:58 llm_engine.py:624] Avg prompt throughput: 0.0 tokens/s, Avg generation throughput: 11.4 tokens/s, Running: 1 reqs, Swapped: 0 reqs, Pending: 0 reqs, GPU KV cache usage: 2.0%, CPU KV cache usage: 0.0% INFO 11-16 06:29:03 llm_engine.py:624] Avg prompt throughput: 0.0 tokens/s, Avg generation throughput: 11.4 tokens/s, Running: 1 reqs, Swapped: 0 reqs, Pending: 0 reqs, GPU KV cache usage: 2.3%, CPU KV cache usage: 0.0% INFO 11-16 06:29:08 llm_engine.py:624] Avg prompt throughput: 0.0 tokens/s, Avg generation throughput: 11.4 tokens/s, Running: 1 reqs, Swapped: 0 reqs, Pending: 0 reqs, GPU KV cache usage: 2.7%, CPU KV cache usage: 0.0% INFO 11-16 06:29:13 llm_engine.py:624] Avg prompt throughput: 0.0 tokens/s, Avg generation throughput: 11.3 tokens/s, Running: 1 reqs, Swapped: 0 reqs, Pending: 0 reqs, GPU KV cache usage: 3.0%, CPU KV cache usage: 0.0% INFO 11-16 06:29:18 llm_engine.py:624] Avg prompt throughput: 0.0 tokens/s, Avg generation throughput: 11.3 tokens/s, Running: 1 reqs, Swapped: 0 reqs, Pending: 0 reqs, GPU KV cache usage: 3.4%, CPU KV cache usage: 0.0% INFO 11-16 06:29:23 llm_engine.py:624] Avg prompt throughput: 0.0 tokens/s, Avg generation throughput: 11.2 tokens/s, Running: 1 reqs, Swapped: 0 reqs, Pending: 0 reqs, GPU KV cache usage: 3.8%, CPU KV cache usage: 0.0% INFO 11-16 06:29:28 llm_engine.py:624] Avg prompt throughput: 0.0 tokens/s, Avg generation throughput: 11.2 tokens/s, Running: 1 reqs, Swapped: 0 reqs, Pending: 0 reqs, GPU KV cache usage: 4.1%, CPU KV cache usage: 0.0% INFO 11-16 06:29:33 llm_engine.py:624] Avg prompt throughput: 0.0 tokens/s, Avg generation throughput: 11.2 tokens/s, Running: 1 reqs, Swapped: 0 reqs, Pending: 0 reqs, GPU KV cache usage: 4.5%, CPU KV cache usage: 0.0% INFO 11-16 06:29:38 llm_engine.py:624] Avg prompt throughput: 0.0 tokens/s, Avg generation throughput: 11.2 tokens/s, Running: 1 reqs, Swapped: 0 reqs, Pending: 0 reqs, GPU KV cache usage: 4.8%, CPU KV cache usage: 0.0% INFO 11-16 06:29:43 llm_engine.py:624] Avg prompt throughput: 0.0 tokens/s, Avg generation throughput: 11.2 tokens/s, Running: 1 reqs, Swapped: 0 reqs, Pending: 0 reqs, GPU KV cache usage: 5.2%, CPU KV cache usage: 0.0% INFO 11-16 06:29:48 llm_engine.py:624] Avg prompt throughput: 0.0 tokens/s, Avg generation throughput: 11.2 tokens/s, Running: 1 reqs, Swapped: 0 reqs, Pending: 0 reqs, GPU KV cache usage: 5.5%, CPU KV cache usage: 0.0% INFO 11-16 06:29:53 llm_engine.py:624] Avg prompt throughput: 0.0 tokens/s, Avg generation throughput: 11.2 tokens/s, Running: 1 reqs, Swapped: 0 reqs, Pending: 0 reqs, GPU KV cache usage: 5.9%, CPU KV cache usage: 0.0% INFO 11-16 06:29:58 llm_engine.py:624] Avg prompt throughput: 0.0 tokens/s, Avg generation throughput: 11.2 tokens/s, Running: 1 reqs, Swapped: 0 reqs, Pending: 0 reqs, GPU KV cache usage: 6.2%, CPU KV cache usage: 0.0% INFO 11-16 06:30:00 async_llm_engine.py:111] Finished request 0. generation took 82.316s
Version Details
- Version ID
39c0be5d2e96df975da94e8923535733d81179abed5f37c999afc5f7c33bf04e
- Version Created
- November 16, 2023