google-deepmind/gemma-2-2b-it

16.3K runs, Aug 2024, Cog 0.9.13
Tags: code-generation, document-summarization, question-answering, text-generation

About

Gemma 2 2B instruction-tuned variant by Google.

Example Output

Prompt:

"Write me a poem about Machine Learning."

Output:

A labyrinth of data, vast and deep,
Where patterns hide, secrets to keep.
Machine learning, with algorithms bright,
Unveils the truth through logic's light.

From simple tasks, it starts its quest,
To solve complex problems, put them to test.
With each iteration, knowledge grows,
As models learn, their wisdom shows.

Neural networks, like brains they mimic,
Connecting nodes, information trickling.
Deep learning dives into unknown terrain,
Extracting insights from unseen domain.

Predictive power, in every stride,
Forecasting future, where trends reside

Performance Metrics

Prediction Time: 8.10s
Total Time: 8.11s

All Input Parameters
{
  "top_k": 50,
  "top_p": 0.9,
  "prompt": "Write me a poem about Machine Learning.",
  "temperature": 0.6,
  "max_new_tokens": 128,
  "repetition_penalty": 1.2
}
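
The inputs above could be sent with the Replicate Python client roughly as follows. This is a minimal sketch, assuming the third-party `replicate` package is installed and a `REPLICATE_API_TOKEN` is set in the environment. The helper names `build_input` and `run_gemma` are hypothetical, and the model reference simply joins the model name with the version ID listed under Version Details.

```python
# Model name plus the version ID from the Version Details section.
MODEL_REF = (
    "google-deepmind/gemma-2-2b-it:"
    "ff924e24b20727e4e04b9721b403b1a75500b7b8b934714ed2b34afc6de69673"
)


def build_input(prompt, **overrides):
    """Return the example input payload, with optional per-call overrides.

    Hypothetical helper: it only assembles a dict and does not contact
    Replicate.
    """
    payload = {
        "top_k": 50,
        "top_p": 0.9,
        "prompt": prompt,
        "temperature": 0.6,
        "max_new_tokens": 128,
        "repetition_penalty": 1.2,
    }
    unknown = set(overrides) - set(payload)
    if unknown:
        raise ValueError(f"unknown input parameters: {sorted(unknown)}")
    payload.update(overrides)
    return payload


def run_gemma(prompt, **overrides):
    """Run one prediction and return the output as a list of string chunks.

    Assumes the `replicate` package and an API token are available; the
    import is done lazily so build_input can be used without them.
    """
    import replicate  # third-party client, assumed to be installed

    output = replicate.run(MODEL_REF, input=build_input(prompt, **overrides))
    return list(output)
```

A call such as `run_gemma("Write me a poem about Machine Learning.", max_new_tokens=256)` would then override a single parameter and keep the rest of the example values.
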
Input Parameters
top_k Type: integer, Default: 50, Range: 1 - 1000
When decoding text, samples from the top k most likely tokens; lower to ignore less likely tokens.
top_p Type: number, Default: 0.9, Range: 0.05 - 1
When decoding text, samples from the top p percentage of most likely tokens; lower to ignore less likely tokens.
prompt Type: string, Default: Write me a poem about Machine Learning.
Prompt to send to the model.
temperature Type: number, Default: 0.6, Range: 0.1 - 4
Adjusts randomness of outputs: greater than 1 is random, 0 is deterministic, and 0.75 is a good starting value.
max_new_tokens Type: integer, Default: 1024, Range: 1 - 4096
Maximum number of tokens to generate. A word is generally 2-3 tokens.
repetition_penalty Type: number, Default: 1.2, Range: 0 - ∞
Repetition penalty parameter.
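
To make the effect of temperature, top_k, and top_p concrete, the sketch below applies them to a toy set of token scores. It follows a common ordering (temperature scaling, then top-k, then top-p), which is an assumption here: this page does not say how the deployed model implements sampling. The function name `filter_distribution` and the toy logits are hypothetical, and repetition_penalty is left out.

```python
import math


def filter_distribution(logits, temperature=0.6, top_k=50, top_p=0.9):
    """Return {token: probability} after temperature, top-k and top-p filtering.

    Illustrative only; not taken from the model's actual decoding code.
    """
    # Temperature scaling: values below 1 sharpen the distribution,
    # values above 1 flatten it.
    scaled = {tok: score / temperature for tok, score in logits.items()}

    # Softmax, subtracting the maximum for numerical stability.
    peak = max(scaled.values())
    weights = {tok: math.exp(v - peak) for tok, v in scaled.items()}

    # Top-k: keep only the k highest-weighted tokens, then renormalize.
    ranked = sorted(weights.items(), key=lambda item: item[1], reverse=True)
    ranked = ranked[:top_k]
    total = sum(w for _, w in ranked)
    ranked = [(tok, w / total) for tok, w in ranked]

    # Top-p: keep the smallest prefix whose cumulative probability
    # reaches top_p.
    kept, cumulative = [], 0.0
    for tok, prob in ranked:
        kept.append((tok, prob))
        cumulative += prob
        if cumulative >= top_p:
            break

    # Renormalize the surviving tokens so they sum to 1.
    norm = sum(prob for _, prob in kept)
    return {tok: prob / norm for tok, prob in kept}


toy_logits = {"data": 2.0, "model": 1.0, "poem": 0.0, "zebra": -1.0}
print(filter_distribution(toy_logits, temperature=1.0, top_k=2, top_p=1.0))
```

Lowering top_k or top_p shrinks the set of candidate tokens, which matches the "lower to ignore less likely tokens" wording above.
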
Output Schema

Output

Type: array, Items Type: string
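
Since the output is an array of strings, a caller would typically concatenate the items to recover the full text. A minimal sketch, assuming the items are consecutive pieces of the generated text; the helper name `join_output` and the chunk boundaries in the example are made up for illustration.

```python
def join_output(chunks):
    """Concatenate the string items of the output array into one text."""
    return "".join(chunks)


# Hypothetical chunking of the first line of the example poem.
example_chunks = ["A labyrinth ", "of data, ", "vast and deep,"]
print(join_output(example_chunks))
```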

Version Details
Version ID
ff924e24b20727e4e04b9721b403b1a75500b7b8b934714ed2b34afc6de69673
Version Created
August 1, 2024