zsxkib/qwen2-0.5b-instruct

210 runs, Jun 2024, Cog 0.9.9
code-generation question-answering text-generation text-translation translation

About

Qwen2: a 0.5-billion-parameter language model from Alibaba Cloud, fine-tuned for chat completions.

Example Output

Prompt:

"Tell me a funny joke about cowboys in the style of Yoda from star wars"

Output:

Why did the cowgirl refuse to ride the horse? Because she was afraid of the horse's horn!

Performance Metrics

1.20s Prediction Time
209.39s Total Time
All Input Parameters
{
  "top_k": 1,
  "top_p": 1,
  "prompt": "Tell me a funny joke about cowboys in the style of Yoda from star wars",
  "model_type": "Qwen2-0.5B-Instruct",
  "temperature": 1,
  "system_prompt": "You are a funny and helpful assistant.",
  "max_new_tokens": 512,
  "repetition_penalty": 1
}
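As a sketch of how this input could be assembled programmatically, the following Python builds the same payload and checks it against the ranges listed under Input Parameters. The helper name `build_input` and the `RANGES` and `DEFAULTS` tables are hypothetical and not part of the model's API; only the parameter names, defaults, and ranges come from this page.

```python
# Hypothetical helper: parameter names, defaults, and ranges are copied from
# this page; the function itself is illustrative, not part of the model.
RANGES = {
    "top_p": (0.01, 1),
    "temperature": (0.1, 5),
    "max_new_tokens": (1, 32768),
    "repetition_penalty": (0.01, 10),
}

DEFAULTS = {
    "top_k": 1,
    "top_p": 1,
    "prompt": "Give me a short introduction to large language model.",
    "model_type": "Qwen2-0.5B-Instruct",
    "temperature": 1,
    "system_prompt": "You are a helpful assistant.",
    "max_new_tokens": 512,
    "repetition_penalty": 1,
}


def build_input(**overrides):
    """Merge overrides into the documented defaults and range-check them."""
    unknown = set(overrides) - set(DEFAULTS) - {"seed"}
    if unknown:
        raise KeyError(f"unknown parameters: {sorted(unknown)}")
    payload = {**DEFAULTS, **overrides}
    for name, (low, high) in RANGES.items():
        if not low <= payload[name] <= high:
            raise ValueError(f"{name}={payload[name]} is outside {low} - {high}")
    return payload


# Reproduces the example input shown above.
example = build_input(
    prompt="Tell me a funny joke about cowboys in the style of Yoda from star wars",
    system_prompt="You are a funny and helpful assistant.",
)
```

With the `replicate` Python client installed and an API token configured, a payload like this would typically be passed as `replicate.run("zsxkib/qwen2-0.5b-instruct:<version id>", input=example)`, using the Version ID listed under Version Details. That call is not exercised here.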
Input Parameters
seed Type: integer
The seed for the random number generator
top_k Type: integer, Default: 1
When decoding text, samples from the top k most likely tokens; lower to ignore less likely tokens.
top_p Type: number, Default: 1, Range: 0.01 - 1
When decoding text, samples from the smallest set of most likely tokens whose cumulative probability reaches top_p; lower to ignore less likely tokens.
prompt Type: string, Default: Give me a short introduction to large language model.
Input prompt
model_type Default: Qwen2-0.5B-Instruct
Choose from available 0.5B models
temperature Type: number, Default: 1, Range: 0.1 - 5
Adjusts randomness of outputs: values above 1 are more random, lower values are more deterministic (the minimum is 0.1); 0.75 is a good starting value.
system_prompt Type: string, Default: You are a helpful assistant.
System prompt
max_new_tokens Type: integer, Default: 512, Range: 1 - 32768
The maximum number of tokens to generate
repetition_penalty Type: number, Default: 1, Range: 0.01 - 10
Penalty for repeated words in generated text; 1 is no penalty, values greater than 1 discourage repetition, values less than 1 encourage it.
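To illustrate how these decoding parameters typically interact, here is a minimal sketch of one sampling step in plain Python. It applies the repetition penalty, then temperature, then top-k, then top-p, which is a common convention; the order and exact behaviour in this model's deployment are an assumption, and `sample_next` is a hypothetical name.

```python
import math
import random


def sample_next(logits, generated, *, top_k=1, top_p=1.0, temperature=1.0,
                repetition_penalty=1.0, rng=random):
    """Pick one token id from raw logits (illustrative only)."""
    scores = list(logits)
    # Repetition penalty: values > 1 make already-generated tokens less likely.
    for tok in set(generated):
        s = scores[tok]
        scores[tok] = s / repetition_penalty if s > 0 else s * repetition_penalty
    # Temperature: divide the logits; small values sharpen the distribution.
    scores = [s / temperature for s in scores]
    # Top-k: keep only the k highest-scoring tokens (0 or less keeps all).
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    ranked = ranked[:top_k] if top_k > 0 else ranked
    # Softmax over the surviving tokens.
    peak = scores[ranked[0]]
    weights = [math.exp(scores[i] - peak) for i in ranked]
    total = sum(weights)
    probs = [w / total for w in weights]
    # Top-p: keep the smallest prefix whose cumulative probability reaches top_p.
    kept, cumulative = [], 0.0
    for tok, p in zip(ranked, probs):
        kept.append((tok, p))
        cumulative += p
        if cumulative >= top_p:
            break
    tokens, kept_probs = zip(*kept)
    return rng.choices(tokens, weights=kept_probs, k=1)[0]
```

In this sketch, the default top_k of 1 leaves only the highest-scoring token, so the choice is effectively greedy regardless of temperature or seed; raising top_k is what lets temperature and top_p have a visible effect.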
Output Schema

Output

Type: array, Items Type: string
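Because the output is an array of strings, a caller usually concatenates the items to recover the full reply. A minimal sketch, assuming the items are consecutive text chunks in generation order; the chunk boundaries below are made up for illustration and `join_output` is a hypothetical helper.

```python
def join_output(chunks):
    """Concatenate output items into a single reply string."""
    return "".join(chunks)


# Hypothetical chunking of part of the example output shown earlier.
chunks = ["Why did the cowgirl ", "refuse to ride ", "the horse?"]
reply = join_output(chunks)
print(reply)
```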

Example Execution Logs
Using seed: 627241552
Time to first token: 0.67 seconds
Total generation time: 1.10 seconds
Total tokens generated: 89
Throughput: 81.23 tokens/second
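The throughput line can be cross-checked from the other log values. The sketch below parses the log text shown above and divides the token count by the generation time; `parse_logs` is a hypothetical helper. The recomputed value (about 80.91) differs slightly from the logged 81.23 tokens/second, presumably because the logged time is rounded to two decimals.

```python
import re

LOGS = """\
Using seed: 627241552
Time to first token: 0.67 seconds
Total generation time: 1.10 seconds
Total tokens generated: 89
Throughput: 81.23 tokens/second
"""


def parse_logs(text):
    """Extract the numeric fields from the execution log lines."""
    patterns = {
        "seed": r"Using seed: (\d+)",
        "first_token_s": r"Time to first token: ([\d.]+) seconds",
        "total_s": r"Total generation time: ([\d.]+) seconds",
        "tokens": r"Total tokens generated: (\d+)",
        "throughput": r"Throughput: ([\d.]+) tokens/second",
    }
    return {key: float(re.search(rx, text).group(1)) for key, rx in patterns.items()}


stats = parse_logs(LOGS)
recomputed = stats["tokens"] / stats["total_s"]
print(f"{recomputed:.2f} tokens/second")
```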
Version Details
Version ID
f45376a2df2570a076816e9f9c1713d63a5e52851026fa836f14886585591350
Version Created
June 24, 2024