zsxkib/qwen2-0.5b-instruct 🔢📝❓ → 📝
About
Qwen 2: A 0.5 billion parameter language model from Alibaba Cloud, fine tuned for chat completions

Example Output
Prompt:
"Tell me a funny joke about cowboys in the style of Yoda from star wars"
Output
Why did the cowgirl refuse to ride the horse? Because she was afraid of the horse's horn!
Performance Metrics
1.20s
Prediction Time
209.39s
Total Time
All Input Parameters
{ "top_k": 1, "top_p": 1, "prompt": "Tell me a funny joke about cowboys in the style of Yoda from star wars", "model_type": "Qwen2-0.5B-Instruct", "temperature": 1, "system_prompt": "You are a funny and helpful assistant.", "max_new_tokens": 512, "repetition_penalty": 1 }
Input Parameters
- seed
- The seed for the random number generator
- top_k
- When decoding text, samples from the top k most likely tokens; lower to ignore less likely tokens.
- top_p
- When decoding text, samples from the top p percentage of most likely tokens; lower to ignore less likely tokens.
- prompt
- Input prompt
- model_type
- Choose from available 0.5B models
- temperature
- Adjusts randomness of outputs, greater than 1 is random and 0 is deterministic, 0.75 is a good starting value.
- system_prompt
- System prompt
- max_new_tokens
- The maximum number of tokens to generate
- repetition_penalty
- Penalty for repeated words in generated text; 1 is no penalty, values greater than 1 discourage repetition, less than 1 encourage it.
Output Schema
Output
Example Execution Logs
Using seed: 627241552 Time to first token: 0.67 seconds Total generation time: 1.10 seconds Total tokens generated: 89 Throughput: 81.23 tokens/second
Version Details
- Version ID
f45376a2df2570a076816e9f9c1713d63a5e52851026fa836f14886585591350
- Version Created
- June 24, 2024