openai/gpt-oss-20b 🔢📝 → 📝

⭐ Official ▶️ 50.8K runs 📅 Aug 2025 ⚙️ Cog 0.16.8
code-generation question-answering text-generation text-translation

About

20b open-weight language model from OpenAI

Example Output

Prompt:

"How many "r"s are in strawberry?"

Output

There are 3 “r” letters in the word strawberry.

Performance Metrics

1.57s Prediction Time
1.57s Total Time
All Input Parameters
{
  "top_p": 1,
  "prompt": "How many \"r\"s are in strawberry?",
  "max_tokens": 1024,
  "temperature": 0.1,
  "presence_penalty": 0,
  "frequency_penalty": 0
}
Input Parameters
top_p Type: numberDefault: 1
Top-p (nucleus) sampling
prompt Type: stringDefault:
Prompt
max_tokens Type: integerDefault: 2048
The maximum number of tokens the model should generate as output.
temperature Type: numberDefault: 0.1
The value used to modulate the next token probabilities.
presence_penalty Type: numberDefault: 0
Presence penalty
frequency_penalty Type: numberDefault: 0
Frequency penalty
Output Schema

Output

Type: arrayItems Type: string

Example Execution Logs
Prompt: How many "r"s are in strawberry?
Input token count: 9
Output token count: 17
TTFT: 0.92s
Tokens per second: 10.86
Total time: 1.57s
Version Details
Version ID
432f06c0c39f7bb0d88dc2cbe82e94bcc8760b25198a68bd277e567f26e0e22e
Version Created
October 13, 2025
Run on Replicate →