deniyes/dolly-v2-12b-demo 🔢📝❓ → 📝

▶️ 19 runs 📅 Jul 2024 ⚙️ Cog 0.9.13

question-answering text-generation

Performance

1.8sTypical run time

~171sCold start (first call)

19Total runs

About

dolly-v2-12b， just for testing

Example Output

Prompt:

"please compare the Cog and Blentoml"

Output

Cog - A computer program that allows you to create a virtual assistant.
Blentoml - An open-source language model platform for building conversational AI applications.

Performance Metrics

1.81s Prediction Time

171.11s Total Time

All Input Parameters

{
  "top_k": 50,
  "top_p": 1,
  "prompt": "please compare the Cog and Blentoml",
  "decoding": "top_p",
  "max_length": 500,
  "temperature": 0.75,
  "repetition_penalty": 1.2
}

Input Parameters

top_k Type: integerDefault: 50: Valid if you choose top_k decoding. The number of highest probability vocabulary tokens to keep for top-k-filtering
top_p Type: numberDefault: 1Range: 0.01 - 1: Valid if you choose top_p decoding. When decoding text, samples from the top p percentage of most likely tokens; lower to ignore less likely tokens
prompt (required) Type: string: Input Prompt.
decoding Default: top_p: Choose a decoding method
max_length Type: integerDefault: 500Range: 1 - ∞: Maximum number of tokens to generate. A word is generally 2-3 tokens
temperature Type: numberDefault: 0.75Range: 0.01 - 5: Adjusts randomness of outputs, greater than 1 is random and 0 is deterministic, 0.75 is a good starting value.
repetition_penalty Type: numberDefault: 1.2Range: 0.01 - 5: Penalty for repeated words in generated text; 1 is no penalty, values greater than 1 discourage repetition, less than 1 encourage it.

Output Schema

Output

Type: array • Items Type: string

Example Execution Logs

The attention mask and the pad token id were not set. As a consequence, you may observe unexpected behavior. Please pass your input's `attention_mask` to obtain reliable results.
Setting `pad_token_id` to `eos_token_id`:0 for open-end generation.

Version Details

Version ID: ef548bcbf14a2dc42292c647523630085bdb7e4a65a8e405237fccdc03e4cbda
Version Created: July 22, 2024

Run on Replicate →