chigozienri/llava-birds 🖼️🔢📝 → 📝

▶️ 73 runs 📅 Nov 2023 ⚙️ Cog 0.9.0-beta10
bird-identification image-to-text visual-question-answering

About

Example Output

Prompt:

"What is the common name for this bird species?"

Output:

Crested shriketit

Performance Metrics

0.94s Prediction Time
0.98s Total Time
All Input Parameters
{
  "image": "https://replicate.delivery/pbxt/Jv7hB84kxm3TKjmhS2g8NCFaqhaT5FRChkzV7Sehq5Psk9Ob/3.jpg",
  "top_p": 1,
  "prompt": "What is the common name for this bird species?",
  "max_tokens": 1024,
  "temperature": 0.2
}
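The input above could be sent to the model with the Replicate Python client. The following is a minimal sketch, assuming the `replicate` package is installed and a `REPLICATE_API_TOKEN` environment variable is set. The helper names `build_input` and `identify_bird` are hypothetical and only illustrate the call; the model reference combines the model name with the version ID listed under Version Details.

```python
# Model name plus the version ID from the Version Details section.
MODEL_REF = (
    "chigozienri/llava-birds:"
    "ee1295a22ca8709eec09ce40e343ee5039276edbb66b1640219a2fd93ebc7741"
)


def build_input(image_url, prompt, top_p=1, max_tokens=1024, temperature=0.2):
    """Assemble the input payload; defaults mirror the documented defaults."""
    return {
        "image": image_url,
        "top_p": top_p,
        "prompt": prompt,
        "max_tokens": max_tokens,
        "temperature": temperature,
    }


def identify_bird(image_url, prompt):
    """Hypothetical helper: run the model and return its output items as a list.

    Needs network access and an API token, so it is not called at import time.
    """
    import replicate  # imported lazily so build_input works without the package

    output = replicate.run(MODEL_REF, input=build_input(image_url, prompt))
    return list(output)
```

Calling `identify_bird(url, "What is the common name for this bird species?")` with the example image URL would be expected to return text similar to the example output, though results can vary because the default temperature is above 0.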
Input Parameters
image (required) Type: string
Input image
top_p Type: number Default: 1 Range: 0 - 1
When decoding text, samples from the most likely tokens whose cumulative probability reaches top_p; lower this value to ignore less likely tokens
prompt (required) Type: string
Prompt to use for text generation
max_tokens Type: integer Default: 1024 Range: 0 - ∞
Maximum number of tokens to generate. A word is generally 2-3 tokens
temperature Type: number Default: 0.2 Range: 0 - ∞
Adjusts the randomness of outputs: values greater than 1 are highly random, and 0 is deterministic
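The documented ranges can be checked on the client side before a request is sent. This is a minimal sketch; `validate_params` is a hypothetical helper, and whether the service itself rejects out-of-range values is an assumption not confirmed by this page.

```python
def validate_params(top_p=1.0, max_tokens=1024, temperature=0.2):
    """Raise ValueError if a value falls outside the documented range."""
    # top_p: number, range 0 - 1
    if not 0 <= top_p <= 1:
        raise ValueError(f"top_p must be between 0 and 1, got {top_p}")
    # max_tokens: integer, range 0 - infinity (bool is excluded explicitly)
    if isinstance(max_tokens, bool) or not isinstance(max_tokens, int) or max_tokens < 0:
        raise ValueError(f"max_tokens must be a non-negative integer, got {max_tokens}")
    # temperature: number, range 0 - infinity
    if temperature < 0:
        raise ValueError(f"temperature must be non-negative, got {temperature}")
    return {"top_p": top_p, "max_tokens": max_tokens, "temperature": temperature}
```

With no arguments the function returns the documented defaults, so it can also serve as a single place to record them.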
Output Schema

Output

Type: array Items Type: string
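Because the output is an array of strings, a client would typically combine the items into one answer. The sketch below assumes the items are consecutive text chunks to be concatenated in order, which this page does not state explicitly; `join_output` is a hypothetical helper name.

```python
def join_output(chunks):
    """Concatenate output items in order and trim surrounding whitespace."""
    return "".join(chunks).strip()


# Illustrative chunking only; the real split into items may differ.
answer = join_output(["Crested", " shrike", "tit"])
```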

Version Details
Version ID
ee1295a22ca8709eec09ce40e343ee5039276edbb66b1640219a2fd93ebc7741
Version Created
December 2, 2023