lucataco/fuyu-8b 🖼️📝🔢 → 📝

▶️ 4.6K runs 📅 Oct 2023 ⚙️ Cog 0.8.6 🔗 GitHub 📄 Paper ⚖️ License
image-to-text visual-question-answering

About

Fuyu-8B is a multi-modal text and image transformer trained by Adept AI

Example Output

Prompt:

"What is the highest life expectancy at birth of male?"

Output

 The life expectancy at birth of males in 2018 is 80.7.

Performance Metrics

1.86s Prediction Time
1.83s Total Time
All Input Parameters
{
  "image": "https://replicate.delivery/pbxt/JjK2zhdhMpdevuSR7POm4X64qa2fVWv8miI4NBlkoHWVPmpD/chart.png",
  "prompt": "What is the highest life expectancy at birth of male?",
  "max_new_tokens": 512
}
Input Parameters
image (required) Type: string
Input Image
prompt (required) Type: string
Input prompt
max_new_tokens Type: integerDefault: 512Range: 0 - 2048
Max new tokens
Output Schema

Output

Type: string

Example Execution Logs
The attention mask and the pad token id were not set. As a consequence, you may observe unexpected behavior. Please pass your input's `attention_mask` to obtain reliable results.
Setting `pad_token_id` to `eos_token_id`:71013 for open-end generation.
Version Details
Version ID
42f23bc876570a46f5a90737086fbc4c3f79dd11753a28eaa39544dd391815e9
Version Created
October 20, 2023
Run on Replicate →