jyoung105/honeybee 🖼️🔢📝✓ → 📝
About
Locality-enhanced Projector for Multimodal LLM

Example Output
Prompt:
"What is the title of this book?"
Output
The Little Book of Deep Learning
Performance Metrics
2.75s
Prediction Time
542.73s
Total Time
All Input Parameters
{ "image": "https://replicate.delivery/pbxt/KJcspdKRzoJNPWO6PsQcOTTNFjc2RmgCyPJdWen5pC12L7OM/demo-1.jpg", "top_k": 5, "prompt": "What is the title of this book?", "do_sample": true, "max_length": 200, "agree_to_research_only": true }
Input Parameters
- image (required)
- Input image
- top_k
- top k for sampling
- prompt (required)
- Input prompt
- do_sample
- Whether you do sampling or not
- max_length
- Maximum number of tokens to generate
- agree_to_research_only
- You must agree to use this model only for research. It is not for commercial use.
Output Schema
Output
Version Details
- Version ID
813e9d681d9936b2a184c2e3aefbb138688aa3100a708b46a05ad1be8b6fad0e
- Version Created
- January 30, 2024