lucataco/moondream-0.5b 🖼️📝 → 📝

▶️ 64 runs 📅 Dec 2024 ⚙️ Cog 0.13.3
image-captioning image-to-text visual-question-answering

About

Moondream 0.5B, the world's smallest vision language model

Example Output

Prompt:

"Describe this image"

Output:

The image features the logo of the app "MoonDREAM". The logo is a blue circle with a smiling face, radiating with a cheerful expression. The text "MoonDREAM" is displayed in a bold, black font below the circle. The entire logo is set against a white background, which highlights the colors and the overall design of the app.

Performance Metrics

30.38s Prediction Time
30.40s Total Time

All Input Parameters
{
  "image": "https://replicate.delivery/pbxt/NEnAw2VuSGJTUFm9BtaQHlga8903vU8vcjSKo9TpZ6jLJIH8/moondream2.png",
  "prompt": "Describe this image"
}

Input Parameters

image (required)
Type: string
Input image

prompt
Type: string
Default: Describe this image
Question to ask about the image

Output Schema

Output

Type: string

Version Details
Version ID
00c1f1ed7192004da1607947f940b7030558efc5e00cada7be75ccfaec97e6c2
Version Created
December 5, 2024
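
The parameters and version ID listed above can be combined into a single prediction request. The following is a minimal sketch, not an official client: it assumes the public Replicate HTTP endpoint `https://api.replicate.com/v1/predictions`, bearer-token authentication via a `REPLICATE_API_TOKEN` environment variable, and the `Prefer: wait` header for a blocking call. The helper name `build_prediction_request` is hypothetical and introduced only for illustration.

```python
import json
import os
import urllib.request

# Assumed endpoint of the Replicate HTTP API (not stated on this page).
API_URL = "https://api.replicate.com/v1/predictions"
# Version ID copied from the "Version Details" section.
VERSION_ID = "00c1f1ed7192004da1607947f940b7030558efc5e00cada7be75ccfaec97e6c2"
# Default prompt from the "Input Parameters" section.
DEFAULT_PROMPT = "Describe this image"


def build_prediction_request(image_url, prompt=DEFAULT_PROMPT, token=""):
    """Build (but do not send) a POST request for one prediction.

    `image` is the only required input; `prompt` falls back to its default.
    """
    if not image_url:
        raise ValueError("image is required")
    payload = {
        "version": VERSION_ID,
        "input": {"image": image_url, "prompt": prompt},
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
            # Assumption: ask the API to hold the connection until the
            # prediction finishes or the server-side wait limit is reached.
            "Prefer": "wait",
        },
        method="POST",
    )


if __name__ == "__main__":
    token = os.environ.get("REPLICATE_API_TOKEN")
    if not token:
        print("Set REPLICATE_API_TOKEN to send the request.")
    else:
        req = build_prediction_request(
            "https://replicate.delivery/pbxt/NEnAw2VuSGJTUFm9BtaQHlga8903vU8vcjSKo9TpZ6jLJIH8/moondream2.png",
            token=token,
        )
        with urllib.request.urlopen(req) as resp:
            prediction = json.load(resp)
        # Per the output schema, "output" is a single string when present.
        print(prediction.get("output"))
```

The example run above took about 30 seconds, so a blocking call may return before the output is ready. In that case the response's `output` field would be empty and the prediction would need to be polled until it completes.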