muqtadar08/llava_phi-3-mini 🖼️📝 → 🖼️

▶️ 12 runs 📅 Apr 2024 ⚙️ Cog 0.9.5
image-captioning image-to-text visual-question-answering

About

Example Output

Output

what is this The image features a blue and black folding bicycle parked on a concrete surface. The bicycle is equipped with orange reflectors on the wheels and a black seat. The handlebars are black, matching the frame of the bicycle, which is predominantly blue and black. The bicycle is folded, indicating it's ready for easy transportation or storage. The concrete surface on which the bicycle is parked appears to be a sidewalk or a similar urban setting. The overall image gives a sense of a compact, urban-friendly transportation option.

Performance Metrics

14.29s Prediction Time
190.62s Total Time
All Input Parameters
{
  "image": "https://replicate.delivery/pbxt/KpXHY6ytrTokfLipPSCITkpykTFkOg8Pui2lcCcADGkPYf1j/image.png",
  "question": "what is this "
}
Input Parameters
image (required) Type: string
Input image
question (required) Type: string
Ask a question about the image
Output Schema

Output

Type: stringFormat: uri

Version Details
Version ID
458b72b956ff8b1c0140571e6d2d03292c6666ec4b506d118776a165e34149c6
Version Created
April 29, 2024
Run on Replicate →