🤖 Model 🖼️ → 📝
salesforce/blip
Caption images and answer visual questions from an input image. Optionally evaluate image–text matching. Provide an imag...
Found 2 models (showing 1-2)
Caption images and answer visual questions from an input image. Optionally evaluate image–text matching. Provide an imag...
Perform zero-shot image classification by scoring an input image against multiple text labels. Provide one image and a p...