openai/clip
Create 768-dimensional CLIP (ViT-L/14) embeddings from text or images. Embed both modalities into a shared vector space...
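For orientation, here is a minimal sketch of producing these 768-dimensional CLIP ViT-L/14 embeddings locally with the Hugging Face transformers implementation rather than this hosted model's API (which is not shown on this page); the checkpoint name openai/clip-vit-large-patch14 and the file cat.jpg are assumptions for illustration.

```python
# Sketch: 768-dim CLIP ViT-L/14 text and image embeddings in one shared space.
# Assumes the Hugging Face checkpoint "openai/clip-vit-large-patch14";
# the hosted model listed here may expose a different interface.
import torch
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

model = CLIPModel.from_pretrained("openai/clip-vit-large-patch14")
processor = CLIPProcessor.from_pretrained("openai/clip-vit-large-patch14")

image = Image.open("cat.jpg")  # hypothetical local image
texts = ["a photo of a cat", "a photo of a dog"]

inputs = processor(text=texts, images=image, return_tensors="pt", padding=True)
with torch.no_grad():
    image_emb = model.get_image_features(pixel_values=inputs["pixel_values"])    # shape (1, 768)
    text_emb = model.get_text_features(input_ids=inputs["input_ids"],
                                       attention_mask=inputs["attention_mask"])  # shape (2, 768)

# Both modalities land in the same 768-dimensional space, so they can be
# compared directly with cosine similarity.
image_emb = image_emb / image_emb.norm(dim=-1, keepdim=True)
text_emb = text_emb / text_emb.norm(dim=-1, keepdim=True)
print(image_emb @ text_emb.T)  # similarity of the image to each caption
```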
Found 11 models
Generate shared embeddings for text, images, and audio for cross-modal retrieval and similarity search. Accepts a text s...
Generate dense embeddings for text queries and document screenshots to power document, PDF, webpage, and slide retrieval...
Generate text and image embeddings for semantic search and cross-modal retrieval. Accepts text or an image and returns a...
Generate CLIP ViT-L/14 embeddings from text and images for cross-modal similarity search and retrieval. Accept text stri...
Create 512-dimensional embeddings for images and text for similarity search, semantic retrieval, and clustering. Accept...
Create multilingual text and image embeddings for cross-modal retrieval and semantic search. Accepts text (up to 8192 to...
Embed images and text into a shared CLIP vector space for similarity search, cross-modal retrieval, and zero-shot classi...
Generate image embeddings from an input image for use with the Segment Anything Model (SAM) ViT-H. Accepts a single imag...
Compute CLIP embeddings for batches of text and images. Accept multiple newline-separated inputs and return one vector e...
Convert images into vector embeddings for visual similarity search, image retrieval, near-duplicate detection, clusterin...
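The models above share the same downstream usage pattern: embed queries and candidates into one vector space, then rank candidates by cosine similarity. Below is a minimal retrieval sketch assuming the embeddings have already been produced by any of these models; the arrays and the helper top_k are placeholders for illustration.

```python
# Sketch: cross-modal similarity search over a shared embedding space.
# query_emb and candidate_embs stand in for vectors returned by one of the
# embedding models listed above (e.g. a text query against stored image vectors).
import numpy as np

def top_k(query_emb: np.ndarray, candidate_embs: np.ndarray, k: int = 5):
    """Return indices and cosine similarities of the k best-matching candidates."""
    q = query_emb / np.linalg.norm(query_emb)
    c = candidate_embs / np.linalg.norm(candidate_embs, axis=1, keepdims=True)
    sims = c @ q                    # cosine similarity per candidate
    order = np.argsort(-sims)[:k]   # highest similarity first
    return order, sims[order]

# Hypothetical data: one 768-dim text query against 1,000 stored image embeddings.
rng = np.random.default_rng(0)
query = rng.normal(size=768)
index = rng.normal(size=(1000, 768))
print(top_k(query, index, k=3))
```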