lucataco/clip-interrogator ❓🖼️ → 📝

▶️ 123.2K runs 📅 Jul 2023 ⚙️ Cog 0.8.5 🔗 GitHub 📄 Paper ⚖️ License

image-to-prompt image-to-text prompt-generation

Performance

0.9sTypical run time

123.2KTotal runs

About

CLIP Interrogator (for faster inference)

Example Output

Output

painting of a turtle swimming in the ocean with a blue sky in the background, illustrative art, turtle, michael angelo inspired, world-bearing turtle, highly detailed illustration.”, 4k artwork, realistic illustration, highly detailed digital painting, vibrant digital painting, [ 4 k digital art, 4k art, hypperrealistic illustration, high detail illustration, vibrant realistic

Performance Metrics

0.90s Prediction Time

0.86s Total Time

All Input Parameters

{
  "mode": "fast",
  "image": "https://replicate.delivery/pbxt/JVpnDt9nXuAnqBaXFPH8JbLrkU7JxQIoAGrHFwRWnFYqI7Ad/replicate-prediction-lyehbrdbrdztdi7ggx63lmhkgm.png",
  "clip_model_name": "ViT-bigG-14/laion2b_s39b_b160k"
}

Input Parameters

mode Default: best: Prompt mode (best takes 10-20 seconds, fast takes 1-2 seconds).
image (required) Type: string: Input image
clip_model_name Default: ViT-L-14/openai: Choose ViT-L for Stable Diffusion 1, ViT-H for Stable Diffusion 2, or ViT-bigG for Stable Diffusion XL.

Output Schema

Output

Type: string

Example Execution Logs

0%|          | 0/55 [00:00<?, ?it/s]
 49%|████▉     | 27/55 [00:00<00:00, 265.22it/s]
 98%|█████████▊| 54/55 [00:00<00:00, 266.00it/s]
100%|██████████| 55/55 [00:00<00:00, 267.08it/s]

Version Details

Version ID: 14d81f8a13e8ef87cc9b5eb7d03f5940fc7010e7226e93af612c5f0f4df1a35f
Version Created: September 12, 2023

Run on Replicate →