smoretalk/clip-interrogator-turbo ❓🖼️ → 📝

▶️ 3.0M runs 📅 Feb 2024 ⚙️ Cog 0.9.4
image-to-text prompt-extraction prompt-generation sdxl

About

@pharmapsychotic 's CLIP-Interrogator, but 3x faster and more accurate. Specialized on SDXL.

Example Output

Output

a painting of a yellow car parked in front of a house with trees in the, by Makoto Shinkai, featured on pixiv, makoto shinkai. high detail, by makoto shinkai, in style of makoto shinkai, makoto shinkai art style, makoto shinkai. —h 2160, anime keyframe

Performance Metrics

9.31s Prediction Time
187.88s Total Time
All Input Parameters
{
  "mode": "best",
  "image": "https://replicate.delivery/pbxt/KgRWg4JUfnnszNV78fo1PMRvYxCD9nCgf26Va4RtLxWcuujW/illust-car.png",
  "style_only": false
}
Input Parameters
mode Default: best
Prompt Mode: fast takes 1-2 seconds, best takes 15-25 seconds.
image (required) Type: string
Input image
Output Schema

Output

Type: string

Example Execution Logs
Both `max_new_tokens` (=16) and `max_length`(=212) seem to have been set. `max_new_tokens` will take precedence. Please refer to the documentation for more information. (https://huggingface.co/docs/transformers/main/en/main_classes/text_generation)
/root/.pyenv/versions/3.11.9/lib/python3.11/site-packages/transformers/generation/utils.py:1156: UserWarning: Unfeasible length constraints: `min_length` (37) is larger than the maximum possible length (16). Generation will stop at the defined maximum length. You should decrease the minimum length and/or increase the maximum length.
warnings.warn(
Finish generating a caption in 1.91 seconds.
Unused or unrecognized kwargs: padding.
Unused or unrecognized kwargs: padding.
Finish generating an image embedding in 0.23 seconds.
Finish generating a text embedding in 0.15 seconds.
Finish generating a text embedding in 0.13 seconds.
Finish generating a text embedding in 0.13 seconds.
Finish generating a text embedding in 0.14 seconds.
Finish generating a text embedding in 0.13 seconds.
Finish generating a text embedding in 0.13 seconds.
Finish generating a text embedding in 0.13 seconds.
Finish generating a text embedding in 0.13 seconds.
Finish generating a text embedding in 0.13 seconds.
Finish generating a text embedding in 0.13 seconds.
Finish generating a text embedding in 0.13 seconds.
Finish generating a text embedding in 0.14 seconds.
Finish generating a text embedding in 0.14 seconds.
Finish generating a text embedding in 0.13 seconds.
Finish generating a text embedding in 0.13 seconds.
Finish generating a text embedding in 0.13 seconds.
Finish generating a text embedding in 0.15 seconds.
Finish generating a text embedding in 0.15 seconds.
Finish generating a text embedding in 0.14 seconds.
Finish generating a text embedding in 0.14 seconds.
Finish generating a text embedding in 0.13 seconds.
Finish generating a text embedding in 0.13 seconds.
Finish generating a text embedding in 0.13 seconds.
Finish generating a text embedding in 0.13 seconds.
Finish generating a text embedding in 0.15 seconds.
Finish generating a text embedding in 0.14 seconds.
Finish generating a text embedding in 0.14 seconds.
Finish generating a text embedding in 0.14 seconds.
Finish generating a text embedding in 0.14 seconds.
Finish generating a text embedding in 0.14 seconds.
Finish generating a text embedding in 0.14 seconds.
Finish generating a text embedding in 0.13 seconds.
Finish generating a text embedding in 0.14 seconds.
Finish generating a text embedding in 0.13 seconds.
Finish generating a text embedding in 0.13 seconds.
Finish generating a text embedding in 0.15 seconds.
Finish generating a text embedding in 0.13 seconds.
Best prompt is 'a painting of a yellow car parked in front of a house with trees in the, by Makoto Shinkai, featured on pixiv, makoto shinkai. high detail,  by makoto shinkai,  in style of makoto shinkai,  makoto shinkai art style,  makoto shinkai. —h 2160,  anime keyframe'.
Finish generating a prompt in 8.34 seconds.
Version Details
Version ID
36a9275d4215df72f67de0daa59929fa24866351405b74b8bcdc1991441aafec
Version Created
February 5, 2024
Run on Replicate →