CLIP Interrogator extension! #7968
-
Nice extension! Do you have any info on which modes take the least VRAM? Since the model had loaded into VRAM at startup, I unloaded it with https://github.com/hako-mikan/sd-webui-supermerger, an extension that has an "unload model" option. "fast" and "classic" mode seem to use the least.
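In case it's useful, here is a rough sketch (my assumption, not the actual supermerger code) of what an "unload model" button presumably does under the hood: move the weights off the GPU and release the cached CUDA memory.

```python
import gc
import torch

def unload(model):
    """Move a loaded model out of VRAM and release cached memory (illustrative sketch)."""
    model.to("cpu")           # move the weights to system RAM
    gc.collect()              # drop lingering Python references
    torch.cuda.empty_cache()  # hand cached CUDA memory back to the driver
```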
-
Updated now with batch mode, so you can point it at a folder of images and it will create a prompt for each one and store them either in a txt file per image, in one big txt file with all the prompts, or in a csv file. I haven't sorted out how to display a progress bar nicely. I've been looking at how it's done for textual inversion training and tried to replicate that, but I run into Gradio errors, so people still have to watch the console to see how it is progressing. Does anyone know how to display progress bars nicely in an extension?
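For reference, outside of the web UI a plain Gradio progress bar looks roughly like the sketch below (assuming a Gradio version new enough to have gr.Progress); the file list and the "prompt for ..." line are placeholders for the real folder scan and interrogation call:

```python
import gradio as gr

def interrogate_folder(folder, progress=gr.Progress()):
    files = ["a.png", "b.png", "c.png"]  # placeholder; a real version would scan `folder`
    results = []
    # progress.tqdm wraps the loop and drives the progress bar in the UI
    for f in progress.tqdm(files, desc="Interrogating"):
        results.append(f"prompt for {f}")  # placeholder for the actual interrogation call
    return "\n".join(results)

with gr.Blocks() as demo:
    folder = gr.Textbox(label="Folder")
    out = gr.Textbox(label="Prompts")
    gr.Button("Run").click(interrogate_folder, inputs=folder, outputs=out)

# In a standalone script you would call demo.launch(); inside a web UI extension
# the Blocks object is returned to the UI instead.
```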
-
Hello,
-
This is VERY useful! There are error messages when offline that indicate it is possible to set it up to work locally. As usual, they are very cryptic and go off talking about installing transformers, etc. So far I have not been able to get everything working. Is there a simple guide to local automatic1111 use that even I can follow? Thanks.
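For anyone in the same situation, one thing worth trying (an assumption on my part, not a verified fix) is the standard Hugging Face offline switches. They only help once the models have already been downloaded to the local cache.

```python
import os

os.environ["TRANSFORMERS_OFFLINE"] = "1"  # tell transformers not to reach out to the Hub
os.environ["HF_HUB_OFFLINE"] = "1"        # tell huggingface_hub to use the local cache only
```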
-
I love this extension. Lately it seems to be running really slowly. I see that it has switched to low-VRAM mode because I have 10 GB of VRAM. How much VRAM would keep it from doing that?
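For reference, a minimal sketch (assuming PyTorch and a single CUDA device) of checking the total VRAM the card reports, which is presumably the kind of number the extension's detection compares against a threshold:

```python
import torch

# Total memory of GPU 0 in gigabytes
total_gb = torch.cuda.get_device_properties(0).total_memory / 1024**3
print(f"Total VRAM: {total_gb:.1f} GB")
```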
-
How do I install additional models for this extension?
-
Hello friends!
I've created an extension so the full CLIP Interrogator can be used in the Web UI now. Give it an image and it will create a prompt that produces similar results with Stable Diffusion v1 and v2. It can give you a nice starting point and ideas for your prompts.
https://github.com/pharmapsychotic/clip-interrogator-ext
It honors the low/med VRAM option of the web UI and does its own detection to switch into a low-VRAM mode, so it should hopefully work across a wide range of GPUs. I've tested with an nvidia 1070 8GB. Precomputed text embeddings are downloaded in safetensors format from huggingface and stored in
models/clip-interrogator
to speed up processing. The first time you run it the progress doesn't display, so check the console to see how it's doing.
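For the curious, loading one of those precomputed files is roughly the sketch below; the file name and tensor key are made up for illustration, so check the actual contents of models/clip-interrogator on your machine:

```python
from safetensors.torch import load_file

# "ViT-L-14_artists.safetensors" is a hypothetical name, purely for illustration
tensors = load_file("models/clip-interrogator/ViT-L-14_artists.safetensors")
print(tensors.keys())            # inspect which tensors the file actually contains
embeds = next(iter(tensors.values()))
print(embeds.shape)              # precomputed text embeddings, one row per phrase
```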
It's my first attempt at an extension so let me know if there's anything I've done wrong or could improve!