Improve handling of JabRef embedding models #12240

InAnYan · 2024-11-26T21:06:01Z

Is your suggestion for improvement related to a problem? Please describe.

Currently JabRef provides means to view and download embedding models from DJL ModelZoo. JabRef also stores the size of embedding models.

However, it is badly implemented (another black page of my GSoC project).

Problems are:

This list is not auto-updated. Actually, this list is ... hard-coded.
Model size in this list is not properly calculated.
There is no way to view what models are already downloaded.
There is no way to delete old or unused models.
Embedding models access Internet without agreement (agreement on using AI != any Internet connection). There should be a way to download model on 1 computer and then transfer it to another computer. The question is: what to download? Where is it stored? Which files to transfer? Where to put in JabRef?

Describe the solution you'd like

Provide a list of available models using actual DJL API (up-to-date list).
Add the ability to download a model.
Add the ability to select model for using in AI features.
Provide a way to list downloaded models.
Provide the ability to delete a downloaded model.

Additionally, there should be a way to download a model beforehand. E.g. download model on one computer, then transfer it to another and install in JabRef.

Additional context

It seems there are some useful methods in DJL, though they are not documented thoroughly (https://javadoc.io/doc/ai.djl/api/latest/ai/djl/repository/zoo/ModelZoo.html#listModels()). I couldn't quickly grasp how to connect local (downloaded) models + remote, but probably this is a problem of time.

Thi Lo also found a link with models metadata (https://mlrepo.djl.ai/model/nlp/text_embedding/ai/djl/huggingface/pytorch/models.json.gz), which is enough to have.

This is not an easy issue, one needs to create useful UI. However, it's not debatable, so I posted it here.

Maybe introduce a section in AI preferences "Available models" with button "+", button "+" opens a dialog for choosing a remote embedding model or a local one

InAnYan · 2024-11-26T21:55:54Z

Oliver has found an interesting UI for download dialog (https://forum.image.sc/t/trouble-getting-gpu-to-work-with-instanseg-qupath/102042/26):

InAnYan added the component: ai Related to AI Chat/Summarization label Nov 26, 2024

InAnYan mentioned this issue Nov 26, 2024

Enable offline use of embedding model InAnYan/jabref#87

Closed

Siedlerchr added this to Prioritization Nov 29, 2024

github-project-automation bot moved this to Normal priority in Prioritization Nov 29, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve handling of JabRef embedding models #12240

Improve handling of JabRef embedding models #12240

InAnYan commented Nov 26, 2024

InAnYan commented Nov 26, 2024

Improve handling of JabRef embedding models #12240

Improve handling of JabRef embedding models #12240

Comments

InAnYan commented Nov 26, 2024

InAnYan commented Nov 26, 2024