-
Notifications
You must be signed in to change notification settings - Fork 87
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support for embedding models (BGE, GTE etc) #22
Comments
Hi @michaeljelly, thanks a lot for the great suggestion! We are currently focused on making LLMs work better and faster, and I haven't really put much thought on what expanding to embedding models would entail. It could potentially be an awesome contribution from the community! Maybe @xenova and @Vaibhavs10 have more insight on what the effort could look like. |
I actually think that embedding models are some of the easiest to add support for. 😅 Once you have BERT models supported, you automatically are able to run most of the models on the MTEB leaderboard. I see the repo already supports the BERT tokenizer, so the only additional step is to add a pooling method (typically mean or CLS pooling) to get sentence embeddings (as opposed to per-token embeddings). Multimodal embedding models, like CLIP, can come at a later stage :) |
IMO, it is okay to open it to the community! I think in terms of overall impact, focusing on LLMs makes more sense since, from an On-device perspective, they'd be used way more used than an embedding model. I believe, most of the embedding models run quite competitively on CPU already. |
Thanks a lot @xenova and @Vaibhavs10 for your comments! I assigned the |
any updates on this one? |
sorry if I can't find this: is there anything like huggingface swift-embedders ? |
Hi @pannous! Unfortunately, there's not. This is a very interesting issue that remains open for the community to explore :) |
While not huggingface, I found this for embeddings in swift: https://github.com/ZachNagengast/similarity-search-kit |
cc @ZachNagengast in case he has any feedback on his experience :) |
Embeddings models are very useful, and can easily be run on device in terms of hardware specs.
It would be awesome if swift-transformers would work with them.
The text was updated successfully, but these errors were encountered: