
CPU Stats for when it's possible #15

Open
eren23 opened this issue May 22, 2023 · 2 comments
@eren23

eren23 commented May 22, 2023

Running sentence-transformers on a CPU is also possible for various tasks, especially in consumer-grade applications. People are running these models without any GPU acceleration, which might be good to mention in the section.

We have been using a sentence-transformer from the beginning, and even though it's a small open-source project, all the users I know run it on their CPUs.

@waleedkadous
Collaborator

Weirdly, I tried it myself and it was considerably slower: roughly 20x. But I think that would be a really good section to add, especially since we are also adding more information on llama.cpp (which we are starting to benchmark now). Give us two weeks and we'll see if we can do it.

@lcrmorin

I would have appreciated finding this number too. From personal experience (see: https://www.kaggle.com/code/lucasmorin/mistral-7-b-instruct-electricity-co2-consumption), the run time for the same query is about 10x, which generally makes CPU usage impractical (or impossible).
