ENH: Support concurrent embedding, update LangChain QA demo with multithreaded embedding creation #348

jiayini1119 · 2023-08-14T07:09:07Z

No description provided.

xinference/core/model.py

xinference/tests/test_concurrent_embedding.py

aresnow1 · 2023-08-25T15:05:07Z

Embedding is a CPU-intensive call, and even for a stateless actor, it is not executed simultaneously because the current loop lock is not released until the first call. Therefore, the embedding operation needs to be called with 'to_thread' in model actor. However, I have tried it, and even embedding is not thread-safe for llamacpp, and the process results in a core dump if called concurrently.

jiayini1119 · 2023-08-28T04:39:57Z

We can first try supporting concurrent embedding creation for PyTorch models.

Embedding is a CPU-intensive call, and even for a stateless actor, it is not executed simultaneously because the current loop lock is not released until the first call. Therefore, the embedding operation needs to be called with 'to_thread' in model actor. However, I have tried it, and even embedding is not thread-safe for llamacpp, and the process results in a core dump if called concurrently.

jiayini1119 added 3 commits August 11, 2023 15:39

add

ebce97b

ckpt

ca2515a

update demo

25cf224

XprobeBot added the enhancement New feature or request label Aug 14, 2023

XprobeBot added this to the v0.2.0 milestone Aug 14, 2023

qinxuye reviewed Aug 14, 2023

View reviewed changes

xinference/core/model.py Outdated Show resolved Hide resolved

jiayini1119 added 7 commits August 14, 2023 15:23

fix

70a72f4

merge

34a12e9

merge

5653d73

update

36e26d3

small fix

3e20b25

fix

e625946

small fix

56a1ddd

XprobeBot modified the milestones: v0.2.0, v0.2.1 Aug 21, 2023

aresnow1 reviewed Aug 22, 2023

View reviewed changes

xinference/core/model.py Show resolved Hide resolved

xinference/tests/test_concurrent_embedding.py Show resolved Hide resolved

jiayini1119 added 3 commits August 22, 2023 15:12

update

45a84e3

update

675244a

test_concurrent_embedding is not for pytest

04d1151

XprobeBot modified the milestones: v0.2.1, v0.3.1 Sep 5, 2023

XprobeBot modified the milestones: v0.4.0, v0.4.2, v0.4.3, v0.4.4 Sep 12, 2023

XprobeBot removed this from the v0.4.4 milestone Sep 19, 2023

XprobeBot modified the milestones: v0.10.0, v0.10.1 Mar 29, 2024

XprobeBot modified the milestones: v0.10.1, v0.10.2 Apr 12, 2024

XprobeBot modified the milestones: v0.10.2, v0.10.3, v0.11.0 Apr 19, 2024

XprobeBot modified the milestones: v0.11.0, v0.11.1, v0.11.2 May 11, 2024

XprobeBot modified the milestones: v0.11.2, v0.11.3 May 24, 2024

XprobeBot modified the milestones: v0.11.3, v0.11.4, v0.12.0, v0.12.1 May 31, 2024

XprobeBot modified the milestones: v0.12.1, v0.12.2 Jun 14, 2024

XprobeBot modified the milestones: v0.12.2, v0.12.4, v0.13.0, v0.13.1 Jun 28, 2024

XprobeBot modified the milestones: v0.13.1, v0.13.2 Jul 12, 2024

XprobeBot modified the milestones: v0.13.2, v0.13.4 Jul 26, 2024

XprobeBot modified the milestones: v0.14, v0.15 Sep 3, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ENH: Support concurrent embedding, update LangChain QA demo with multithreaded embedding creation #348

ENH: Support concurrent embedding, update LangChain QA demo with multithreaded embedding creation #348

jiayini1119 commented Aug 14, 2023 •

edited

Loading

aresnow1 commented Aug 25, 2023

jiayini1119 commented Aug 28, 2023

ENH: Support concurrent embedding, update LangChain QA demo with multithreaded embedding creation #348

Are you sure you want to change the base?

ENH: Support concurrent embedding, update LangChain QA demo with multithreaded embedding creation #348

Conversation

jiayini1119 commented Aug 14, 2023 • edited Loading

aresnow1 commented Aug 25, 2023

jiayini1119 commented Aug 28, 2023

jiayini1119 commented Aug 14, 2023 •

edited

Loading