Hybrid Queries: What is the default value of Theta, in Theta.K #38571
-
Hello Milvus Community, In the paper titled 'Milvus: A Purpose-Built Vector Data Management System',
Also I have given the link to the paper as well for any reference. Milvus paper |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 3 replies
-
In the paper, θ-K is for "vector scan first then attribute full scan". In this approach, the search engine performs a two-step opertation:
Let's say the expected result is K=100 items. θ is not a const value, it must be greater than 1.0. In my opinion, the runtime value should be configured by users. Anyway, it is a paper written in 2021. Now 2025 is coming. Now the milvus is mainly using "attribute full scan first then vector scan" for filtering search: In filtering search, a bitset is generated by the first step "attribute full scan", each bit represents a position of a vector, and the second step "vector scan" will skip the vectors that are marked as 0 in the bitset. |
Beta Was this translation helpful? Give feedback.
The paper described some possible methods for filtering search. Not all the methods are good in production. Currently, Milvus only uses "attribute full scan first then vector scan" for filtering search. So, no θ.