Replies: 6 comments 1 reply
-
I am thinking about the same problem. It could be implemented through embeddings: search among documents for relevant parts ("indexing") and feed the matching chunks to the model as input, so we can have RAG. I have planned to test how plain embeddings work with local-ai and langchain, so I can search through a doc or a database and find what I want using context-similarity search.

For example, it might be useful for parsing user data or for working with historical data. Let's say you have a DB of user locations, which can be any location in any format, like "Rim, Italy -- Rome, Italy -- Berlin -- Bay Area -- НижнийНовгород -- Украина Киев -- Украiна Кiев -- worlidwide -- Moon(planet earh)" and so on. Working with historical data means you can just add a dump of all local news for a few years and make your model aware of today's context. Or it could be chat history with chat sessions, or a CVE list dump, or even your own codebase. But that is just embeddings.
-
But for a full RAG we also need some kind of
-
I found some kind of solution here:
-
Seconding this, since a lot of agentic libraries have RAG built in, with reranking and embedding functions.
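For context, the reranking those libraries bundle is usually a second stage after embedding retrieval: a first pass fetches candidates cheaply, then a stronger scorer reorders them. A minimal sketch, where the reranker is a stub scoring query-term overlap (a real pipeline would call a cross-encoder model here; the candidate strings are made up):

```python
# Sketch of the second (rerank) stage of a retrieve-then-rerank pipeline.
# NOTE: the scoring function is a stub; real rerankers use a cross-encoder.
def rerank(query: str, candidates: list[str], k: int = 3) -> list[str]:
    query_terms = set(query.lower().split())

    def score(doc: str) -> float:
        # Fraction of query terms that appear in the candidate document.
        doc_terms = set(doc.lower().split())
        return len(query_terms & doc_terms) / len(query_terms)

    return sorted(candidates, key=score, reverse=True)[:k]

candidates = ["vector database intro", "postgres tuning guide", "intro to vector search"]
print(rerank("vector search", candidates, k=2))
```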
-
Why are you thinking of it?
-
LM Studio currently works only via an embedding model, sending it through its server to e.g. privateGPT. I only know of Jan and privateGPT; maybe there are two more, but if so, they were too complicated to install.
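If anyone wants to try that embedding-model route, here is a hedged sketch of calling a local embedding server. Assumptions not confirmed in this thread: the server is running on `localhost:1234` (LM Studio's default port) and exposes an OpenAI-compatible `/v1/embeddings` endpoint; the model name is a placeholder for whatever embedding model is loaded.

```python
# Sketch: request embeddings from a local OpenAI-compatible server
# (e.g. LM Studio's built-in server). Uses only the standard library.
import json
import urllib.request

def build_embedding_request(texts, model="local-embedding-model"):
    # OpenAI-style request body: a model name plus a list of input strings.
    # "local-embedding-model" is a placeholder, not a real model identifier.
    return {"model": model, "input": list(texts)}

def embed_texts(texts, url="http://localhost:1234/v1/embeddings"):
    payload = json.dumps(build_embedding_request(texts)).encode()
    req = urllib.request.Request(
        url, data=payload, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(req) as resp:
        data = json.load(resp)
    # OpenAI-style response: one embedding vector per input text.
    return [item["embedding"] for item in data["data"]]

# vectors = embed_texts(["hello world"])  # requires the server to be running
```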
-
Are there any local docs/PDFs I can talk with?
Some library, all indexed and ready for use ;)