
[BUG] Private GPT has an infinite loop of responses #2112

Open
SpkArtZen opened this issue Oct 28, 2024 · 5 comments
Labels
question (Further information is requested)

Comments

@SpkArtZen

Question

I have an issue with Private GPT:

When I send a prompt or chat completion with a large context (file size > 5 KB or multiple context files), the chat takes a long time to generate a response but never sends it. It just keeps generating a response, and the delay gets worse. Eventually, it sends a timeout error.

I don’t know how to fix this. I need to get its initial response, but in the end, I don’t receive anything.

SpkArtZen added the question label Oct 28, 2024
@jaluma
Collaborator

jaluma commented Oct 30, 2024

Can you give us more details about your environment? It's probably related to GPU and VRAM.

@SpkArtZen
Author

Yes, I use the default model, Llama 3.1 7B.
Screenshot 2024-10-30 102834 (attached)

@SpkArtZen
Author

Full logs: logs.txt
I sent a single request from the Python SDK; it behaves the same with Postman and curl.
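
Roughly, the request looks like this (a minimal sketch; the prompt_completion method and use_context field follow the pgpt_python README and may differ in other versions):

from pgpt_python.client import PrivateGPTApi

# Connect to the locally running PrivateGPT server.
client = PrivateGPTApi(base_url="http://localhost:8001")

# Single contextual completion over the ingested documents.
result = client.contextual_completions.prompt_completion(
    prompt="Summarize the uploaded document.",
    use_context=True,
)
print(result.choices[0].message.content)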

@jaluma
Collaborator

jaluma commented Nov 4, 2024

It should work equally with Postman and requests. Can you increase the request timeout?

client = PrivateGPTApi(base_url="http://localhost:8001", client=...)
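
For example, a minimal sketch that raises the client-side timeout by passing a pre-configured httpx.Client (assuming your pgpt_python version accepts one via the client argument, as above):

import httpx
from pgpt_python.client import PrivateGPTApi

# Allow slow generations up to 5 minutes before the HTTP client gives up.
client = PrivateGPTApi(
    base_url="http://localhost:8001",
    client=httpx.Client(timeout=httpx.Timeout(300.0)),
)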

And a few more things to take into account:

  1. When you use the full context window, the reply will take longer to generate; that's normal.
  2. Using a huge context instead of RAG strategies is probably not the best way to approach this kind of problem.
  3. Consider increasing the Ollama timeout if you keep seeing problems like the ones in your log. You can do this by modifying LLMComponent, in the Ollama statement (see the sketch below).
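
A minimal sketch of item 3, assuming LLMComponent builds the LLM with llama-index's Ollama class (argument names per llama_index.llms.ollama; adjust to your version):

from llama_index.llms.ollama import Ollama

# In private_gpt/components/llm/llm_component.py, where the Ollama LLM is built:
llm = Ollama(
    model="llama3.1",                   # hypothetical model name for illustration
    base_url="http://localhost:11434",  # the Ollama server seen in the logs
    request_timeout=300.0,              # seconds; raise this if generation is slow
)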

@SpkArtZen
Author

The main problem is that when I send a request, even through Postman, the response is generated multiple times and degrades each time.
It's the same with the SDK and Postman.
It also sends a request by itself:

2024-11-04 15:36:54 13:36:54.133 [INFO ] httpx - HTTP Request: POST http://localhost:11434/api/chat "HTTP/1.1 200 OK"
2024-11-04 15:36:59 [GIN] 2024/11/04 - 13:36:59 | 200 | 5.996617632s | 127.0.0.1 | POST "/api/chat"

After that, it generates the response again. I need to somehow accept only the first response.
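
One way to take only a single, complete response is to disable streaming on a raw HTTP call. A minimal sketch against PrivateGPT's OpenAI-style chat endpoint (field names per the PrivateGPT API reference; verify against your version):

import requests

resp = requests.post(
    "http://localhost:8001/v1/chat/completions",
    json={
        "messages": [{"role": "user", "content": "Summarize the uploaded document."}],
        "stream": False,      # ask for one complete response, not a stream
        "use_context": True,  # answer from the ingested documents
    },
    timeout=300,              # generous client-side timeout in seconds
)
resp.raise_for_status()
print(resp.json()["choices"][0]["message"]["content"])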
