Is it deployed on the GPU?
Yes, it is running on the GPU. The visible symptom is stuttering. I looked into the API: the streaming endpoint's output arrives in discrete bursts. For example, five segments of text are emitted at 16:02:29, and the next output does not arrive until 16:02:34, a stall of roughly 5 seconds in between.
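To make the stall pattern above measurable rather than anecdotal, one can timestamp each chunk of the streaming response as it arrives and look at the gaps between consecutive chunks. This is a minimal sketch with a hypothetical list of arrival times (not real measurements from the server); any streaming client can feed real timestamps into the same functions.

```python
def chunk_gaps(timestamps):
    """Given per-chunk arrival times in seconds, return the gaps
    between consecutive chunks."""
    return [round(b - a, 3) for a, b in zip(timestamps, timestamps[1:])]

def stalls(timestamps, threshold=1.0):
    """Indices of gaps longer than `threshold` seconds (likely stalls)."""
    return [i for i, g in enumerate(chunk_gaps(timestamps)) if g > threshold]

# Hypothetical example matching the reported pattern: five chunks arrive
# close together, then a ~5 s stall before the next one.
arrivals = [0.0, 0.1, 0.2, 0.3, 0.4, 5.4]
print(chunk_gaps(arrivals))  # [0.1, 0.1, 0.1, 0.1, 5.0]
print(stalls(arrivals))      # [4]
```

If the stalls line up with fixed wall-clock intervals, that points at batching or scheduling on the server side rather than at slow token generation itself.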
This issue is stale because it has been open for 7 days with no activity.
System Info / 系統信息
Ubuntu 24.04, single RTX 4090
Running Xinference with Docker? / 是否使用 Docker 运行 Xinference?
Version info / 版本信息
Latest version
The command used to start Xinference / 用以启动 xinference 的命令
None provided
Reproduction / 复现过程
I wonder whether a comparison against ollama has been done. In my own test, the same model runs noticeably faster under ollama; I'm not sure whether this is a problem with my deployment. Is there any documentation on performance?
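One way to make the ollama comparison concrete is to run the same prompt through each backend's streaming API and compare decode throughput. The sketch below keeps the measurement logic separate from any network code so it can be checked offline; the fake stream at the bottom is a stand-in for a real sequence of response chunks, and chunk count is only a rough proxy for token count.

```python
import time

def throughput(n_tokens, elapsed_s):
    """Tokens (or chunks) generated per second; 0.0 if no time elapsed."""
    return n_tokens / elapsed_s if elapsed_s > 0 else 0.0

def benchmark(stream, clock=time.monotonic):
    """Consume an iterable of response chunks and return
    (chunk_count, chunks_per_second)."""
    start = clock()
    n = 0
    for _ in stream:
        n += 1
    return n, throughput(n, clock() - start)

# Offline example with a fake stream (no server needed):
n, rate = benchmark(iter(["Hello", " world", "!"]))
print(n)  # 3
```

Feeding `benchmark` the chunk iterator from each server's streaming response, with an identical prompt and generation length, gives a like-for-like number to compare instead of a subjective impression of speed.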
Expected behavior / 期待表现
Is there any documentation on performance, such as benchmark results?