rwkv_clone_context thread-safety when using cuBLAS #182

eduardsui · 2024-09-14T08:30:29Z

Hello,

I'm trying to use rwkv.cpp in two different threads. For this, I'm loading the model and then using two context clones (via rwkv_clone_context). Everything works fine when each thread runs rwkv_eval, but when running simultaneously in two threads, I get an error:

GGML_ASSERT: /root/rwkv.cpp/ggml/src/ggml-cuda.cu:409: ptr == (void *) (pool_addr + pool_used)
GGML_ASSERT: /root/rwkv.cpp/ggml/src/ggml-cuda.cu:409: ptr == (void *) (pool_addr + pool_used)

It seems that alloc/free are called "out of order" for the two contexts. Any idea how to solve this?

Thanks!

The text was updated successfully, but these errors were encountered:

eduardsui changed the title ~~rwkv_clone_context when using cuBLAS~~ rwkv_clone_context thread-safety when using cuBLAS Sep 14, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

rwkv_clone_context thread-safety when using cuBLAS #182

rwkv_clone_context thread-safety when using cuBLAS #182

eduardsui commented Sep 14, 2024

rwkv_clone_context thread-safety when using cuBLAS #182

rwkv_clone_context thread-safety when using cuBLAS #182

Comments

eduardsui commented Sep 14, 2024