Where to learn about threading logic in llama.cpp #10770
Unanswered
Nick-infinity asked this question in Q&A
Replies: 0 comments
Hello, I am trying to understand how multithreading works in llama.cpp/ggml and where the control point is. Looking at the operator code, I can only see the microkernel. I assume this microkernel is called by N threads, which split the work into chunks and run it in parallel. Can someone please help me better understand how multiple threads are controlled within an operator, and who owns and makes these N calls?
Thanks,
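For context, ggml CPU kernels follow a convention where every worker thread calls the same operator function and receives a thread index (`ith`) and a thread count (`nth`); the kernel then computes which slice of rows it owns. Launching the `nth` workers is done by the graph-compute code of the CPU backend, not inside the operator itself. Below is a minimal, self-contained sketch of that pattern; the `compute_params` struct mirrors ggml's `ith`/`nth` convention, but all other names are illustrative and not the actual ggml source.

```c
// Illustrative sketch of the ith/nth work-splitting pattern used by ggml
// CPU kernels. Only the ith/nth convention is taken from ggml; everything
// else is simplified for clarity and is not the actual llama.cpp/ggml code.
#include <pthread.h>
#include <stdio.h>

#define N_ROWS    8
#define N_COLS    4
#define N_THREADS 3

struct compute_params {
    int ith;   // index of this thread (0 .. nth-1)
    int nth;   // total number of threads working on the op
};

static float src[N_ROWS][N_COLS];
static float dst[N_ROWS];

// The "microkernel": every thread calls the same function; ith/nth decide
// which slice of rows this particular thread is responsible for.
static void compute_forward_row_sum(const struct compute_params *params) {
    // contiguous chunking: rows are split as evenly as possible across nth threads
    const int rows_per_thread = (N_ROWS + params->nth - 1) / params->nth;
    const int ir0 = params->ith * rows_per_thread;
    int ir1 = ir0 + rows_per_thread;
    if (ir1 > N_ROWS) ir1 = N_ROWS;

    for (int ir = ir0; ir < ir1; ++ir) {
        float sum = 0.0f;
        for (int ic = 0; ic < N_COLS; ++ic) {
            sum += src[ir][ic];
        }
        dst[ir] = sum;
    }
}

// Worker entry point: in ggml this role is played by the graph-compute
// thread loop, which walks the graph nodes and calls the op kernels.
static void *worker(void *arg) {
    compute_forward_row_sum((const struct compute_params *)arg);
    return NULL;
}

int main(void) {
    for (int r = 0; r < N_ROWS; ++r)
        for (int c = 0; c < N_COLS; ++c)
            src[r][c] = (float)(r + 1);

    pthread_t threads[N_THREADS];
    struct compute_params params[N_THREADS];

    // The "owner" of the N calls: a dispatcher that launches nth workers,
    // each with its own ith. In ggml this lives in the CPU backend's
    // graph-compute code, not in the operator itself.
    for (int i = 0; i < N_THREADS; ++i) {
        params[i] = (struct compute_params){ .ith = i, .nth = N_THREADS };
        pthread_create(&threads[i], NULL, worker, &params[i]);
    }
    for (int i = 0; i < N_THREADS; ++i) {
        pthread_join(threads[i], NULL);
    }

    for (int r = 0; r < N_ROWS; ++r) {
        printf("row %d sum = %.1f\n", r, dst[r]);
    }
    return 0;
}
```

In the real code the dispatcher additionally has to synchronize the workers between graph nodes, so that a downstream operator only starts once its inputs have been fully written by all threads.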