Compile without cublas dlls? #10988

vladfaust · 2024-12-26T18:50:09Z

Is it possible to compile a llama binary that does not require cublas64_12.dll and cublasLt64_12.dll at runtime? cudart64_12.dll is tiny, but cuBLAS is around half a gigabyte! I don't want to ship it with my app, nor do I want to make users install the CUDA Toolkit (cuBLAS is not installed along with the regular NVIDIA drivers).

I tried setting -DGGML_CUDA_FORCE_MMQ=ON, but the binary still crashes at runtime because it can't find cublas64_12.dll.
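For reference, a minimal sketch of the kind of configure step described above. It assumes a standard llama.cpp CMake build with the CUDA backend enabled through GGML_CUDA; the build directory name and the Release configuration are assumptions, not the exact command that was run.

```shell
# Configure with the CUDA backend and force the MMQ kernels
# (assumed invocation; "build" is a hypothetical directory name).
cmake -B build -DGGML_CUDA=ON -DGGML_CUDA_FORCE_MMQ=ON

# Build in Release mode (assumed configuration).
cmake --build build --config Release
```

Even with this configuration, the resulting binary still fails to start when cublas64_12.dll is missing.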

Replies: 0 comments

Select a reply

Compile without cublas dlls? #10988

vladfaust Dec 26, 2024

Replies: 0 comments

vladfaust
Dec 26, 2024