Support for Loading Model from Memory Buffer #10990

sienaiwun · 2024-12-26T21:41:11Z


Hi,

whisper.cpp provides a function, whisper_init_from_buffer_with_params, that loads a model directly from a memory buffer. This is particularly helpful in mobile applications, where the model is bundled with the app and has to be read from memory rather than through file I/O.
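For reference, here is a minimal sketch of the whisper.cpp pattern I have in mind. The helper names read_all_bytes and load_whisper_from_memory, and the USE_WHISPER build flag, are hypothetical and only there for illustration. The whisper.cpp calls are the ones declared in whisper.h as far as I know. On mobile the bytes would more likely come from an asset manager or app bundle than from a file path.

```cpp
#include <cstddef>
#include <cstdint>
#include <fstream>
#include <stdexcept>
#include <string>
#include <vector>

// USE_WHISPER is a hypothetical build flag: define it only when whisper.h
// is on the include path and the program is linked against whisper.cpp.
#ifdef USE_WHISPER
#include "whisper.h"
#endif

// Hypothetical helper: read a whole file into a byte buffer.
// A plain file is used here only so the sketch runs on a desktop.
std::vector<std::uint8_t> read_all_bytes(const std::string & path) {
    std::ifstream in(path, std::ios::binary | std::ios::ate);
    if (!in) {
        throw std::runtime_error("cannot open " + path);
    }
    const std::streamsize size = in.tellg();
    in.seekg(0, std::ios::beg);

    std::vector<std::uint8_t> buf(static_cast<std::size_t>(size));
    if (size > 0 && !in.read(reinterpret_cast<char *>(buf.data()), size)) {
        throw std::runtime_error("short read on " + path);
    }
    return buf;
}

#ifdef USE_WHISPER
// Hypothetical wrapper: initialize a whisper context from bytes already in memory.
// Returns nullptr if whisper.cpp fails to load the model.
whisper_context * load_whisper_from_memory(std::vector<std::uint8_t> & model_bytes) {
    whisper_context_params params = whisper_context_default_params();
    return whisper_init_from_buffer_with_params(
        model_bytes.data(), model_bytes.size(), params);
}
#endif
```

With USE_WHISPER defined, the caller would pass the buffer to load_whisper_from_memory and later release the context with whisper_free. An equivalent entry point in llama.cpp taking a pointer and a size is what this request is about.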

I’m wondering if llama.cpp could support a similar approach for loading models directly from a memory buffer. This would simplify usage in environments where file-based access is constrained or not feasible.

Is this feature already supported, or is it planned for the future? If not, could it be considered for implementation?
