We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Accelerate model weight loading,Now it takes a lot of time
When I start tgi, it takes up most of my time
I have modified my local tgi and want to submit PR
The text was updated successfully, but these errors were encountered:
Hi,my idea is different,what I do is load the weights directly to the corresponding device; when i using cuda, this way will make it fast
Sorry, something went wrong.
No branches or pull requests
Feature request
Accelerate model weight loading,Now it takes a lot of time
Motivation
When I start tgi, it takes up most of my time
Your contribution
I have modified my local tgi and want to submit PR
The text was updated successfully, but these errors were encountered: