Error for exllamav2_kernel, running TGI on Google Colab #1762
Unanswered
andychoi98
asked this question in
Q&A
Replies: 2 comments 4 replies
-
The Ensure you have the latest version by using |
Beta Was this translation helpful? Give feedback.
1 reply
-
Seem like the most useful part of that error is "Could not find TensorRT". Have you tried |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Trying to run the tgi launcher on Google Colab after locally installing, but keep on getting error messages that the kernel is not installed.
text-generation-launcher --model-id bigcode/starcoder2-3b --sharded false --quantize bitsandbytes-fp4
ERROR text_generation_launcher: exllamav2_kernels not installed.
ERROR text_generation_launcher: Shard 0 failed to start
Keep getting these errors even though I cloned and installed the turboderp/exllamav2 repo from github.
Seems like a simple issue but can anyone give me help how to solve this?
I'm running locally because google colab doesn't let me use the docker container for running the tgi.
Or is there a better way for doing this?
Thank you.
Beta Was this translation helpful? Give feedback.
All reactions