Triton Example with AI Gateway #13587
FernandoDorado
started this conversation in
Ideas and feature requests
Replies: 1 comment 2 replies
-
@fffonion I believe that this warrants support for a new LLM type in our system (TensorRT-LLM)? |
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hello,
We are currently evaluating open-source deployment tools and are particularly interested in the integration of TensorRT-LLM with Triton. We understand from the documentation that this can be achieved using a Custom Python Server, but the implementation details are not entirely clear.
Could we possibly collaborate on preparing a detailed demo or example showcasing this integration? Additionally, it might be beneficial to explore the creation of a step-by-step guide, which could serve as a valuable resource for both our team and the broader community.
Beta Was this translation helpful? Give feedback.
All reactions