I searched the repo and found no obvious support for the NVIDIA Transformer Engine. I don't know whether this is already on the developer roadmap, but it sounds like it could improve performance, possibly significantly. The C/C++ API is here.
It doesn't look like a simple drop-in replacement, though. I have no idea what level of effort (LOE) something like this would take, but, very selfishly, I would like to use it on our new DGX-H200.