[Experimental] The Python package published on PyPI now includes GPU support. The binary is compiled with CUDA 10.1, but all CUDA dependencies are integrated in the package and do not need to be installed on the system. The only requirement should be a working GPU with driver version >= 418.39.
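The driver requirement above can be checked before trying GPU translation. The following is a minimal sketch, not part of the package: the helper names (`parse_driver_version`, `driver_is_supported`, `query_driver_version`) are hypothetical, and it assumes the `nvidia-smi` tool that ships with the NVIDIA driver is on the `PATH`.

```python
import shutil
import subprocess

# Minimum driver version stated in the release note above.
MIN_DRIVER = (418, 39)


def parse_driver_version(text):
    """Turn a version string such as '418.39' or '440.33.01' into a tuple of ints."""
    return tuple(int(part) for part in text.strip().split("."))


def driver_is_supported(text, minimum=MIN_DRIVER):
    """Return True if the driver version string is at least `minimum`."""
    return parse_driver_version(text) >= minimum


def query_driver_version():
    """Ask nvidia-smi for the driver version; return None if it is unavailable."""
    if shutil.which("nvidia-smi") is None:
        return None
    result = subprocess.run(
        ["nvidia-smi", "--query-gpu=driver_version", "--format=csv,noheader"],
        capture_output=True,
        text=True,
    )
    if result.returncode != 0 or not result.stdout.strip():
        return None
    # nvidia-smi prints one line per GPU; the driver is shared, so use the first.
    return result.stdout.splitlines()[0].strip()


if __name__ == "__main__":
    version = query_driver_version()
    if version is None:
        print("No NVIDIA driver detected (nvidia-smi not found or failed).")
    else:
        status = "OK" if driver_is_supported(version) else "too old"
        print(f"Driver {version}: {status}")
```

The comparison is done on integer tuples rather than strings, so a version such as `440.33.01` is ordered correctly against `418.39`.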
Fixes and improvements
Remove the TensorRT dependency to simplify installation and reduce memory usage:
Reduce the size of the GPU Docker images by 600MB
Reduce memory usage on the GPU and the system by up to 1GB
Reduce initialization time during the first GPU translation
Improve TopK performance on GPU for K < 5
Improve INT8 performance on GPU
Accept linear layers without bias when converting models