Replies: 1 comment
-
Hi @micos7, the transformer backend supports CUDA acceleration for embeddings.
-
I use the bert model for embeddings. I build it with `make BUILD_TYPE=cublas build`. When I test an embedding with a fairly large text, it takes the same time as the CPU-only build (the default). Thanks.