Skip to content

Used selective_scan_cuda and causal_conv1d_cuda, but still very slow to train #468

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
nice66 opened this issue Jul 15, 2024 · 1 comment

Comments

@nice66
Copy link

nice66 commented Jul 15, 2024

I used selective_scan_cuda and causal_conv1d_cuda. At first I was training at 30 seconds a round, but when I downloaded a couple of libraries, there was a problem ——ImportError: libcudart.so.11.0:. I know it's cuda so I re-downloaded mamba once but this time my training speed changed to 15 minutes a round. Does anyone know the reason for this please?

@AnaRhisT94
Copy link

AnaRhisT94 commented Jul 18, 2024

I created batch decoding code, this can speedup your training. Don't use the cuda version
#477

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants