Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG]:The program froze while attempting R1 Lora #6234

Open
2 tasks done
ygxw0909 opened this issue Feb 28, 2025 · 0 comments
Open
2 tasks done

[BUG]:The program froze while attempting R1 Lora #6234

ygxw0909 opened this issue Feb 28, 2025 · 0 comments
Labels
bug Something isn't working

Comments

@ygxw0909
Copy link

ygxw0909 commented Feb 28, 2025

Is there an existing issue for this bug?

  • I have searched the existing issues

The bug has not been fixed in the latest main branch

  • I have checked the latest main branch

Do you feel comfortable sharing a concise (minimal) script that reproduces the error? :)

Yes, I will share a minimal reproducible script.

🐛 Describe the bug

he program froze after print the following logs:

/root/miniconda3/lib/python3.10/site-packages/colossalai/kernel/extensions/utils.py:96: UserWarning: [extension] The CUDA version on the system (12.4) does not match with the version (12.1) torch was compiled with. The mismatch is found in the minor version. As the APIs are compatible, we will allow compilation to proceed. If you encounter any issue when using the built kernel, please try to build it again with fully matched CUDA versions
warnings.warn(
[extension] Loading the JIT-built cpu_adam_x86 kernel during runtime now

Environment

No response

@ygxw0909 ygxw0909 added the bug Something isn't working label Feb 28, 2025
@ygxw0909 ygxw0909 changed the title [BUG]: [BUG]:The program froze while attempting R1 Lora Feb 28, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

1 participant