Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

when I was training I got the following error #9

Open
caogj-0521 opened this issue Jul 10, 2023 · 3 comments
Open

when I was training I got the following error #9

caogj-0521 opened this issue Jul 10, 2023 · 3 comments

Comments

@caogj-0521
Copy link

caogj-0521 commented Jul 10, 2023

Hello:

I'm using the main branch an follow the procedure:

  1. use challenger dataset to process,
    2.use processing data to train
    3.train command:
    python start_train.py --model_name tgnet_fps --config_path train_configs/tgnet_fps.py --experiment_name tgnet --input_data_dir_path processDir --train_data_split_txt_path split/base_name_train_fold.txt --val_data_split_txt_path split/base_name_val_fold.txt
    However, the following error occure
    image

When I comment out the code ‘raise’:
the following error occurre
39d08ca81aeab5de92173beaf461f0a

My running environment:
The graphics card is RTX 3090
Python 3.7.16
torch: 1.13.1+cu117, cuda 11.7,cudnn 8500
Ubuntu 20.4

Now I don't know where the problem is, Can you help me?

@caogj-0521 caogj-0521 changed the title about python version when I was training I got the following error Jul 19, 2023
@limhoyeon
Copy link
Owner

maybe this issue is related to #4.
there are issues with CUDA functions not executing properly in some PC environments. I plan to remove the CUDA functions and replace them with Python code. Please bear with me a little longer, or alternatively, you can try reinstalling the 'pointops' library from https://github.com/POSTECH-CVLab/point-transformer/tree/master/lib/pointops.

@supgy
Copy link

supgy commented Feb 2, 2024

Hello:

I'm using the main branch an follow the procedure:

  1. use challenger dataset to process,
    2.use processing data to train
    3.train command:
    python start_train.py --model_name tgnet_fps --config_path train_configs/tgnet_fps.py --experiment_name tgnet --input_data_dir_path processDir --train_data_split_txt_path split/base_name_train_fold.txt --val_data_split_txt_path split/base_name_val_fold.txt
    However, the following error occure
    image

When I comment out the code ‘raise’: the following error occurre 39d08ca81aeab5de92173beaf461f0a

My running environment: The graphics card is RTX 3090 Python 3.7.16 torch: 1.13.1+cu117, cuda 11.7,cudnn 8500 Ubuntu 20.4

Now I don't know where the problem is, Can you help me?

Hello, have you solved this problem? I encountered the same problem as you.
My running environment:
The graphics card is RTX 4090
Python 3.6.13
pytorch1.10 cuda11.7 cudnn
Ubuntu 20.4
Hope your reply. :)

@limhoyeon
Copy link
Owner

Did you tried to replace the point ops with "https://github.com/POSTECH-CVLab/point-transformer/tree/master/lib/pointops"??

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants