We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
大佬你好,我发现pro_data.py代码上有个小逻辑没有处理。但是不确定会不会最终影响实验结果。
Neu-Review-Rec/pro_data/data_pro.py
Line 256 in a119c1e
将224行划分成train:test = 2:8,可大概率复现该现象。
The text was updated successfully, but these errors were encountered:
master和pl都可能存在该问题
###该死的排版。。。忘了提交前先Preview一下了。
Sorry, something went wrong.
No branches or pull requests
大佬你好,我发现pro_data.py代码上有个小逻辑没有处理。但是不确定会不会最终影响实验结果。
Neu-Review-Rec/pro_data/data_pro.py
Line 256 in a119c1e
247行 和 253行可能导致 train数据集中重复添加相同记录,从而导致数据集处理前后总数不一致问题。
将224行划分成train:test = 2:8,可大概率复现该现象。
解决方法:移除247和253行代码,并在255行后添加data_train = pd.concat([data_train, data_test.loc[all_index]])
The text was updated successfully, but these errors were encountered: