WhoIsWho-IND-KDD-2024-rank6

Reproduction Instructions

Run ipynb 01-10 in sequence or train sh

01 get text(title,abstract,keyword,venue) embedding from tfidf,word2vec,chatglm3 and bge-m3

02 For each autherID, calculate the similarity between each pid and other pids

03 Extract strongly correlated information (co author, co-org, co-keyword...)

04 tree model

05 gnn model and post-processs

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
sub		sub
01_data_prepare.ipynb		01_data_prepare.ipynb
02_bge_m3_feats.ipynb		02_bge_m3_feats.ipynb
03_chatglm_feats.ipynb		03_chatglm_feats.ipynb
04_feature.ipynb		04_feature.ipynb
05_feature.ipynb		05_feature.ipynb
06_feature_merge.ipynb		06_feature_merge.ipynb
07_feature_2.ipynb		07_feature_2.ipynb
08_tree_model.ipynb		08_tree_model.ipynb
09_gnn_model.ipynb		09_gnn_model.ipynb
10_ensemble_postprocess.ipynb		10_ensemble_postprocess.ipynb
README.md		README.md
chatglm_abstract.py		chatglm_abstract.py
chatglm_title.py		chatglm_title.py
test.sh		test.sh
train.sh		train.sh