This repository contains our code submitted to the TextGraphs 2020 workshop (held at COLING 2020) shared task on Multi-Hop Inference Explanation Regeneration. The task details are here: https://competitions.codalab.org/competitions/23615
CITATION
If you find the code useful in your research, please consider citing our paper:
@inproceedings{gupta-srinivasaraghavan-2020-explanation,
    title = "Explanation Regeneration via Multi-Hop {ILP} Inference over Knowledge Base",
    author = "Gupta, Aayushee and Srinivasaraghavan, Gopalakrishnan",
    booktitle = "Proceedings of the Graph-based Methods for Natural Language Processing (TextGraphs)",
    month = dec,
    year = "2020",
    address = "Barcelona, Spain (Online)",
    publisher = "Association for Computational Linguistics",
    url = "https://www.aclweb.org/anthology/2020.textgraphs-1.13",
    pages = "109--114",
    abstract = "Textgraphs 2020 Workshop organized a shared task on 'Explanation Regeneration' that required reconstructing gold explanations for elementary science questions. This work describes our submission to the task which is based on multiple components: a BERT baseline ranking, an Integer Linear Program (ILP) based re-scoring and a regression model for re-ranking the explanation facts. Our system achieved a Mean Average Precision score of 0.3659.",
}
All our experiments were done in a Python 3.6 Miniconda environment.
Make sure the following packages are available in the environment (installable with pip):
numpy, tqdm, scikit-learn, pandas, requests, urllib3, scipy, nltk, torch, torchvision.
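For example:
pip install numpy tqdm scikit-learn pandas requests urllib3 scipy nltk torch torchvision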
For the BERT baseline ranking, the experiments were run on a Google Colab GPU.
A) TF-IDF Ranking
Run baseline.py with the following parameters to generate the TF-IDF baseline ranking:
python baseline.py -n 30 tablestore/v2.1/tables questions/questions.test.tsv > predictions/tfidf_top30.txt
To evaluate it, run:
python evaluate.py --gold questions/questions.test.tsv predictions/tfidf_top30.txt
This reports the MAP (Mean Average Precision) of the TF-IDF baseline ranking.
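For reference, Mean Average Precision averages, over all questions, the precision at each rank where a gold explanation fact is retrieved. A minimal illustration of the metric (evaluate.py remains the official implementation):

```python
def average_precision(ranked_ids, gold_ids):
    """AP for one question: mean of precision@k taken at each gold hit."""
    hits, precision_sum = 0, 0.0
    for k, uid in enumerate(ranked_ids, start=1):
        if uid in gold_ids:
            hits += 1
            precision_sum += hits / k
    return precision_sum / len(gold_ids) if gold_ids else 0.0

def mean_average_precision(predictions, gold):
    """Both arguments map a question id to its ranked/gold fact ids."""
    return sum(average_precision(predictions.get(q, []), set(g))
               for q, g in gold.items()) / len(gold)
```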
B) BERT Baseline Ranking
Run bert_reranker.py to generate training, eval, and test examples and features from the dataset, and then to train and test the model and save the test predictions, as follows:
Requirements:
pip install transformers==2.2.1
pip install tensorboardX
pip install pandas
Install apex (on Colab, the %%writefile cell magic writes the following commands to setup.sh, which is then executed):
%%writefile setup.sh
git clone https://github.com/NVIDIA/apex
cd apex
pip install -v --no-cache-dir ./
sh setup.sh
To train and evaluate:
python bert_reranker.py train
To test:
python bert_reranker.py test
If you do not wish to train and test the model yourself, the trained BERT model and test predictions are available at: https://drive.google.com/drive/folders/17nZaZ7t54CgGfCZ1rmi8NSoS3QFCAXnz?usp=sharing
model: pytorch_model.bin
test predictions from the model: tg2020_test_preds.npy
zipped test examples: examples_test_bert-base-uncased_140_tg2020.zip (unzip it and save it in the 'questions' folder).
Run write_predictions.py to write the baseline ranking predictions to a text file [tg2020_test_predicted.txt]. It requires the BERT test predictions [tg2020_test_preds.npy] in the main directory and the test examples file [examples_test_bert-base-uncased_140_tg2020] in the 'questions' directory:
python write_predictions.py
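For orientation, the script essentially turns the per-candidate BERT scores into the shared task's ranked prediction format, one "questionID<TAB>explanationID" line per candidate. A minimal sketch of that idea with inline placeholder data (the actual script reads the .npy scores and the examples file):

```python
import numpy as np

# Placeholder (question_id, fact_id) pairs and scores; in the real pipeline
# the scores come from tg2020_test_preds.npy (via np.load) and the pairs
# from the unzipped test examples file in 'questions'.
examples = [("Q1", "fact-a"), ("Q1", "fact-b"), ("Q2", "fact-c")]
scores = np.array([0.2, 0.9, 0.7])

by_question = {}
for (qid, fid), score in zip(examples, scores):
    by_question.setdefault(qid, []).append((float(score), fid))

with open("predictions/tg2020_test_predicted.txt", "w") as out:
    for qid, cands in by_question.items():
        for score, fid in sorted(cands, reverse=True):  # best score first
            out.write(f"{qid}\t{fid}\n")
```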
Run reorder_predictions.py to move duplicate predictions to the bottom (sketched below), then evaluate the predicted file using evaluate.py:
python reorder_predictions.py
python evaluate.py --gold questions/questions.test.tsv predictions/predictions_bert.txt
This reports the MAP of the BERT baseline ranking.
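The reordering can be pictured as follows: keep the first occurrence of every (question, fact) pair in place and push repeats to the bottom so they cannot crowd out distinct facts. A minimal sketch, assuming one tab-separated pair per line (reorder_predictions.py holds the actual logic):

```python
# Keep first occurrences in their original order; append duplicates at the end.
seen, unique, duplicates = set(), [], []
with open("predictions/tg2020_test_predicted.txt") as f:
    for line in f:
        pair = tuple(line.strip().split("\t"))
        (duplicates if pair in seen else unique).append(line)
        seen.add(pair)

with open("predictions/predictions_bert.txt", "w") as out:
    out.writelines(unique + duplicates)
```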
C) ILP Rescoring and Regression Reranking
The code for the ILP is hosted at: https://github.com/aayushee/semanticilp
It is written in Scala; clone the repository and follow the setup instructions in its README.
Run baseline.py and process_regression_data.py to generate the labeled file for the training data (the labeling idea is sketched after these commands):
python baseline.py -n 30 tablestore/v2.1/tables questions/questions.train.tsv > predictions/tfidf_train_top30.txt
python process_regression_data.py
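Conceptually, the labeling joins the top-30 TF-IDF candidates with the gold explanations in questions.train.tsv, marking each candidate fact 1 if it belongs to the gold explanation and 0 otherwise. A rough sketch of that join; the column names ('QuestionID', 'explanation' with space-separated 'factUID|ROLE' entries) follow the worldtree question format, and the output file name here is only illustrative (process_regression_data.py defines the real formats):

```python
import pandas as pd

# Map each question id to the set of gold explanation fact UIDs.
questions = pd.read_csv("questions/questions.train.tsv", sep="\t")
gold = {}
for _, row in questions.iterrows():
    if isinstance(row.get("explanation"), str):
        gold[row["QuestionID"]] = {e.split("|")[0]
                                   for e in row["explanation"].split()}

# Label each TF-IDF candidate by membership in the gold explanation.
with open("predictions/tfidf_train_top30.txt") as f, \
     open("predictions/train_labeled.txt", "w") as out:  # illustrative name
    for line in f:
        qid, fid = line.strip().split("\t")
        out.write(f"{qid}\t{fid}\t{int(fid in gold.get(qid, set()))}\n")
```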
Run preprocess_ilp.py to generate the train and test data files for the ILP model:
python preprocess_ilp.py train
python preprocess_ilp.py test
Run get_ilp_results.py to get scores from the ILP for the train and test data.
Set the solver port and solver domain in lines 9 and 10 to match the SemanticILP setup above; they default to '9003' and 'localhost'.
Set n = 2/3/4 to get 2/3/4 feature scores.
Set the parameter mode = train/test:
python get_ilp_results.py mode n
The SemanticILP code is set up for 3 features by default; to change the feature parameter, see the repository linked above.
Run linear_regression.py to fit a linear regression model on the dataset, then run regression_predictions.py to sort the scores and regenerate the ranked predictions (the re-ranking idea is sketched after these steps).
Set n as 2/3/4 to get the 2/3/4-feature combined scores respectively:
python linear_regression.py n
Set n as 2/3/4 to get the 2/3/4-feature predictions respectively:
python regression_predictions.py n
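In brief, the re-ranking fits a linear model mapping each candidate fact's n ILP feature scores to its relevance label, then sorts candidates by predicted score. A minimal scikit-learn sketch with placeholder arrays (the real scripts read the features and labels from the ILP output files):

```python
import numpy as np
from sklearn.linear_model import LinearRegression

# Placeholder data: n=3 ILP feature scores per candidate fact, with a 0/1
# relevance label taken from the labeled training file.
X_train = np.array([[0.8, 0.5, 0.7], [0.1, 0.2, 0.0], [0.6, 0.9, 0.4]])
y_train = np.array([1, 0, 1])

model = LinearRegression().fit(X_train, y_train)

# Score unseen candidates and rank them, highest combined score first.
X_test = np.array([[0.7, 0.6, 0.5], [0.2, 0.1, 0.3]])
ranking = np.argsort(-model.predict(X_test))
print(ranking)  # candidate indices in re-ranked order
```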
The top-30 predictions from the regression model are also available in the 'predictions' folder:
regression_preds3.txt
regression_preds4.txt
Run copy_predictions.py to append the remaining BERT baseline predictions after the top-K (the merge is sketched below), then run evaluate.py to evaluate the generated predictions.
Set n as 2/3/4 to get 2/3/4 features predictions respectively.
python copy_predictions.py n
Generated files: pred_bert_regression_2scores_combined.txt, pred_bert_regression_3scores_combined.txt, pred_bert_regression_4scores_combined.txt
python evaluate.py --gold questions/questions.test.tsv predictions/[generated file.txt]
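In outline, the merge keeps the regression model's top-K facts per question and then appends the rest of the BERT baseline ranking, skipping facts already placed, so every question keeps a full ranked list. A minimal sketch (K and the file formats here are assumptions; copy_predictions.py defines the real ones):

```python
K = 30  # number of facts taken from the regression re-ranking

def load(path):
    """Read 'questionID<TAB>factID' lines into an ordered per-question dict."""
    rows = {}
    with open(path) as f:
        for line in f:
            qid, fid = line.strip().split("\t")
            rows.setdefault(qid, []).append(fid)
    return rows

regression = load("predictions/regression_preds3.txt")
bert = load("predictions/predictions_bert.txt")

with open("predictions/pred_bert_regression_3scores_combined.txt", "w") as out:
    for qid, facts in bert.items():
        top = regression.get(qid, [])[:K]
        for fid in top + [f for f in facts if f not in top]:
            out.write(f"{qid}\t{fid}\n")
```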
D) Summarizer Ranking
Run textrank.py to get predictions on the test data from the summarizer:
python preprocess_ilp.py test
python textrank.py
This will create the predictions file. To evaluate the predictions, run:
python evaluate.py --gold questions/questions.test.tsv predictions/predict_textrank.txt
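For intuition, the summarizer ranking treats the candidate facts as sentences of a document and ranks them with TextRank, i.e. PageRank over a sentence-similarity graph. A minimal sketch using TF-IDF cosine similarity and power iteration (textrank.py implements the variant actually used for the submission):

```python
import numpy as np
from sklearn.feature_extraction.text import TfidfVectorizer

facts = ["friction is a force",
         "a force can change the motion of an object",
         "the sun is a star"]

# Similarity graph: cosine similarity between TF-IDF vectors of the facts
# (TfidfVectorizer L2-normalizes rows, so the dot product is the cosine).
tfidf = TfidfVectorizer().fit_transform(facts)
sim = (tfidf @ tfidf.T).toarray()
np.fill_diagonal(sim, 0.0)

# Row-normalize and run PageRank-style power iteration with damping d.
trans = sim / np.maximum(sim.sum(axis=1, keepdims=True), 1e-9)
d, scores = 0.85, np.full(len(facts), 1.0 / len(facts))
for _ in range(50):
    scores = (1 - d) / len(facts) + d * trans.T @ scores

print([facts[i] for i in np.argsort(-scores)])  # facts, highest rank first
```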