amr_coref

A python library / model for creating co-references between AMR graph nodes.

About

amr_coref is a python library and trained model designed to do co-referencing between Abstract Meaning Representation graphs.

The project follows the general approach of the neuralcoref project and it's excellent blog on the co-referencing. However, the model is trained to do direct co-reference resolution between graph nodes and does not depend on the sentences the graphs were created from.

The trained model achieves the following scores

MUC   :  R=0.647  P=0.779  F₁=0.706
B³    :  R=0.633  P=0.638  F₁=0.630
CEAF_m:  R=0.515  P=0.744  F₁=0.609
CEAF_e:  R=0.200  P=0.734  F₁=0.306
BLANC :  R=0.524  P=0.799  F₁=0.542
CoNLL-2012 average score: 0.548

Project Status

!! The following papers have GitHub projects/code that are better scoring and may be a preferable solution. See the uploaded file in #1 for a quick view of scores.

Note that due to the use of multiprocessing, this code may only be compatible with a Debian style OS. See #3 for details on the issue.

Installation and usage

There is currently no pip installation. To use the library, simply clone the code and use it in place.

The pre-trained model can be downloaded from the assets section in releases.

To use the model create a data directory and un-tar the model in it.

The script 40_Run_Inference.py, is an example of how to use the model.

Training

If you'd like to train the model from scratch, you'll need a copy of the AMR corpus. To complete training, run the scripts in order.

10_Build_Model_TData.py
12_Build_Embeddings.py
14_Build_Mention_Tokens.py
30_Train_Model.py.

You'll need amr_annotation_3.0 and GloVe/glove.6B.50d.txt in your data directory

The first few scripts will create the training data in data/tdata and the model training script will create data/model. Training takes less than 4 hours.

Name	Name	Last commit message	Last commit date
Latest commit bjascob Update README.md Dec 11, 2022 5c79917 · Dec 11, 2022 History 10 Commits
amr_coref	amr_coref	add device override to model loading	Apr 15, 2022
configs	configs	initial commit	Mar 4, 2021
.gitignore	.gitignore	Update README	Mar 4, 2021
10_Build_Model_TData.py	10_Build_Model_TData.py	initial commit	Mar 4, 2021
12_Build_Embeddings.py	12_Build_Embeddings.py	initial commit	Mar 4, 2021
14_Build_Mention_Tokens.py	14_Build_Mention_Tokens.py	Update README	Mar 4, 2021
20_Stats_for_Featurizer.py	20_Stats_for_Featurizer.py	initial commit	Mar 4, 2021
22_Cluster_Stats.py	22_Cluster_Stats.py	initial commit	Mar 4, 2021
24_Plot_Bins.py	24_Plot_Bins.py	initial commit	Mar 4, 2021
30_Train_Model.py	30_Train_Model.py	initial commit	Mar 4, 2021
32_Test_Model.py	32_Test_Model.py	initial commit	Mar 4, 2021
34_Save_Model.py	34_Save_Model.py	initial commit	Mar 4, 2021
40_Run_Inference.py	40_Run_Inference.py	add device override to model loading	Apr 15, 2022
LICENSE	LICENSE	initial commit	Mar 4, 2021
README.md	README.md	Update README.md	Dec 11, 2022
requirements.txt	requirements.txt	initial commit	Mar 4, 2021
run_tensorboard.sh	run_tensorboard.sh	initial commit	Mar 4, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

amr_coref

About

Project Status

Installation and usage

Training

About

Releases 1

Packages

Languages

License

bjascob/amr_coref

Folders and files

Latest commit

History

Repository files navigation

amr_coref

About

Project Status

Installation and usage

Training

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages