Learning to Deceive with Attention-based Explanations

Code & associated data for the following paper.

Learning to Deceive with Attention-Based Explanations

Danish Pruthi, Mansi Gupta, Bhuwan Dhingra, Graham Neubig, Zachary C. Lipton

The 58th Annual Meeting of the Association for Computational Linguistics (ACL-20).

For dependencies, see the environment.yml file in the src directory. To recreate the same conda environment, run conda env create -f environment.yml (you might have to edit the prefix on the last line of the file). The code also uses a library called log for colorful printing (available here).
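If the prefix line in environment.yml points to a path that does not exist on your machine, one option is to drop that line before creating the environment. The snippet below is a minimal sketch, not part of this repository: the helper name `strip_prefix` is hypothetical, and it assumes the file ends with a top-level line of the form `prefix: /some/path`, as files exported by conda usually do.

```python
def strip_prefix(yaml_text: str) -> str:
    """Return the environment file text without any top-level `prefix:` line.

    Assumption: the machine-specific path is stored on a line that starts
    with `prefix:` (no indentation), as in files written by conda export.
    """
    kept = [line for line in yaml_text.splitlines()
            if not line.startswith("prefix:")]
    return "\n".join(kept) + "\n"


if __name__ == "__main__":
    # Illustrative input only; the real file lives in the src directory.
    sample = (
        "name: demo\n"
        "dependencies:\n"
        "  - python=3.7\n"
        "prefix: /home/user/miniconda3/envs/demo\n"
    )
    print(strip_prefix(sample), end="")
```

To apply it to the real file, read src/environment.yml, pass its contents through `strip_prefix`, write the result back, and then run conda env create -f environment.yml as described above.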

Please refer to the README files for our experiments on classification tasks and sequence-to-sequence tasks.

The examples alongside attention-based explanations used for the human-subject experiment:

Bibtex

@inproceedings{pruthi2020learning,
  title={Learning to Deceive with Attention-Based Explanations},
  author={Pruthi, Danish and Gupta, Mansi and Dhingra, Bhuwan and Neubig, Graham and Lipton, Zachary C},
  booktitle = {The 58th Annual Meeting of the Association for Computational Linguistics (ACL)},
  address = {Seattle, USA},
  month = {July},
  year = {2020}
}
