systematicity/Analysis at master · emilygoodwin/systematicity

Files in this directory:
README.md
comparing_encoders_IOCW.R
comparing_encoders_KWP.R
comparing_encoders_consistency.R
data_analysis.ipynb
format_all.sh
format_data.py
output_cruncher.py
plotConsistency.R
plotKWP.R
plottIOCW.R
word_analysis.ipynb

README.md

Point format_data.py at the results file from one experiment; it should write out a file named metadata.json.

(Use format_all.sh to save you from manually running format_data.py on all of the output files individually.)

Then run output_cruncher.py, which reads in metadata.json and writes out CSV files with pairs of test items.
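
The two steps above can be sketched as a small Python driver. This is only an illustration: the README does not document either script's command-line interface, so the sketch assumes (hypothetically) that format_data.py takes the results path as its single positional argument and that output_cruncher.py takes no arguments. Check both scripts before relying on it.

```python
import subprocess
import sys


def pipeline_commands(results_file):
    """Return the formatting and crunching commands, in the order they must run.

    ASSUMPTION: format_data.py accepts the results path as its only positional
    argument, and output_cruncher.py needs no arguments because it reads
    metadata.json from the working directory. Neither is confirmed by the README.
    """
    return [
        [sys.executable, "format_data.py", str(results_file)],
        [sys.executable, "output_cruncher.py"],
    ]


def run_pipeline(results_file, dry_run=True):
    """Print the commands (dry run) or execute them, stopping at the first failure."""
    commands = pipeline_commands(results_file)
    for cmd in commands:
        if dry_run:
            print(" ".join(cmd))
        else:
            subprocess.run(cmd, check=True)  # raises CalledProcessError on failure
    return commands
```

Calling `run_pipeline("results.txt")` only prints the two commands; pass `dry_run=False` to execute them. The file name `results.txt` is a placeholder, not a name taken from the repository.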

To get the plots for the known word perturbation (KWP) probe:
run comparing_encoders_KWP.R and then plotKWP.R

To get the plots for the logical consistency probe:
run comparing_encoders_consistency.R and then plotConsistency.R

To get the plots for the IOCW probe:
run comparing_encoders_IOCW.R and then plottIOCW.R (the file name in this directory is spelled with a double "t")
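
The per-probe instructions above amount to a lookup from probe name to a pair of R scripts run in order. A minimal sketch follows, assuming the scripts can be run non-interactively with `Rscript` and take no arguments (the README does not say how they are meant to be invoked). The probe keys are labels chosen for this example; the script names are the ones in this directory's file listing.

```python
import subprocess

# Probe label -> (comparison script, plotting script), run in that order.
# The labels are hypothetical; the file names come from the directory listing.
PROBE_SCRIPTS = {
    "KWP": ("comparing_encoders_KWP.R", "plotKWP.R"),
    "consistency": ("comparing_encoders_consistency.R", "plotConsistency.R"),
    "IOCW": ("comparing_encoders_IOCW.R", "plottIOCW.R"),
}


def plot_commands(probe):
    """Return the Rscript commands for one probe, comparison step first."""
    if probe not in PROBE_SCRIPTS:
        raise KeyError(f"unknown probe {probe!r}; expected one of {sorted(PROBE_SCRIPTS)}")
    compare, plot = PROBE_SCRIPTS[probe]
    # ASSUMPTION: each script runs standalone via Rscript with no arguments.
    return [["Rscript", compare], ["Rscript", plot]]


def make_plots(probe, dry_run=True):
    """Print the commands (dry run) or execute them, stopping at the first failure."""
    commands = plot_commands(probe)
    for cmd in commands:
        if dry_run:
            print(" ".join(cmd))
        else:
            subprocess.run(cmd, check=True)
    return commands
```

For example, `make_plots("KWP")` prints the two commands for the known word perturbation probe without running them.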

Notebook word_analysis.ipynb will analyze the word embeddings, try to cluster them (Figure 1 in the paper) and get some similarity metrics.
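
As an illustration of what a similarity metric over word embeddings can look like, here is a minimal cosine-similarity sketch in plain Python. Whether word_analysis.ipynb uses cosine similarity, or a different metric, is an assumption; the function below is not taken from the notebook.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors.

    Illustrative only: not necessarily the metric used in word_analysis.ipynb.
    """
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_u * norm_v)
```

Values near 1 indicate embeddings pointing in nearly the same direction, values near 0 indicate nearly orthogonal embeddings, and values near -1 indicate opposite directions.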