python inference.py
python configure.py
python train.py
- assets -> contains diagram
- spanish_ner -> contains model from iteration 1 of system
- tokenizer -> contains tokenizer from iteration 1 of system
- spanish_ner_2 -> contains model from iteration 2 of system
- tokenizer_2 -> contains tokenizer from iteration 2 of system
- clean.py -> postprocessing script (NOT USED IN FINAL, HURT PERFORMANCE)
- configure.py -> creates transformed_train.csv and transformed_val.csv loaded in train.py
- train.py -> used to fine tune BERT models on spanish dataset