Spanish-NER

General overview:

Running inference (this produces test_ans.csv; everything else should already be configured):

python inference.py

Running training (run configure.py first, then train.py):

python configure.py
python train.py
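The two training steps depend on each other: train.py loads the CSV files that configure.py writes. As a minimal sketch (not part of the repository), the order can be enforced from Python so that training never starts after a failed configure step. The function name `run_training` and its `runner` parameter are hypothetical, introduced here for illustration only.

```python
import subprocess
import sys

# Order taken from the README: configure.py writes the CSVs
# that train.py loads, so it has to run first.
TRAINING_STEPS = ("configure.py", "train.py")


def run_training(steps=TRAINING_STEPS, runner=subprocess.run):
    """Run each script in order, stopping at the first failure.

    `runner` is a hypothetical hook (defaults to subprocess.run) so the
    sequencing can be exercised without launching real processes.
    """
    for script in steps:
        # check=True makes subprocess.run raise CalledProcessError on a
        # non-zero exit code, so train.py is skipped if configure.py fails.
        runner([sys.executable, script], check=True)
```

Calling `run_training()` from the repository root should be equivalent to typing the two commands above one after the other.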

Directory breakdown:

assets -> contains the diagram
spanish_ner -> contains the model from iteration 1 of the system
tokenizer -> contains the tokenizer from iteration 1 of the system
spanish_ner_2 -> contains the model from iteration 2 of the system
tokenizer_2 -> contains the tokenizer from iteration 2 of the system
clean.py -> postprocessing script (NOT USED IN FINAL, HURT PERFORMANCE)
configure.py -> creates transformed_train.csv and transformed_val.csv, which are loaded in train.py
inference.py -> runs inference and produces test_ans.csv
train.py -> fine-tunes BERT models on the Spanish dataset
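The model and tokenizer directories come in pairs, one pair per iteration. The sketch below shows one way to pick a pair and load it. It assumes, without confirmation from the repository, that the directories were written with Hugging Face `save_pretrained` and hold a token-classification model. The helper names `artifact_paths` and `load_iteration` are hypothetical.

```python
from pathlib import Path

# Directory names as given in the breakdown above.
ARTIFACT_DIRS = {
    1: ("spanish_ner", "tokenizer"),
    2: ("spanish_ner_2", "tokenizer_2"),
}


def artifact_paths(iteration=2, root="."):
    """Return (model_dir, tokenizer_dir) for the given system iteration."""
    if iteration not in ARTIFACT_DIRS:
        raise ValueError(f"unknown iteration: {iteration!r}")
    model_dir, tokenizer_dir = ARTIFACT_DIRS[iteration]
    return Path(root) / model_dir, Path(root) / tokenizer_dir


def load_iteration(iteration=2, root="."):
    """Load (model, tokenizer) for one iteration.

    ASSUMPTION: both directories are in Hugging Face save_pretrained format.
    """
    # Imported lazily so artifact_paths works without transformers installed.
    from transformers import AutoModelForTokenClassification, AutoTokenizer

    model_dir, tokenizer_dir = artifact_paths(iteration, root)
    tokenizer = AutoTokenizer.from_pretrained(str(tokenizer_dir))
    model = AutoModelForTokenClassification.from_pretrained(str(model_dir))
    return model, tokenizer
```

If that assumption holds, `model, tokenizer = load_iteration(2)` run from the repository root would load the iteration 2 artifacts.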
