Skip to content

Chain velds encapsulating a spacy NER training setup on APIS data.

License

Notifications You must be signed in to change notification settings

veldhub/veld_chain__train_spacy_apis_ner

Repository files navigation

veld chain veld_chain__train_spacy_apis_ner

This repo contains chain velds encapsulating a spacy NER training setup on APIS data.

requirements

  • git
  • docker compose (note: older docker compose versions require running docker-compose instead of docker compose)

Clone this repo with all its submodules

git clone --recurse-submodules https://github.com/veldhub/veld_chain__train_spacy_apis_ner.git

how to reproduce

The following chain velds were used. Open the respective veld yaml file for more information.

./veld_convert.yaml

Cleaning and converting json into spaCy docbin

docker compose -f veld_convert.yaml up

./veld_create_config.yaml

Creates a spacy training config according to passed arguments. See https://spacy.io/usage/training/#config for the target outcome.

docker compose -f veld_create_config.yaml up

./veld_train.yaml

A NER trainig setup, utilizing spaCy 3's config system.

docker compose -f veld_train.yaml up

./veld_analysis.yaml

Analyses out-of vocabulary occurrences of training data.

docker compose -f veld_analysis.yaml up

./veld_publish_to_hf.yaml

Pushing spacy model to huggingface.

docker compose -f veld_publish_to_hf.yaml up

About

Chain velds encapsulating a spacy NER training setup on APIS data.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published