Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Store labeled examples in a table #63

Open
ecsalomon opened this issue Sep 28, 2017 · 0 comments
Open

Store labeled examples in a table #63

ecsalomon opened this issue Sep 28, 2017 · 0 comments

Comments

@ecsalomon
Copy link

ecsalomon commented Sep 28, 2017

Labeled training example pairs should be stored in a table for selection and reuse. Data stored for examples should include:

  • Source
  • Source ids
  • Label
  • Label date
  • Comment to store labeling rules applied by labeler

Storing examples like this allows them to by reused in the following ways:

  • Select specific subsets of labeled pairs to build models from
  • Store multiple labels for the same pair to determine how different labeling rules affect results
  • Dynamically format labeled examples for models that use different set of input data by linking examples to source data tables

This will also entail a test that any model is only trained on one label per pair!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant