Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Format extension: incorporating annotator notes? #53

Open
nschneid opened this issue Aug 25, 2019 · 1 comment
Open

Format extension: incorporating annotator notes? #53

nschneid opened this issue Aug 25, 2019 · 1 comment
Labels

Comments

@nschneid
Copy link
Contributor

The version of STREUSLE in Xposition contains some annotator notes on P tokens that are not included in the official release. The notes can help clarify the interpretation of the text, provide the annotator's rationale, or help cluster different usages at a finer level of granularity than the supersenses.

Should the .conllulex format have a place for these? An extra column? Or maybe a sentence header row, as they are rare?

Should there also be a standard for releasing rich annotation history metadata (such as who annotated which token, original vs. adjudicated annotations, timestamps, ...)?

@nschneid
Copy link
Contributor Author

nschneid commented Sep 1, 2019

Maybe notes should be in a standoff TSV format (similar to tquery.py output) that gets ingested into the JSON?

Distinguish token notes (tnote), lexical expression notes (lnote), sentence notes (snote)?

Allow notes for arbitrary subsets of a sentence's tokens (e.g. "this was considered but rejected as an MWE")?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

1 participant