Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

update README.md #4

Open
wants to merge 1 commit into
base: main
Choose a base branch
from
Open
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
6 changes: 3 additions & 3 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -9,18 +9,18 @@ With a total of 24,460 syntactically annotated texts, the ETCSANS core corpus cu

Overall, ETCSANS syntax annotation is largely derived from manual annotation or translations rather than manually created. Given the amount of data and the high degree of specialization required in doing the annotation, this is unavoidable, but from a methodological view, it presents a challenge. We plan for a moderated crowdsourcing process to improve and verify annotations via CDLI (which provides such a workflow since more than 10 years for transcriptions). The necessary tools are linked in the [`tools/`](tools) folder.

## known issues
## Known Issues

- `v.0.1/extended`: change from morphology format to syntax format, run annotator
- `v.0.1/core`: transaction provide partial annotations, only, to be complemented with morphology-based pre-annotation
- The royal subcorpus incorporates morphological annotations from the ETCSRI corpus. Note that ETCSRI data is different in transliteration and tokenization, and sometimes, in readings, and that annotations projected from ETCSRI to CDLI/ETCSANS data may be partially incorrect.

## history
## History

- 2021-12-08 v.0.1a: initial conversion of transaction subcorpus
- 2021-12-04 v.0.1: consolidated corpus repository, see linked submodules under [`dev/`](dev/) for their respective histories.

## acknowledgements
## Acknowledgements

funding by MTAAC
student support within GSoC
Expand Down