Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Attempt to spread documents more evenly across annotators
Rather than picking the next document for each annotator completely at random, we now prefer documents that have fewer existing annotations. This is achieved by first sorting the list of documents by the number of COMPLETED+PENDING annotations and then randomizing only within each group, i.e. we first try (in random order) those documents with no existing annotations, then if none of those are suitable we try (again in random order) those documents with one annotation, then two, etc. until we either find a valid document to assign or run out of documents to try. The effect of this should be that at any given time the full set of documents should be "evenly" annotated, or as close as possible if the number of completed annotation does not divide evenly into num_docs*annotations_per_doc Fixes #372
- Loading branch information