Data annotation specifications

Specifications to guide annotations for a person tracking project

The information below is meant to guide discussion and to outline the points a professional annotation team would consider for a person detection annotation project.

General Comments

  • Only visible parts of people should be annotated.
  • Don't include bags, purses, baby carriages, shopping carts, etc. in a bounding box.
  • A person should keep the same identity even if they disappear and reappear several times throughout a video.
  • Don't annotate people who are very small or very blurry in the frame.

Annotation Format

CVAT supports multiple annotation formats that may be found here.

  • The annotation format is chosen by the annotator. One example is the CVAT XML file schema/metadata.

More information on the XML annotation format specifically may be found here. The link describes the tags that are present in the XML, what they mean, and demonstrates an annotation example using annotation boxes, polygons, etc.
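
As a rough illustration of how such an XML file could be read, the sketch below parses a small hand-written sample with Python's standard `xml.etree.ElementTree`. It assumes the track-based layout commonly seen in CVAT XML exports for video (`<track>` elements containing `<box>` elements with `xtl`, `ytl`, `xbr`, `ybr`, `occluded` and `keyframe` attributes). The sample values and the `read_boxes` helper are hypothetical, so check the tag and attribute names against the linked format description before relying on them.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample in the style of a CVAT XML video export; all values are made up.
SAMPLE_XML = """<annotations>
  <track id="0" label="person">
    <box frame="0" xtl="10.0" ytl="20.0" xbr="110.0" ybr="220.0"
         outside="0" occluded="0" keyframe="1"/>
    <box frame="1" xtl="12.0" ytl="21.0" xbr="112.0" ybr="221.0"
         outside="0" occluded="1" keyframe="0"/>
  </track>
</annotations>"""


def read_boxes(xml_text):
    """Return one dict per <box>, tagged with the id of its parent <track>."""
    root = ET.fromstring(xml_text)
    rows = []
    for track in root.iter("track"):
        for box in track.iter("box"):
            rows.append({
                "track_id": int(track.get("id")),
                "frame": int(box.get("frame")),
                "x1": float(box.get("xtl")),  # top left corner, horizontal
                "y1": float(box.get("ytl")),  # top left corner, vertical
                "x2": float(box.get("xbr")),  # bottom right corner, horizontal
                "y2": float(box.get("ybr")),  # bottom right corner, vertical
                # keyframe="1" is assumed to mark a manual box, "0" an interpolated one
                "keyframe": box.get("keyframe") == "1",
            })
    return rows


boxes = read_boxes(SAMPLE_XML)
```

Here the track id plays the role of the person's identity, so every box in one track belongs to the same person.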

Person Tracking

The annotation file should contain the following information per frame (from either manual or interpolated annotations):

| Annotation | Annotation Type | Encoded by |
| --- | --- | --- |
| Person (location) | Rectangular bounding box (x1, y1, x2, y2) | x1: horizontal coordinate of the top left corner; y1: vertical coordinate of the top left corner; x2: horizontal coordinate of the bottom right corner; y2: vertical coordinate of the bottom right corner |
| Identity | Number | Number indicating the person's identity (maintained over time). |
| Occlusion | Number | 0: person is not occluded; 1: person is partially occluded (<= 50%); 2: person is heavily occluded (> 50%) |
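
The per-frame fields listed above could be held in a small record type such as the following minimal sketch. The names `PersonAnnotation` and `occlusion_code` are hypothetical and not part of any tool. The sketch assumes image coordinates with the origin at the top left, so that x1 < x2 and y1 < y2, and it assumes that an occluded fraction of exactly zero maps to code 0.

```python
from dataclasses import dataclass

# Occlusion codes as defined in this specification.
NOT_OCCLUDED = 0
PARTIALLY_OCCLUDED = 1  # at most 50% of the person is hidden
HEAVILY_OCCLUDED = 2    # more than 50% of the person is hidden


@dataclass(frozen=True)
class PersonAnnotation:
    """One person in one frame (hypothetical container, not a CVAT type)."""
    frame: int
    identity: int  # stays the same for a person across the whole video
    x1: float      # top left corner, horizontal
    y1: float      # top left corner, vertical
    x2: float      # bottom right corner, horizontal
    y2: float      # bottom right corner, vertical
    occlusion: int

    def __post_init__(self):
        # Assumes a top-left image origin, so the box must have positive size.
        if not (self.x1 < self.x2 and self.y1 < self.y2):
            raise ValueError("expected x1 < x2 and y1 < y2")
        if self.occlusion not in (NOT_OCCLUDED, PARTIALLY_OCCLUDED, HEAVILY_OCCLUDED):
            raise ValueError("occlusion must be 0, 1 or 2")


def occlusion_code(hidden_fraction):
    """Map the hidden fraction of a person (0.0 to 1.0) to an occlusion code."""
    if not 0.0 <= hidden_fraction <= 1.0:
        raise ValueError("hidden_fraction must be between 0 and 1")
    if hidden_fraction == 0.0:
        return NOT_OCCLUDED
    if hidden_fraction <= 0.5:
        return PARTIALLY_OCCLUDED
    return HEAVILY_OCCLUDED


example = PersonAnnotation(frame=0, identity=7, x1=10, y1=20, x2=110, y2=220,
                           occlusion=occlusion_code(0.3))
```

Validating each record on creation is one way to catch swapped corners or out-of-range occlusion values before they reach a training set.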