Clarification on Timestamps for Foul Event Clips in Dataset #5

mmiakashs · 2024-04-04T04:17:38Z

The paper states: "For both classification tasks, we leverage clips of 16 frames, spanning temporally for 1 second, with a spatial dimension of 224×398 pixels. Specifically, the clips contain 8 frames before the foul and 8 frames after the foul." However, the annotations do not include timestamps indicating when the foul occurred. Could you please share how the 1-second clip was extracted?

heldJan · 2024-04-09T12:26:43Z

Hello. Each clip is 5 seconds long with 25 frames per second, totaling 125 frames. The point of contact typically occurs at the 75th frame. We trimmed the clips using --start_frame 63, --end_frame 87, and --fps 17 to capture only the frames where the foul takes place, but feel free to adjust the values to your needs.
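For readers who want to check the arithmetic, here is a minimal sketch of the frame window described above. The helper names `trim_window` and `subsample_evenly` are hypothetical and not taken from the repository. Two assumptions are made: the end frame is treated as exclusive, and the 16 frames are picked at even spacing. How the repository actually applies `--fps 17` is not stated in this thread, so treat the subsampling step as illustrative only.

```python
SOURCE_FPS = 25       # stated in the reply above
CLIP_SECONDS = 5      # stated in the reply above
CONTACT_FRAME = 75    # typical point of contact, per the reply above

TOTAL_FRAMES = SOURCE_FPS * CLIP_SECONDS  # 125 frames per clip


def trim_window(start_frame=63, end_frame=87):
    """Return the frame indices in [start_frame, end_frame).

    Assumption: end_frame is exclusive; the thread does not say.
    """
    return list(range(start_frame, end_frame))


def subsample_evenly(indices, num_frames=16):
    """Pick num_frames roughly evenly spaced entries from indices.

    Assumption: even spacing is one plausible way to get 16 frames;
    it is not confirmed to be what --fps 17 does in the repository.
    """
    if num_frames > len(indices):
        raise ValueError("window is shorter than the requested frame count")
    if num_frames == 1:
        return [indices[len(indices) // 2]]
    step = (len(indices) - 1) / (num_frames - 1)
    return [indices[round(i * step)] for i in range(num_frames)]


window = trim_window()
window_seconds = len(window) / SOURCE_FPS  # just under one second
selected = subsample_evenly(window)

print(f"clip frames: {TOTAL_FRAMES}")
print(f"window: {window[0]}..{window[-1]} ({window_seconds:.2f} s)")
print(f"selected frames: {selected}")
```

Under these assumptions the window is centered on frame 75, with 12 source frames on either side of the contact point. Shifting `start_frame` and `end_frame` by the same amount moves the window if the contact happens earlier or later in a given clip.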

heldJan self-assigned this Apr 10, 2024
