Question about the face bounding box of VoxCeleb1 #20

Closed
HowieMa opened this issue Feb 21, 2023 · 3 comments

@HowieMa

HowieMa commented Feb 21, 2023

Hi,
Thanks for sharing such an amazing project! For pre-processing, you state that you follow the video-preprocessing scripts. However, their code uses the bounding boxes from the shared meta-csv file, which differ from the original bounding boxes of the VoxCeleb1 dataset, as discussed in issue21.

Therefore, may I ask which bounding box you use to obtain the cropped images?

In addition, the original VoxCeleb1 dataset contains 1k+ subjects in the training set and 40+ subjects in the test set.
However, the meta-file of video-preprocessing covers only 400+ subjects, which is a subset of the original VoxCeleb1 dataset.
Thus, may I ask which dataset you used for the training examples: the dataset with 1k+ subjects, or the subset from video-preprocessing?

Looking forward to your reply. Thanks in advance!

@NetEase-GameAI
Owner

I use the videos listed in the meta-csv file.

@HowieMa
Author

HowieMa commented Mar 20, 2023

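Thank you very much for your reply about the video selection; it is very helpful.
I would like to follow up on the first question above: which bounding box did you use to crop the face region from the original videos?
1) The one taken directly from "vox-metadata.csv" provided by video-preprocessing,
2) or the original bounding box officially provided by VoxCeleb1?

Specifically,
the bounding boxes officially provided by VoxCeleb1 are all square, and from your dataset demo it looks like you used the official square VoxCeleb1 boxes.

However, the meta csv file contains many non-square boxes, as in this issue. They resize the non-square boxes to squares, which introduces distortion. For example, for the image in your demo, directly using the meta.csv from video-preprocessing would give a heavily flattened image like this:
(image)
So I would like to confirm with you which bounding box you used when cropping the images. Thanks!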

@NetEase-GameAI
Owner

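I used the second one, but I modified the bounding boxes it provides. Instead of the direct resizing you describe, I add some padding, for example to the left and right or to the top and bottom of the original box, so that every crop comes out square.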
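
For readers who want to try this, the padding idea could be sketched roughly as follows. This is only an illustration under assumptions, not the code actually used in this project: the function name `pad_box_to_square`, the `(left, top, right, bottom)` integer pixel layout, and the choice to split the padding evenly between the two sides are all hypothetical.

```python
def pad_box_to_square(left, top, right, bottom):
    """Grow the shorter side of a box so that the result is square.

    Assumes integer pixel coordinates in (left, top, right, bottom) order.
    The even split of padding is an assumption made for illustration.
    """
    width = right - left
    height = bottom - top
    side = max(width, height)

    # Extra pixels needed along each axis (zero for the longer side).
    pad_w = side - width
    pad_h = side - height

    # Split the padding between both sides; an odd pixel goes right/bottom.
    left -= pad_w // 2
    right += pad_w - pad_w // 2
    top -= pad_h // 2
    bottom += pad_h - pad_h // 2
    return left, top, right, bottom


# A wide 200x100 box receives 50 px of padding above and below.
print(pad_box_to_square(10, 60, 210, 160))  # (10, 10, 210, 210)
```

Because the box is padded rather than stretched, the aspect ratio of the face is preserved. Note that a padded box can extend past the frame border; how to handle that case (clamping, shifting, or padding the frame) is not covered in this thread.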
