Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dataset size #13

Open
khanwa opened this issue Nov 23, 2023 · 6 comments
Open

Dataset size #13

khanwa opened this issue Nov 23, 2023 · 6 comments

Comments

@khanwa
Copy link

khanwa commented Nov 23, 2023

Hello,
You have mentioned that BinKit 2.0 has 371,928 binaries, however, the Zip file download from the drive contains ~213K files. Could you please clarify?

Thank you

@topcue
Copy link
Collaborator

topcue commented Nov 26, 2023

Hello @khanwa.

The size of BinKit 2.0 dataset is 10G. After checking again, there seems to be no problem with the BinKit 2.0 dataset link in README.

Could you check to see if there was an interruption while downloading the BinKit dataset?
Thank you.

@Rroscha
Copy link

Rroscha commented Dec 5, 2023

Hello,
I have the same problem that BinKit 2.0 has only 213K binary files. And there are only 50 (not 51) projects.

Thank you.

@topcue
Copy link
Collaborator

topcue commented Dec 11, 2023

Hello.
We will check again and respond as quickly as possible.

Thank you

@Rroscha
Copy link

Rroscha commented Dec 11, 2023

Hello. We will check again and respond as quickly as possible.

Thank you

Thank you very much.

@topcue
Copy link
Collaborator

topcue commented Dec 13, 2023

BinKit 1.0 provided precompiled Normal(O0, O1, O2, O3), SizeOpt(Os), Noinline, PIE, LTO, and Obfus datasets. However, BinKit 2.0 only provides precompiled extended compiler versions and optimization level options (O0, O1, O2, O3, Os, Ofast).
(Noinline, PIE or NOPIE, LTO, Obfus dataset can be built directly using a script. but does not provide precompile dataset).

README's '371K binary files' is the total of distinct binaries from BinKit 2.0's optimization level options (O0-O3, Os, Ofast) and BinKit 1.0's Noinline, PIE, LTO, and Obfus dataset.

To reduce this confusion, we plan to provide additional precompiled datasets for options such as NOPIE and LTO for the expanded compiler versions of BinKit 2.0.

Thank you

@khanwa
Copy link
Author

khanwa commented Dec 13, 2023

Thank you very much.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants