V4 Wishlist #110
Comments
Reading the README, I don't see any mention of processing compressed files.
But what if I'm going to ignore several lines anyway?
Hi there, I would love to know what the current hashing algorithms are. As for automata-based fuzzy searching, will it outperform the current string search algorithm, on paper and by design? Thanks!
@happysalada, searching through compressed data is an attractive feature proposition. I've been thinking about it a lot over the years, but it's not trivial for most compression types. I will keep it in mind.
@0xqd, we currently implement Rabin-style hashing and fingerprinting, documented here. The header file also provides some details: StringZilla/include/stringzilla/stringzilla.h, lines 925 to 951 in 57209cb.
I am looking into alternative algorithms as well, but I want the primary hash and the rolling hash to use the same schema.
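For anyone following along, here is a minimal sketch of what "the same schema" for a primary hash and a rolling hash can look like with a Rabin-style polynomial hash. The base, the modulus, and the function names `rabin_hash` and `rolling_hashes` are assumptions made for illustration; they are not StringZilla's actual constants or API.

```python
# Illustrative constants only, NOT the values used by StringZilla.
BASE = 257
MOD = (1 << 61) - 1  # a Mersenne prime, chosen here for convenience


def rabin_hash(data: bytes) -> int:
    """Primary hash: evaluate the bytes as a polynomial in BASE, modulo MOD."""
    h = 0
    for byte in data:
        h = (h * BASE + byte) % MOD
    return h


def rolling_hashes(data: bytes, window: int):
    """Yield the hash of every `window`-byte slice, updating in O(1) per step."""
    if window <= 0 or window > len(data):
        return
    # Weight of the byte that leaves the window on each step.
    high = pow(BASE, window - 1, MOD)
    h = rabin_hash(data[:window])
    yield h
    for i in range(window, len(data)):
        # Drop the outgoing byte, shift, then append the incoming byte.
        h = (h - data[i - window] * high) % MOD
        h = (h * BASE + data[i]) % MOD
        yield h
```

Because both functions evaluate the same polynomial, each rolled value equals `rabin_hash` of the corresponding slice, so fingerprints computed either way are directly comparable.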
Features
Breaking naming and organizational changes
- `edit_distance` to `levenshtein_distance`, to match Hamming

Any other requests?