Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update scripts to not do fuzzy match on User names #10

Open
briri opened this issue Aug 22, 2018 · 0 comments
Open

Update scripts to not do fuzzy match on User names #10

briri opened this issue Aug 22, 2018 · 0 comments
Labels
bug Something isn't working data collection

Comments

@briri
Copy link
Collaborator

briri commented Aug 22, 2018

Fuzzy match is currently getting false positives for Users. For example: 'Dr. John Doe' is getting matched to 'John Blah' because of the percentage of matching words.

We should setup stop words for names (e.g. Dr., PhD, etc.) and then assure that the first+last name match 100%. (this is in lieu of matching identifiers)

@briri briri added bug Something isn't working data collection labels Aug 22, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working data collection
Projects
None yet
Development

No branches or pull requests

1 participant