Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

reduce threshold for fuzzy matching in clean_state() #112

Merged
merged 2 commits into from
Jan 31, 2024

Conversation

MattCowgill
Copy link
Collaborator

@MattCowgill MattCowgill commented Jan 31, 2024

closes #111

clean_state() has an argument max_dist. A higher value permits 'fuzzier' matches, a lower value requires stricter matches.

This PR changes the default value of max_dist from 0.4 to 0.2. This will reduce spurious matches (as documented in the issue #111 ) at the potential cost of more failed 'true matches'. In my view this trade-off is worth it, and testing suggests this will not be a significant issue. Users remain able to manually specify max_dist if they do not want to accept the default value.

@MattCowgill MattCowgill requested a review from wfmackey January 31, 2024 05:34
Copy link
Collaborator

@wfmackey wfmackey left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

thank you @MattCowgill !

@wfmackey wfmackey merged commit e75f631 into master Jan 31, 2024
1 check passed
@MattCowgill MattCowgill deleted the stricter-fuzzy-match-in-clean-states branch January 31, 2024 06:33
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Unknown becomes NSW and Auckland becomes Aus
2 participants