v3.7.0
What's Changed
- Adjust caching for our concat tables by @ThomasHepworth in #1013
- _initialise_df_concat optionally returns list by @RobinL in #1023
- Df concat and df concat with tf return SplinkDataframe or None by @RobinL in #1033
- [Docs] Add a dev guide for creating new ComparisonLevels and Comparisons to Splink libraries by @ADBond in #1041
- correct module name duckb_base -> duckdb_base by @ADBond in #1046
- Some cache tests by @ADBond in #1050
- Improving the cache and make cache invalidation easier and more robust by @RobinL in #987
- Bump version to 3.7.0 by @RobinL in #1056
- Release 3.7.0 as dev version by @RobinL in #1057
- Adds the ability to read directly from a settings filepath by @ThomasHepworth in #1062
- Add code to produce tf cols from concat_with_tf by @ThomasHepworth in #1065
- Use Ruff as a linter by @NickCrews in #1004
- cast all value values to varchar by @ThomasHepworth in #1049
- Automatically add tables of comparison (level)s compatible with each dialect to docs by @ADBond in #1035
- add the ability to pass pandas df into the SparkLinker by @ThomasHepworth in #1068
- Ruff by @ThomasHepworth in #1070
- Replace flake8 with ruff as our main linter by @ThomasHepworth in #1071
- Loosen dependency ranges by @NickCrews in #1080
- add new award by @RossKen in #1081
- Merge load settings methods by @ThomasHepworth in #1078
- Fix docs workflow by @ADBond in #1073
- Add docs for testing and creating a venv by @ThomasHepworth in #1086
- Added ability to profile nested lists by @zslade in #1074
- Workflow test multiple python versions by @ADBond in #1090
- WIP: Update new_library_comparisons_and_levels.md by @RossKen in #1082
- Added error message to catch pandas null casting issue when read into duckdb by @zslade in #1098
- add a bash script for linting by @ThomasHepworth in #1100
- parametrize datediff tests to clean them up by @ThomasHepworth in #1101
- update with parametrize to test more file loading options by @ThomasHepworth in #1105
- Improve citation by @RobinL in #1108
- Simplify specific implementations of SplinkDataframe by @RobinL in #1116
- Create Distance in KM Comparison library function by @RossKen in #1117
- Rename target_rows argument to max_pairs in estimate_u_using_random_sampling() by @NickCrews in #1087
- Create wrapper function for date comparisons by @RossKen in #1094
- Rename target rows as max_pairs by @RossKen in #1119
- Small Fixes by @NickCrews in #1115
- Fix benchmark comment action to work better with forks by @ADBond in #1122
- Release 3.7.0 proper by @ADBond in #1126
Full Changelog: v3.6.0...v3.7.0