Release v3.7.0 · moj-analytical-services/splink

What's Changed

Adjust caching for our concat tables by @ThomasHepworth in #1013
_initialise_df_concat optionally returns list by @RobinL in #1023
Df concat and df concat with tf return SplinkDataframe or None by @RobinL in #1033
[Docs] Add a dev guide for creating new ComparisonLevels and Comparisons to Splink libraries by @ADBond in #1041
correct module name duckb_base -> duckdb_base by @ADBond in #1046
Some cache tests by @ADBond in #1050
Improving the cache and make cache invalidation easier and more robust by @RobinL in #987
Bump version to 3.7.0 by @RobinL in #1056
Release 3.7.0 as dev version by @RobinL in #1057
Adds the ability to read directly from a settings filepath by @ThomasHepworth in #1062
Add code to produce tf cols from concat_with_tf by @ThomasHepworth in #1065
Use Ruff as a linter by @NickCrews in #1004
cast all value values to varchar by @ThomasHepworth in #1049
Automatically add tables of comparison (level)s compatible with each dialect to docs by @ADBond in #1035
add the ability to pass pandas df into the SparkLinker by @ThomasHepworth in #1068
Ruff by @ThomasHepworth in #1070
Replace flake8 with ruff as our main linter by @ThomasHepworth in #1071
Loosen dependency ranges by @NickCrews in #1080
add new award by @RossKen in #1081
Merge load settings methods by @ThomasHepworth in #1078
Fix docs workflow by @ADBond in #1073
Add docs for testing and creating a venv by @ThomasHepworth in #1086
Added ability to profile nested lists by @zslade in #1074
Workflow test multiple python versions by @ADBond in #1090
WIP: Update new_library_comparisons_and_levels.md by @RossKen in #1082
Added error message to catch pandas null casting issue when read into duckdb by @zslade in #1098
add a bash script for linting by @ThomasHepworth in #1100
parametrize datediff tests to clean them up by @ThomasHepworth in #1101
update with parametrize to test more file loading options by @ThomasHepworth in #1105
Improve citation by @RobinL in #1108
Simplify specific implementations of SplinkDataframe by @RobinL in #1116
Create Distance in KM Comparison library function by @RossKen in #1117
Rename target_rows argument to max_pairs in estimate_u_using_random_sampling() by @NickCrews in #1087
Create wrapper function for date comparisons by @RossKen in #1094
Rename target rows as max_pairs by @RossKen in #1119
Small Fixes by @NickCrews in #1115
Fix benchmark comment action to work better with forks by @ADBond in #1122
Release 3.7.0 proper by @ADBond in #1126

Full Changelog: v3.6.0...v3.7.0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v3.7.0

What's Changed

Contributors