Test and fix debug mode #2481

ADBond · 2024-10-22T17:33:47Z

Some tests of various bits of functionality with debug_mode switched on. The tests are structured so that we can check that issues arise specifically when debug_mode is on: if a test also fails with debug_mode off, a different error is flagged, to help isolate the cause. These can be expanded to help diagnose issues; for now they are just a starting point.

This will also include fixes for the failing tests, which will be opened as separate PRs into this branch.

Partial work towards #2429. This tackles 'does stuff work when debug_mode is switched on', but doesn't address 'is debug_mode doing what we think it is'.
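For anyone skimming, here is a minimal sketch of the test structure described above. It is not the decorator in this PR: `Settings`, `check_debug_mode`, and `DebugModeOnlyFailure` are hypothetical names made up for illustration, and the real tests run against Splink linkers rather than a toy object. The idea is only that each check runs once with debug_mode off and once with it on, and the two kinds of failure surface as different errors.

```python
import functools


class DebugModeOnlyFailure(AssertionError):
    """Hypothetical error: the check passes with debug_mode off, fails with it on."""


class Settings:
    """Hypothetical stand-in for whatever object carries the debug_mode flag."""

    def __init__(self):
        self.debug_mode = False


def check_debug_mode(test_fn):
    """Run test_fn with debug_mode off, then again with debug_mode on."""

    @functools.wraps(test_fn)
    def wrapper(settings, *args, **kwargs):
        # Baseline run: a failure here is unrelated to debug_mode,
        # so the original exception propagates unchanged.
        settings.debug_mode = False
        test_fn(settings, *args, **kwargs)

        # Debug run: a failure here is specific to debug_mode,
        # so it is re-raised as a distinct error type.
        settings.debug_mode = True
        try:
            test_fn(settings, *args, **kwargs)
        except Exception as e:
            raise DebugModeOnlyFailure(
                f"{test_fn.__name__} fails only with debug_mode on: {e!r}"
            ) from e
        finally:
            # Leave the flag off so later checks start from a clean state.
            settings.debug_mode = False

    return wrapper


@check_debug_mode
def check_row_count(settings):
    # Toy check that breaks only when debug_mode is on.
    rows = [1, 2] if settings.debug_mode else [1, 2, 3]
    assert len(rows) == 3
```

Calling `check_row_count(Settings())` raises `DebugModeOnlyFailure`, whereas a check that is broken regardless of the flag raises its own original exception from the baseline run.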

ADBond · 2024-10-22T18:09:01Z

~~Oddly, the tests that fail are passing in sqlite~~ - it makes sense that the tests are okay in sqlite: there is no salting, and sampling uses an absolute number rather than a proportion.

Really, it's slightly odd that the postgres tests fail, but that looks like a separate issue.

without this, tests become coupled - specifically failures in `tests/test_debug_mode.py::test_debug_mode_clustering` and `tests/test_debug_mode.py::test_debug_mode_cluster_studio` cause failures in `tests/test_u_train.py::test_u_train_link_only` if run beforehand in the same test session

ADBond · 2024-11-11T20:20:09Z

@RobinL - there's nothing particularly major here, but might be worth a glance over.

All of the 'real' changes are in separate PRs, each with some explanation, should you feel the need to look. Beyond that, it might also be useful to look at the test decorator for checking issues with functionality in debug mode, to see whether that seems a sensible way to go about it.

RobinL

Thanks - had a quick look. Looks good to me. I liked the format of the PR, with the links to component PRs, made it easy to understand!

ADBond added 4 commits October 22, 2024 11:30

some basic debug mode tests

325aa6c

don't use parametrize - wrap tests together

2c95968

more debug mode tests

d518f01

couple more tests, necessitating some extra flex in decorator

e533699

ADBond added the bug, testing, and debug_mode labels Oct 22, 2024

ADBond added 14 commits October 23, 2024 13:12

don't select columns we don't need

7ce6f0a

this double-selection (of count_l, count_r - once each in * and once each explicitly) can cause errors in some dialects, in some execution modes

Merge pull request #2484 from moj-analytical-services/bug/block-counts-select

a9eee44

Explicit selection

stable table with iteration name

84c400c

this prevents looking up the wrong table when using debug mode

Merge pull request #2485 from moj-analytical-services/bug/debug-mode-broken-clustering

dadf79c

Fix clustering in debug mode

lint

608f874

enqueue_df_concat - don't use cache in debug mode

0442718

Revert "enqueue_df_concat - don't use cache in debug mode"

5bdd456

This reverts commit 0442718.

debug mode - drop any caching that comes via sql_pipeline_to_splink_dataframe

418085b

Merge pull request #2488 from moj-analytical-services/bug/estimate-u-debug-mode

a8d33ec

Less caching in debug mode

Merge branch 'master' into bug/fixup-debug-mode

c035ea0

sparkapi in tests use checkpoint break lineage method

7213eb5

circumvents an issue with parquet method, where empty tables may be cleaned up before they are queried, particularly problematic in debug mode

Merge pull request #2504 from moj-analytical-services/bug/spark-test-session-handling

1047b6b

Spark test session handling

Merge branch 'master' into bug/fixup-debug-mode

938072c

ADBond marked this pull request as ready for review November 11, 2024 15:48

RobinL approved these changes Nov 12, 2024

ADBond merged commit 2f67811 into master Nov 12, 2024
25 checks passed

ADBond deleted the bug/fixup-debug-mode branch November 12, 2024 10:17

This was referenced Nov 12, 2024

Clustering gives the wrong answer if debug_mode is on #2480

Closed

debug_mode breaks training workflow #2428

Closed
