Add context-based post processing for linear features #342

rwood-97 · 2024-01-18T11:23:34Z

Summary

As per #339, this PR implements a post-processing script so that users can filter out false positives.
This works for linear features or anything where you expect multiple patches to be clustered but solo patches would be false positive.
It also adds a save_predictions() method the classifier to make sure predictions and confidence scores are saved in format expected for post-processing.

Fixes #218
Addresses #339

Checklist before assigning a reviewer (update as needed)

Allow users to pick lowest conf for which to change label
Check for edge cases - overlapping patches and non-square patches
Self-review code
Ensure submission passes current tests
Add tests
Update relevant docs

Reviewer checklist

Please add anything you want reviewers to specifically focus/comment on.

Everything looks ok?

codecov-commenter · 2024-01-18T11:55:59Z

Codecov Report

Attention: 11 lines in your changes are missing coverage. Please review.

Comparison is base (b52086e) 59.64% compared to head (f668a73) 60.49%.
Report is 2 commits behind head on main.

❗ Current head f668a73 differs from pull request most recent head 777c857. Consider uploading reports for the commit 777c857 to get more accurate results

Files	Patch %	Lines
mapreader/classify/classifier.py	23.07%	10 Missing ⚠️
mapreader/process/post_process.py	98.50%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #342      +/-   ##
==========================================
+ Coverage   59.64%   60.49%   +0.85%     
==========================================
  Files          35       37       +2     
  Lines        6165     6334     +169     
==========================================
+ Hits         3677     3832     +155     
- Misses       2488     2502      +14

Flag	Coverage Δ
unittests	`60.49% <93.60%> (+0.85%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

rwood-97 · 2024-01-18T13:23:37Z

To run, first run inference on some patches and save outputs by calling my_classifier.save_predictions(set_name="infer") with your chosen dataset.

Then:

import pandas as pd

from mapreader.process.post_process import PatchDataFrame

df = pd.read_csv("./predictions_patch_df.csv", index_col=0)

labels_map = {
    0: "no",
    1: "railspace",
    2: "building",
    3: "railspace+building"
}

patches = PatchDataFrame(df, labels_map=labels_map)
patches.get_context(labels=["railspace", "railspace+building",])
patches.update_preds(remap={"railspace": "no", "railspace+building": "building"}, conf=0.8)

This will select all railspace/railspace+building patches, get their context, then update predictions for patches with no surrounding railspace and confidence score of less than 0.8.

Can also set remap to be new labels completely (e.g. {"railspace": "check me", "railspace+building": "check me"}).

mapreader/process/post_process.py

rwood-97 · 2024-01-30T09:43:48Z

See here for stats on post-processing https://github.com/Living-with-machines/railspace/issues/14

edwardchalstrey1 · 2024-02-02T16:02:54Z

docs/source/User-guide/Post-process.rst

-TBC
+MapReader post-processing's sub-package currently contains one method for post-processing the predictions from your model based on the idea that features such as railways, roads, coastlines, etc. are continuous and so patches with these labels should be found near to other patches also with these labels.
+
+For example, if a patch is predicted to be a railspace, but is surrounded by patches predicted to be non-railspace, then it is likely that the railspace patch is a false positive.


I guess you could be even more explicit and say: "The current method checks whether any of the 8 surrounding patches have the same label as a given patch (e.g. railspace), and if not, assumes this to be a false positive".

Perhaps could also mention: "Future releases may add functionality to create custom filter rules for your use case"

edwardchalstrey1

One comment, but otherwise LGTM

rwood-97 added 3 commits January 17, 2024 13:56

enable easier saving of predictions to csv

02e2436

add post processing script

60641bb

add docstrings, allow user to specify conf

f6f5e89

skip edge patches, allow new labels

08136a4

rwood-97 requested a review from edwardchalstrey1 January 18, 2024 13:24

edwardchalstrey1 reviewed Jan 18, 2024

View reviewed changes

mapreader/process/post_process.py Show resolved Hide resolved

edwardchalstrey1 reviewed Jan 18, 2024

View reviewed changes

mapreader/process/post_process.py Outdated Show resolved Hide resolved

rwood-97 linked an issue Jan 25, 2024 that may be closed by this pull request

Post processing of predicted labels using patch context #339

Closed

force image_id index

9b9003c

rwood-97 mentioned this pull request Jan 30, 2024

Consider adding further filtering rules in context-based post processing #344

Open

rwood-97 added 2 commits January 30, 2024 11:45

add tests

3c58460

Add post-processing docs

f668a73

rwood-97 requested a review from edwardchalstrey1 February 2, 2024 15:19

edwardchalstrey1 reviewed Feb 2, 2024

View reviewed changes

edwardchalstrey1 approved these changes Feb 2, 2024

View reviewed changes

rwood-97 added 2 commits February 5, 2024 13:19

add suggestion

1abce20

Merge branch 'main' into 339-postproc

777c857

rwood-97 merged commit 033917f into main Feb 5, 2024
3 checks passed

rwood-97 deleted the 339-postproc branch February 5, 2024 13:32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add context-based post processing for linear features #342

Add context-based post processing for linear features #342

rwood-97 commented Jan 18, 2024 •

edited by edwardchalstrey1

Loading

codecov-commenter commented Jan 18, 2024 •

edited

Loading

rwood-97 commented Jan 18, 2024

rwood-97 commented Jan 30, 2024

edwardchalstrey1 Feb 2, 2024

edwardchalstrey1 left a comment

Add context-based post processing for linear features #342

Add context-based post processing for linear features #342

Conversation

rwood-97 commented Jan 18, 2024 • edited by edwardchalstrey1 Loading

Summary

Checklist before assigning a reviewer (update as needed)

Reviewer checklist

codecov-commenter commented Jan 18, 2024 • edited Loading

Codecov Report

rwood-97 commented Jan 18, 2024

rwood-97 commented Jan 30, 2024

edwardchalstrey1 Feb 2, 2024

Choose a reason for hiding this comment

edwardchalstrey1 left a comment

Choose a reason for hiding this comment

rwood-97 commented Jan 18, 2024 •

edited by edwardchalstrey1

Loading

codecov-commenter commented Jan 18, 2024 •

edited

Loading