Speedup 05: Retrieve unique detections in family and in `matched_filter` #527

flixha · 2022-12-12T09:57:35Z

What does this PR do?

core.match_filter.family._uniq: 1.9x speedup
- retrieving the unique list of detections is quicker for many detections with list(set) (1.9x speedup for 43000 detections, fastest: 3.1 s), but 1.2x slower for small sets (e.g., 430 detections; 50 ms --> 27 ms).
core.match_filter.detect - 1000x speed up for many calls to family._uniq
- using family._uniq in a loop over all families is still rather slow with _uniq. Checking tuples of (detection.id, detection.detect_time, detection.detect_val) with numpy.unique and avoiding a loop is 1000x faster. From 752 s to <1 s for 82000 detections.

Why was it initiated? Any relevant Issues?

Retrieving unique detections for matched_filter-run was getting slower than needed when there are a lot of detections.

This PR contributes to the summary issue in #522

PR Checklist

develop base branch selected?
This PR is not directly related to an existing issue (which has no PR yet).
All tests still pass.
~~- [ ] Any new features or fixed regressions are be covered via new tests.~~
~~- [] Any new or changed features have are fully documented.~~
Significant changes have been added to CHANGES.md.
~~- [ ] First time contributors have added your name to CONTRIBUTORS.md.~~

calum-chamberlain

It looks like the changes to _uniq shouldn't work, and if they do then we should make sure they don't... Otherwise this is a really useful speedup!

eqcorrscan/core/match_filter/family.py

eqcorrscan/core/match_filter/tribe.py

…ction should be unhashable

calum-chamberlain

Golden, thanks!

flixha added 3 commits December 12, 2022 10:45

speed up unique detection list making

8d16563

add changelog, remove commented lines

316461a

pycodestyle

a31090c

flixha mentioned this pull request Dec 12, 2022

WIP: Speed up a few slowdowns when handling large datasets #522

Open

Merge branch 'develop' into speedup_05_uniq_detections_in_family

c725528

calum-chamberlain requested changes Jan 1, 2023

View reviewed changes

eqcorrscan/core/match_filter/family.py Outdated Show resolved Hide resolved

eqcorrscan/core/match_filter/tribe.py Show resolved Hide resolved

flixha and others added 4 commits January 3, 2023 14:41

revert change for uniq() with set() that should not work because dete…

56ea322

…ction should be unhashable

Merge branch 'develop' into speedup_05_uniq_detections_in_family

95219d5

fix reversion to set

c359fde

Merge branch 'develop' into speedup_05_uniq_detections_in_family

59a31f5

calum-chamberlain approved these changes Jan 3, 2023

View reviewed changes

calum-chamberlain merged commit 8e5ca9d into eqcorrscan:develop Jan 3, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Speedup 05: Retrieve unique detections in family and in `matched_filter` #527

Speedup 05: Retrieve unique detections in family and in `matched_filter` #527

flixha commented Dec 12, 2022 •

edited

Loading

calum-chamberlain left a comment

calum-chamberlain left a comment

Speedup 05: Retrieve unique detections in family and in matched_filter #527

Speedup 05: Retrieve unique detections in family and in matched_filter #527

Conversation

flixha commented Dec 12, 2022 • edited Loading

What does this PR do?

Why was it initiated? Any relevant Issues?

PR Checklist

calum-chamberlain left a comment

Choose a reason for hiding this comment

calum-chamberlain left a comment

Choose a reason for hiding this comment

Speedup 05: Retrieve unique detections in family and in `matched_filter` #527

Speedup 05: Retrieve unique detections in family and in `matched_filter` #527

flixha commented Dec 12, 2022 •

edited

Loading