-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Performance improvements for index generation #202
Conversation
Codecov ReportAll modified and coverable lines are covered by tests ✅
Additional details and impacted files@@ Coverage Diff @@
## main #202 +/- ##
=======================================
Coverage 99.74% 99.74%
=======================================
Files 24 24
Lines 1166 1176 +10
=======================================
+ Hits 1163 1173 +10
Misses 3 3 ☔ View full report in Codecov by Sentry. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for spending some time on the groupby suggestion, I think this looks good!
Change Description
Related to astronomy-commons/lsdb#108
This improves the performance of generating indexes for very large datasets.
drop_duplicates
operationpytest
on epyc: the last test would often fail as a timeout.