Add benchmarks #967

Closed
NickCrews opened this issue Feb 21, 2022 · 6 comments · Fixed by #1002

Comments

@NickCrews
Contributor

NickCrews commented Feb 21, 2022

Branched off of #965 (comment).

EDIT: See the next comment for using ASV instead of @profile.

Place @profile decorators on bottleneck functions using memory_profiler.

List this dependency as an extra, so that most users don't need to install it.

Also, to avoid the overhead of @profile always running (even when we don't want profiling), wrap it in our own custom decorator that is usually a no-op:

import os

from memory_profiler import profile


def dd_profile(func):
    # Maybe a better way to configure this? Would have to be at import time
    if os.environ.get("DEDUPE_PROFILE"):
        # Actually add the memory_profiler wrapper
        return profile(func)
    else:
        # no-op: return the function unchanged
        return func
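
For illustration, usage might look like this (score_records is just a hypothetical stand-in for a real bottleneck function, not an actual dedupe function):

@dd_profile
def score_records(record_pairs):
    # hypothetical bottleneck; only profiled when DEDUPE_PROFILE is set
    ...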

The next step is probably to make a new branch and apply this to some of the examples?

@NickCrews
Contributor Author

OK, I think a better option is Airspeed Velocity (ASV).

  • We don't have to modify the codebase at all; benchmarks live separately, like tests.
  • It measures time, memory, and custom metrics (e.g. we could measure accuracy).
  • It's used by big names like numpy, pandas, and scipy.
  • See the examples of static result websites at that link; it has built-in support for GitHub Pages.
  • It could run on every PR, or trigger only when we care, from a comment like @benchmark main..HEAD --bench benchmarks.TimeSuite.time_range, as pandas does.

Initially, I think we would get a lot of value out of measuring

  • time
  • peak memory
  • accuracy

on just one or two end-to-end runs (training and evaluating separately). We could either use the test datasets or the examples from dedupe-examples. A rough sketch of what this could look like is below.
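
To make that concrete, here is a sketch of what an ASV benchmark module could look like. run_matching is a made-up stand-in for a real end-to-end run against one of the example datasets, not the actual dedupe API:

# benchmarks/benchmarks.py


def run_matching():
    # Placeholder for an end-to-end run: train, block, and score records,
    # returning predicted and true duplicate pairs.
    predicted_pairs = {(1, 2), (3, 4)}
    true_pairs = {(1, 2), (3, 5)}
    return predicted_pairs, true_pairs


class EndToEndSuite:
    def time_end_to_end(self):
        # ASV times methods prefixed with time_
        run_matching()

    def peakmem_end_to_end(self):
        # ASV records peak memory for methods prefixed with peakmem_
        run_matching()

    def track_precision(self):
        # ASV tracks the return value of methods prefixed with track_
        predicted, true = run_matching()
        return len(predicted & true) / len(predicted)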

One tricky thing would be deciding how to store the results (e.g. the metrics collected from past runs). We would want a way to ensure a consistent machine (using GitHub Actions? and/or Docker?). Keep the results in this repo, or in a separate repo, possibly using submodules as they mention?

@fgregg @fjsj any thoughts here?

@fgregg
Contributor

fgregg commented Apr 27, 2022

I love this idea @NickCrews. If you want to try to get something rough and ready set up, I think that would be a really good step forward. If it's looking very valuable, then we will figure out the tricky bits.

@NickCrews
Contributor Author

Cool, it might be a bit, but I will try to get to this.

I'll start with the benchmarks in this repo, as those other packages do, and try to start without storing the state of past metrics. I'll decide at game time which example data to use; I haven't explored enough yet to see which I like better.

@NickCrews
Contributor Author

NickCrews commented Apr 28, 2022

@fgregg @fjsj I'm not well versed in entity resolution; any suggestions on what metrics I should use for "accuracy"? Based on the abstract of https://arxiv.org/abs/1509.04238 (I haven't read it yet, as I thought maybe you'd have pointers), it sounds like standard measures like F-score and precision might need to be tweaked a little. I can do my own research, but if you can tell me where to start, it would help. Thanks!

@fgregg
Contributor

fgregg commented Apr 28, 2022

I think precision and recall are really still the best ones.

@fgregg
Contributor

fgregg commented Apr 28, 2022

Look at canonical.py in tests to see how precision and recall are calculated there.
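
For anyone following along, a minimal sketch of pairwise precision and recall over sets of duplicate pairs (not necessarily how canonical.py computes them):

def pairwise_precision_recall(predicted_pairs, true_pairs):
    # Both arguments are sets of record-id pairs, e.g. frozensets or sorted tuples.
    true_positives = len(predicted_pairs & true_pairs)
    precision = true_positives / len(predicted_pairs) if predicted_pairs else 0.0
    recall = true_positives / len(true_pairs) if true_pairs else 0.0
    return precision, recall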

@NickCrews NickCrews changed the title Explore adding profiling code Add benchmarks Apr 28, 2022
@github-actions github-actions bot locked as resolved and limited conversation to collaborators May 22, 2022