Feature: cross validate timings #233

alekseykalyagin · 2024-12-12T14:05:42Z

Added compute_timings argument to cross_validate

Closes #138

codecov · 2024-12-12T14:25:06Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 100.00%. Comparing base (9b3992e) to head (46e05c9).
Report is 86 commits behind head on main.

Additional details and impacted files

@@             Coverage Diff             @@
##              main      #233     +/-   ##
===========================================
  Coverage   100.00%   100.00%             
===========================================
  Files           45        59     +14     
  Lines         2242      3896   +1654     
===========================================
+ Hits          2242      3896   +1654

rectools/model_selection/cross_validate.py

tests/model_selection/test_cross_validate.py

blondered · 2025-01-10T09:07:46Z

rectools/model_selection/cross_validate.py

@@ -36,6 +57,7 @@ def cross_validate(  # pylint: disable=too-many-locals
    ref_models: tp.Optional[tp.List[str]] = None,
    validate_ref_models: bool = False,
    on_unsupported_targets: ErrorBehaviour = "warn",
+    compute_timings: bool = False,


Please add the new argument to the docstring.

Do we really need this param? What's wrong with always measuring the time?

blondered · 2025-01-10T09:08:11Z

rectools/model_selection/cross_validate.py

+    else:
+        yield
+
+
 def cross_validate(  # pylint: disable=too-many-locals


Please update CHANGELOG.MD

feldlime · 2025-01-10T10:07:02Z

rectools/model_selection/cross_validate.py

@@ -24,6 +25,26 @@
 from .splitter import Splitter


+@contextmanager
+def compute_timing(label: str, timings: tp.Optional[tp.Dict[str, float]] = None) -> tp.Iterator[None]:


It seems this function accepts the label and timings params but doesn't really need them.

Please rewrite it in one of the following ways:

Remove both params and simply return the elapsed time, without dictionaries

Rewrite it as a class

I personally prefer the second option since it's clearer.
Either way, let's not use the labels and the dictionary inside. We can easily fill them in outside the class.

An example (it's simplified a bit; please add init if required by linters, and also types):

class Timer:
    def __enter__(self):
        self._start = time.perf_counter()
        self._end = None
        return self

    def __exit__(self, *args):
        self._end = time.perf_counter()

    @property
    def elapsed(self):
        return self._end - self._start


with Timer() as timer:
    # code
    pass

fit_time = timer.elapsed
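A slightly fuller version of the Timer example, with an init and type hints as the comment asks for. This is only a sketch, not the code that ended up in the PR: the fit_model placeholder and the "fit_time" key are hypothetical, and raising an error when elapsed is read before the block finishes is an assumption about the desired behaviour.

```python
import time
import typing as tp
from types import TracebackType


class Timer:
    """Context manager that measures how long the enclosed block takes."""

    def __init__(self) -> None:
        self._start: tp.Optional[float] = None
        self._end: tp.Optional[float] = None

    def __enter__(self) -> "Timer":
        self._start = time.perf_counter()
        self._end = None
        return self

    def __exit__(
        self,
        exc_type: tp.Optional[tp.Type[BaseException]],
        exc: tp.Optional[BaseException],
        tb: tp.Optional[TracebackType],
    ) -> None:
        # Record the end time even if the block raised; the exception propagates.
        self._end = time.perf_counter()

    @property
    def elapsed(self) -> float:
        # Assumption: reading the value too early is treated as an error.
        if self._start is None or self._end is None:
            raise RuntimeError("Timer has not finished yet")
        return self._end - self._start


def fit_model() -> None:
    """Hypothetical stand-in for the real fitting step."""
    time.sleep(0.01)


# The dictionary is filled outside the class, so Timer knows nothing about labels.
timings: tp.Dict[str, float] = {}
with Timer() as timer:
    fit_model()
timings["fit_time"] = timer.elapsed
```

Keeping the label and the dictionary at the call site means the same Timer can be reused for fit and recommend steps without any change to the class.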

feldlime · 2025-01-10T10:09:13Z

rectools/model_selection/cross_validate.py

@@ -36,6 +57,7 @@ def cross_validate(  # pylint: disable=too-many-locals
    ref_models: tp.Optional[tp.List[str]] = None,
    validate_ref_models: bool = False,
    on_unsupported_targets: ErrorBehaviour = "warn",
+    compute_timings: bool = False,


Do we really need this param? What's wrong with always measuring the time?

feldlime · 2025-01-10T10:09:47Z

rectools/model_selection/cross_validate.py

+        Dictionary to store the timing results. If None, timing is not recorded.
+    """
+    if timings is not None:
+        start_time = time.time()


Please use time.perf_counter instead; it's better suited to measuring time intervals.
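As a rough illustration of why, the standard library can report the properties of each clock: time.get_clock_info shows whether a clock is monotonic (it never goes backwards), which is what matters for intervals, whereas time.time follows the system clock and can jump when that clock is adjusted. The measure helper below is hypothetical and only there to show the usual start/stop pattern.

```python
import time
import typing as tp

# Inspect the two clocks; the exact resolution values are platform-dependent.
perf_info = time.get_clock_info("perf_counter")
wall_info = time.get_clock_info("time")
print(f"perf_counter: monotonic={perf_info.monotonic}, resolution={perf_info.resolution}")
print(f"time:         monotonic={wall_info.monotonic}, resolution={wall_info.resolution}")


def measure(func: tp.Callable[[], tp.Any]) -> float:
    """Hypothetical helper: return the elapsed seconds for one call of func."""
    start = time.perf_counter()
    func()
    return time.perf_counter() - start


elapsed = measure(lambda: sum(range(100_000)))
```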

feldlime · 2025-01-10T10:14:06Z

rectools/model_selection/cross_validate.py

+    if timings is not None:
+        start_time = time.time()
+        yield
+        timings[label] = round(time.time() - start_time, 5)


Please don't do this

If we need to format the values somehow it should always be done separately. We should separate the computing level and presentation level. From the computing level we should always return the raw values.

In this specific case I think we shouldn't format the value at all. I don't see much sense in it, also we're not doing this for other metrics.
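A minimal sketch of that separation, using hypothetical names (run_step, format_seconds) that are not part of the PR: the computing function returns the raw float, and rounding happens only at the point where a value is displayed.

```python
import time
import typing as tp


def run_step(func: tp.Callable[[], tp.Any]) -> float:
    """Computing level: return the raw elapsed time in seconds, unrounded."""
    start = time.perf_counter()
    func()
    return time.perf_counter() - start


def format_seconds(value: float, digits: int = 5) -> str:
    """Presentation level: format a raw value only when it is shown."""
    return f"{value:.{digits}f} s"


raw_time = run_step(lambda: sorted(range(10_000), reverse=True))
print(format_seconds(raw_time))
```

Callers that aggregate timings across folds then work with full precision and can choose their own formatting, or none at all.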

alekseykalyagin and others added 2 commits December 12, 2024 16:55

Add compute_timing arg for cross_validate function

2b95c08

Merge branch 'MobileTeleSystems:main' into main

e474430

alekseykalyagin added 2 commits December 13, 2024 12:35

Update compute timing using contextmanager

b05fbe1

Remove unused variable

a64db58

blondered reviewed Dec 13, 2024

rectools/model_selection/cross_validate.py (outdated, resolved)

Remove fit_recommend function

85c1604

blondered reviewed Dec 18, 2024

fix comments

5afc0e4

blondered reviewed Dec 20, 2024

rectools/model_selection/cross_validate.py (outdated, resolved)

tests/model_selection/test_cross_validate.py (outdated, resolved)

tests/model_selection/test_cross_validate.py (outdated, resolved)

fix comments

ce5f1c0

blondered reviewed Dec 24, 2024

tests/model_selection/test_cross_validate.py (outdated, resolved)

fix comments

46e05c9

blondered reviewed Jan 10, 2025

blondered requested a review from feldlime January 10, 2025 09:25

feldlime requested changes Jan 10, 2025
