@cached_var may store wrong values if method has arguments #148

mensch72 · 2021-02-25T18:02:46Z

A method decorated with @cached_var whose return value depends on some method arguments will wrongly return the cached results of the first call even for calls with different argument values!

lenas95 · 2022-03-17T12:31:48Z

Hi, could you elaborate which method exactly? And what your output was?

Very much appreciated

ntfrgl · 2022-04-08T02:40:46Z

This seems like a bug in core.network.cache_helper. All uses of @cached_const and @cached_var should be replaced by standard library implementations of memoization, taking care to use only functions available in the lowest Python version supported by pyunicorn.
https://docs.python.org/3/library/functools.html

ntfrgl · 2023-06-27T06:52:20Z

As of 4cf6969, the minimal supported Python version is 3.8, and @functools.cached_property() from the standard library should replace the aforementioned helper functions.

- replace `@cached_var` with `@functools.lru_cache` and `@cached_const` with `@functools.cached_property` - refactor calls of any `cached_property`, now not callable anymore, substitute with `self.set_link_attribute(property)` where necessary - remove `cache_helper` and `cache` attribute in `core.network` - adapt `clear_cache()` methods accordingly, introduce `clear_cache_startswith()` - remove `Network.clear_link_attribute()` - adapt `Network.{del|set}_link_attribute()` accordingly, drop clearing cache of `path_lengths()` in `set_link_attribute()` - related issue: #148 - locally disable pylint false positive `not-callable` on `multiprocess.Pool()`

fkuehlein · 2023-11-28T16:53:15Z

hi @ntfrgl,

I had started getting into this but it turned out to be a bit more work than I had first thought. With fe23f23 I basically substituted the caching in core.Network with functools memoization methods as you proposed, see commit message for details. Some additional notes on that:

methods decorated with functools.cached_property() are now class properties and thus not callable, which slightly alters their handling and entailed a bit of refactoring. I could resolve that in all cases except for test_network.compare_measures() and test_network.compare_nsi(). I therefore pytest.mark.skip'd the tests that depend on those functions. Any ideas on how to resolve that? Or should we just use lru_cache throughout?
functools.lru_cache(), on the other hand, requires hashable objects as parameters. This would currently only be a problem for Network.{nsi_}interregional_betweenness because they have list arguments. I just left the @cache decorator commented out on those, as for some reason had already been done in f4df6c6. Maybe that's useless and can be dropped, what do you think?
I subsitituted the Network.clear_cache() method and its helper methods accordingly. In that process, I realized that other classes have their own clear_cache() methods, some inheriting from Network, but many also using their own caching system, which is basically just creating and deleting class attributes and setting self._{...}_cached-flags (as in RecurrencePlot.clear_cache()). This could also be substituted with functools, right? Anyway, that might exceed the scope of this issue.
All tests are passing now, except for the skipped ones, but I'll make sure to add some more before merging anything.

Any other objections/additions/comments from your side?

- replace removed `clear_paths_cache()` with new `clear_cache_startswith('path_')` - related: #148

ntfrgl · 2023-12-02T03:00:57Z

Thank you for making progress on this issue! Your approach mostly follows the recommendations in the official documentation, which suggest that @functools.cached_property() and @functools.lru_cache() could roughly replace @cached_const() and @cached_var().
https://docs.python.org/3/faq/programming.html#faq-cache-method-calls

However, as you noted, a direct replacement conflicts with our existing API. Let me comment in more detail on our present requirements for a purely internal fix without any public API changes, and refine my suggestions accordingly.
- Even when the affected method has no arguments, what we need is a cache for return values, while the user-facing methods should remain methods. Furthermore, the logging behaviour previously defined in cache_helper() via the msg argument should be retained.
- The documentation above points out that a @functools.cached_property() cannot detect changes to instance attributes it depends on, which means that the correctness of cache invalidation relies on every possible mutation of instance attributes to also manually delete the cached property. This is a likely source of user errors.
- More generally, let us note that the core problem in this issue, as well as in the one referenced in (2), is a consistent approach to cache invalidation, for a library in which recomputing network metrics after changing network attributes is a common use case.
- Overall, I therefore think that the most ergonomic solution would be a caching decorator which unifies the roles of @cached_const() and @cached_var(), which uses @functools.lru_cache() internally with a configurable cache size, which takes care of logging as before, and which assumes that, as explained in the documentation above, the class owning the decorated method has well-defined __eq__() and __hash__() methods. All of our classes with custom caching mechanisms would require a little work to make this assumption valid, but that would then be the canonical solution to the cache invalidation problem. In particular, this will constitute an executable specification of all non-derived mutable properties in the user-facing classes, and will remove the need of manually calling clear_cache() in various places, while still allowing for it.
- For inspiration, here is a commit I wrote for another package which ensures consistency of the __eq__() and __hash__() methods.
Those method caches were commented out as a workaround in Repeatedly calling core.network.Network.interregional_betweenness gives wrong results #124. The correct solution would be to use the same caching mechanism as for other methods, but after mapping the arguments bijectively into a hashable type, i.e., to use an uncached wrapper method which calls the implementation body as a private cached worker method with hashable arguments of fixed type. Specifically, since the expected arguments are of type Iterable[int], the easiest choice would be the hashable type Tuple[int,...].
It would certainly reduce future maintenance burden if the caching system was consistent throughout the library, following (1). A straightforward approach here would be to define the clear_cache() method in a mix-in class that is inherited by all classes that use caching. Note, however, that your current implementation of Network.clear_cache() is incorrect, since the return type of dir(self) is List[str].
Since this has been a recurring issue, before merging this PR, I would recommend extending the test suite with cache consistency checks, for as many as possible of the library methods which already have some tests and whose implementations are cached. This would take some effort, but would clearly increase the reliability of the library, and is related to my comment elsewhere. One possible approach would be along the following lines:
- The vast majority of the existing tests is of the form <provide data, instantiate class, call method, compare output>. One could define an abstract test class, let's say in tests/conftest.py, with a generic test method which receives some specification for these steps, e.g., as objects and lambda functions, and executes them in order. In each of the existing test modules, this abstract test class would then be inherited and instantiated with the specific methods and arguments to recover the existing test suite behaviour.
- In addition, there would now be new steps <modify instance attributes, un-modify instance attributes, modify method arguments>, such that the abstract test can check whether repeatedly calling a method yields the same result (when the instance attributes and method arguments are unchanged) or not (otherwise).
- For inspiration, here is a little test class I wrote for another package which dynamically constructs a number of tests that execute the steps <import tutorial, call its main() method, check output>.

fkuehlein · 2023-12-05T14:49:39Z

Thanks a lot for all your ideas, hints and corrections!

For now, as it will require a good bit of work to implement this accordingly, I will postpone this issue and prioritize the smaller issues. Once those are solved, we can see if we still have time and resources to include a thorough fix of this issue in the release.

- resolve lines too long - remove `FIXME`, refer to #148

fkuehlein · 2024-01-26T10:45:04Z

Noticed an Issue related to memoization:

Several methods of climate.MutualInfoClimateNetwork and climate.HavlinClimateNetwork use an attribute named self._path_lengths_cached that is never assigned to. Due to the respective pylint warning no-member being globally disabled (see #196) and the respective methods not being covered by tests, this problem has apparently not been run into yet.

Improve the approach in reverted commits fe23f23, 0166cd2: - introduce `@Cached.method()` based on `@functools.lru_cache()` - track mutable dependencies for cache invalidation - fix cache clearing helper - add behavioural spec in `tests/test_core/test_cache.py` Adjustments in existing classes in this commit: - define `__cache_state__()` for all subclasses of `Cached` (might require updates when adding further methods to the cache) - define `@Cached.method(attrs=(...))` where appropriate - remove obsolete class-specific caching mechanisms - factor out cached worker: `Network.nsi_betweenness()` - define helper: `Network.find_link_attribute()` - define helper: `ClimateNetwork._weighted_metric()` Adding this behaviour to classes without caching should be straightforward. Classes that remain to be adjusted (still have a `clear_cache()` method or subclass `Cached` without conforming to its protocoll): - `ClimateNetwork` and its subclasses: The recursive interaction between `__init__()`, `_similarity_measure` and `_regenerate_network()` across the class hierarchy is very stateful and does not fit into the regular OO dependency pattern assumed by `Cached`. A redesign of this logic would be advisable, but is left for future work. - `RecurrenceNetwork` and its subclasses, and `Surrogate`: Left as an exercise for the reader.

MAINT: Refactor caching (#124, #148, #153)

mensch72 added the bug label Feb 25, 2021

fkuehlein added this to the New Release milestone Oct 12, 2023

fkuehlein added a commit that referenced this issue Nov 28, 2023

MAINT: fixup fe23f23

0166cd2

- replace removed `clear_paths_cache()` with new `clear_cache_startswith('path_')` - related: #148

fkuehlein added a commit that referenced this issue Dec 5, 2023

LINT: fixup 8bca11e and d23d30b

b98e8b4

- resolve lines too long - remove `FIXME`, refer to #148

fkuehlein mentioned this issue Dec 6, 2023

Directed nsi #153

Merged

ntfrgl mentioned this issue Feb 21, 2024

MAINT: Refactor caching (#124, #148, #153) #219

Merged

ntfrgl linked a pull request Feb 21, 2024 that will close this issue

MAINT: Refactor caching (#124, #148, #153) #219

Merged

fkuehlein closed this as completed in #219 Feb 23, 2024

fkuehlein added a commit that referenced this issue Feb 23, 2024

Merge pull request #219 from pik-copan/148-cache-helper

634d3a0

MAINT: Refactor caching (#124, #148, #153)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

@cached_var may store wrong values if method has arguments #148

@cached_var may store wrong values if method has arguments #148

mensch72 commented Feb 25, 2021

lenas95 commented Mar 17, 2022

ntfrgl commented Apr 8, 2022

ntfrgl commented Jun 27, 2023

fkuehlein commented Nov 28, 2023

ntfrgl commented Dec 2, 2023

fkuehlein commented Dec 5, 2023 •

edited

Loading

fkuehlein commented Jan 26, 2024

@cached_var may store wrong values if method has arguments #148

@cached_var may store wrong values if method has arguments #148

Comments

mensch72 commented Feb 25, 2021

lenas95 commented Mar 17, 2022

ntfrgl commented Apr 8, 2022

ntfrgl commented Jun 27, 2023

fkuehlein commented Nov 28, 2023

ntfrgl commented Dec 2, 2023

fkuehlein commented Dec 5, 2023 • edited Loading

fkuehlein commented Jan 26, 2024

fkuehlein commented Dec 5, 2023 •

edited

Loading