Add step method state and make step results deterministic with respect to it #7508

lucianopaz · 2024-09-18T09:32:11Z

Description

The original intent of this PR was to fully address #7503. That proved to be a very long task, so the current PR focuses only on closing #5797 and adding sampling_state to all pymc step methods. I'll leave the past description further down.

Related Issue

Closes Refactor step methods to use their own random stream #5797
Related to Resuming a MCMC run #143, Ability to work with chains in progress #292, Resuming a multi-process/chain run #3661, and ENH: Add checkpoints during sampling #7503

Checklist

Checked that the pre-commit linting/style checks pass
Included tests that prove the fix is effective or that the new feature works
Added necessary documentation (docstrings and/or example notebooks)
If you are a pro: each commit corresponds to a relevant logical change

Type of change

📚 Documentation preview 📚: https://pymc--7508.org.readthedocs.build/en/7508/

Original intent (OUTDATED)

This will be a long PR, but I want to open the discussion of the design in its early phases. The overall goal is to provide the ability to pause and later resume sampling that is based on pymc step methods. Once this PR is finished, I hope that we'll get into the problem of adding this ability when sampling with blackjax, nutpie and numpyro.

There will be 4 subgoals in this PR. I'll write them down and list the tasks that I'll do in each:

Write something that can dump or load the step method's sampling state.

Create a class that represents the sampling state
Get to dump or load the state for metropolis step methods
Get to dump or load the state for compound step methods
Get to dump or load the state for slice step methods
Get to dump or load the state for HMC step methods
Get to dump or load the state for NUTS step methods
Ensure sampling state completely determines the step method's results

Write something that can dump or load the trace in its intermediate stages.

Get to dump or load the trace for NDArray backend
Get to dump or load the trace for McBackend (this might be done already?)
Get to dump or load the state for Arviz backend.

Add a way to restart sampling using previous sampling states and traces

Single chain MCMC
Parallel MCMC
Population sampling

Add async sampling methods (like nutpie's non blocking)

Single chain MCMC
Parallel MCMC
Population sampling

codecov · 2024-09-19T12:47:29Z

Codecov Report

Attention: Patch coverage is 97.43590% with 11 lines in your changes missing coverage. Please review.

Project coverage is 92.69%. Comparing base (af5ea5c) to head (af74f2c).
Report is 106 commits behind head on main.

Files with missing lines	Patch %	Lines
pymc/step_methods/hmc/quadpotential.py	96.15%	4 Missing ⚠️
pymc/sampling/mcmc.py	83.33%	3 Missing ⚠️
pymc/sampling/parallel.py	83.33%	1 Missing ⚠️
pymc/step_methods/metropolis.py	99.02%	1 Missing ⚠️
pymc/step_methods/state.py	98.11%	1 Missing ⚠️
pymc/util.py	93.75%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #7508      +/-   ##
==========================================
+ Coverage   92.43%   92.69%   +0.25%     
==========================================
  Files         103      104       +1     
  Lines       17109    17402     +293     
==========================================
+ Hits        15814    16130     +316     
+ Misses       1295     1272      -23

Files with missing lines	Coverage Δ
pymc/math.py	`77.71% <100.00%> (ø)`
pymc/sampling/population.py	`74.67% <100.00%> (+1.12%)`	⬆️
pymc/step_methods/arraystep.py	`94.66% <100.00%> (+0.07%)`	⬆️
pymc/step_methods/compound.py	`97.56% <100.00%> (+0.38%)`	⬆️
pymc/step_methods/hmc/base_hmc.py	`91.91% <100.00%> (+1.15%)`	⬆️
pymc/step_methods/hmc/hmc.py	`93.54% <100.00%> (+0.69%)`	⬆️
pymc/step_methods/hmc/nuts.py	`97.40% <100.00%> (+0.12%)`	⬆️
pymc/step_methods/slicer.py	`96.70% <100.00%> (+0.31%)`	⬆️
pymc/step_methods/step_sizes.py	`80.95% <100.00%> (+5.95%)`	⬆️
pymc/sampling/parallel.py	`88.50% <83.33%> (ø)`
... and 5 more

... and 12 files with indirect coverage changes

lucianopaz · 2024-09-23T10:10:01Z

I think that this PR now has good coverage of the first subgoal that I talked about in the description. All pymc step methods now expose a sampling_state property that fully determines the draws that the sampler will produce from that point on. To do this, and also to test it, I had to make one major change: I detached all of the step methods from numpy's global random state. This doesn't bear significant consequences for an average user, but it opens up many possibilities:

samplers could run concurrently if they use different step methods
to ensure reproducibility, one doesn't need to affect the global state, which might have consequences on other packages that are out of pymc's scope
the init_nuts step method needs to be adapted down the line to use random generators instead of a list of random seeds.

I'll ping some people to get some feedback on the current situation, because I've invested quite a lot of time into getting it right (and also have a clean git history), and I would like to get reviews before moving on. Pinging @ricardoV94, @michaelosthege, @Armavica, @ColCarroll and @junpenglao.

ricardoV94

Hi Luciano, had a quick glance. Looks alright but I wonder if we can make it a tad simpler by using dataclasses instead of reinventing something that seems similar?

I left other questions as comments.

Thanks for the initiative, making step methods less opaque will be great for customizing/controlling sampling

pymc/step_methods/arraystep.py

pymc/step_methods/hmc/base_hmc.py

ricardoV94 · 2024-09-23T10:17:10Z

pymc/step_methods/hmc/base_hmc.py

+    _num_divs_sample: int
+
+
+class BaseHMC(GradientSharedStep, WithSamplingState):


Why do we need a separate class? Why not be part of the baseclass?

Which baseclass? WithSamplingState? Or BaseHMCState? If you mean the latter, it's because the BaseHMC step method has properties that are different from other step methods, so I need to represent its state differently. If you mean the WithSamplingState, that's because the WithSamplingState provides the sampling_state property accessors to the step method. Let me know if you meant something else.

I mean all step methods should have whatever WithSamplingState implements in the base class

Oh, I think you’re right. It must be a left over from a previous state of my commits

I would consider a mixin to be a cleaner design, because it doesn't make third party step methods inherit WithSamplingState even if they aren't compatible. It should also make things easier to test by not introducing cross-dependencies.

I don't see any compatibility issues if a step sampler doesn't make use of the functionality in the base class.

Oh, I see what @michaelosthege is saying. The current design is more in line with what @ricardoV94 said: I already have BlockedStep as a subclass of WithSamplingState, so including it in BaseHMC's ancestors explicitly is pointless. The base StepMethodState only tries to access the rng property. I could provide a default value factory for that property, and that way there won't be any problem for third party libraries that define their own step methods. They would simply get a useless random number Generator object. They could use it if they wanted to though.

ricardoV94 · 2024-09-23T10:20:44Z

pymc/step_methods/state.py

+import numpy as np
+
+
+class MetaDataClassState(type):


What is this doing?

I already said it somewhere else but I'll add it here too. I wanted to make the step method states simple dataclass wrapped classes. I ran into problems revolving around __eq__ and then also around positional arguments being defined after arguments with default values. To avoid these problems, I had to add some arguments to the dataclass decorator and I would sometimes forget them, leading to errors down the line. The simplest solution that didn't involve having to always write down boilerplate code with every State definition was to add this metaclass. It simply creates the subclass type and then wraps that using the dataclass(eq=False, kw_only=True) decorator. That way, you just need to inherit from DataClassState and you'll get the guarantee of working with a dataclass that has the specially crafted __eq__ method.

ricardoV94 · 2024-09-23T10:21:22Z

pymc/step_methods/state.py

+        return dataclass(eq=False, kw_only=True)(super().__new__(cls, name, bases, namespace))
+
+
+class DataClassState(metaclass=MetaDataClassState):


Why is this needed? Can we do simpler than this?

Can we just use a vanilla dataclass?

I already answered this above. This solution avoids having to add boilerplate code all around the State definitions. The main problem is that if you have a class that inherits from another class that uses the @dataclass decorator, the subclass won't be a "dataclass". With the metaclass approach, the subclasses will also be "dataclass" types and they will use the nice __eq__ method that I had to write here.
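The inheritance pitfall mentioned here can be reproduced with a small standalone example. `Base` and `Child` are hypothetical names. Note that `is_dataclass` still reports `True` for the undecorated subclass, but the fields it declares are not registered.

```python
from dataclasses import dataclass, fields, is_dataclass


@dataclass
class Base:
    a: int = 0


# No decorator here: the new annotation is never processed by dataclass.
class Child(Base):
    b: int = 1


child_field_names = [f.name for f in fields(Child)]  # only the inherited field
looks_like_dataclass = is_dataclass(Child)  # inherited marker attribute

try:
    Child(b=2)  # the generated __init__ comes from Base and knows nothing about b
    init_accepts_b = True
except TypeError:
    init_accepts_b = False
```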

pymc/step_methods/state.py

ricardoV94 · 2024-09-23T10:24:53Z

pymc/step_methods/state.py

+            kwargs[field.name] = deepcopy(_val)
+        return state_class(**kwargs)
+
+    @sampling_state.setter


When is the setter used?

At the moment, it's only used in the tests. Calling step_method.sampling_state = state will set the step method's attributes to the proper values represented by the provided state. The goal is that we'll use it when we want to jump start the sampler to some past sampling state. The workflow I have in mind is this:

Have the model already built

Have the samplers already built

If someone provides a past sampling state from which to start, set the sampler's state to that

Start sampling using the sampler's set state and collect the results
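The workflow above can be sketched with a toy step method. `ToyStep` and `ToyState` are hypothetical stand-ins, not pymc classes; the sketch only assumes that the state holds copies of the random generator and of a tuning parameter.

```python
from copy import deepcopy
from dataclasses import dataclass

import numpy as np


@dataclass
class ToyState:
    rng: np.random.Generator
    scaling: float


class ToyStep:
    def __init__(self, rng=None, scaling=1.0):
        self.rng = np.random.default_rng(rng)
        self.scaling = scaling

    @property
    def sampling_state(self):
        # Return copies so that later steps do not mutate the snapshot.
        return ToyState(rng=deepcopy(self.rng), scaling=self.scaling)

    @sampling_state.setter
    def sampling_state(self, state):
        self.rng = deepcopy(state.rng)
        self.scaling = state.scaling

    def step(self, x):
        return x + self.scaling * self.rng.normal()


step = ToyStep(rng=42)
step.step(0.0)  # advance the sampler a bit
checkpoint = step.sampling_state  # snapshot taken mid-run

first = [step.step(0.0) for _ in range(3)]
step.sampling_state = checkpoint  # jump back to the snapshot
second = [step.step(0.0) for _ in range(3)]
```

With the state restored, the second batch of draws repeats the first one exactly, which is the determinism property the tests rely on.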

I prefer an explicit method for that functionality? Something like step_method.set_state(state)?

It's 100% subjective opinion though

I agree to disagree on this one. The set_state method looks like any other method call and it’s not explicitly saying that it won’t have any return value. The attribute assignment syntax on the other hand is much more explicit on its intent.

So why is there a set_rng? The same argument you used against set_state would apply? Also, in my experience properties have always been a PITA because at some point we figure out we would like kwargs/customization and there's no way to refactor a property into a method with back_compat.

I added a set_rng method because the subclasses are supposed to overload it. The HMC step methods have their own rng and their potential also has a spawned rng. Setting the rng had to work differently than with the rest of the step methods, so I decided to make it a method instead of a property. Anyway, I still prefer the sampling_state as a property mixin. If we eventually realize that we don't want to, we can always add a new set_state method and issue a DeprecationWarning or RuntimeError in the property setter.

Let me try one last argument. Setting a property (which looks like an attribute) does not make me intuitively think that it will actually affect the sampler. Especially in this case, where the property (sampler.sampling_state) is actually a read-only copy of the internal state, not the state itself.

It's like if you have a PyMC model, which has the attribute model.datalogp. I wouldn't expect model.datalogp = 0 to be a valid way of overriding the model logp. Of course if I read the source code I'll be able to figure it out, but from just reading user code I wouldn't think it would do what I want it to do.

I find sampler.set_rng() makes it much more obvious that it will actually affect the rng used, and sampler.set_state() that it will actually affect the state used.

A final argument I found, which I don't necessarily care about, has to do with inheritance. Calling super() on a property is clumsy if you want to combine the effects of the base class method with some tweaking in the inherited class.

ricardoV94 · 2024-09-23T10:26:03Z

pymc/util.py

+
+def get_random_generator(seed: RandomGenerator = None, copy=True) -> np.random.Generator:
+    if copy:
+        seed = deepcopy(seed)


When/why is copying the seed needed?

When seed is a numpy.random.Generator. If you don't do that, numpy.random.default_rng(seed) will return seed. If you provide a BitGenerator, the Generator object itself will be new, but its BitGenerator will be the same object that you passed in, making it potentially shared with another Generator. To ensure that those two scenarios won't happen, I deepcopy the seed by default.
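Both sharing scenarios can be checked directly with numpy. This is a small illustration rather than the body of get_random_generator; the seed value is arbitrary.

```python
from copy import deepcopy

import numpy as np

# Scenario 1: passing a Generator returns the very same object.
gen = np.random.default_rng(1)
same_generator = np.random.default_rng(gen)

# Scenario 2: passing a BitGenerator yields a new Generator that shares it.
bitgen = np.random.PCG64(1)
wrapped = np.random.default_rng(bitgen)

# Deep-copying first gives an independent generator with the same starting state.
independent = np.random.default_rng(deepcopy(gen))
draw_from_copy = independent.random()
draw_from_original = gen.random()
```

The copy and the original start from the same state, so their first draws agree, but advancing one no longer advances the other.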

Add an explicit instance check for Generator? And/or comment?

I’ll add a comment but I found that numpy does all of the hard type checking work and it felt like a waste to repeat

ricardoV94 · 2024-09-23T10:30:26Z

to ensure reproducibility, one doesn't need to affect the global state, which might have consequences on other packages that are out of pymc's scope

This is a desired feature not a problem. There's an old issue that you can link to/ close if we get rid of the global RNG with this PR

samplers could run concurrently if they use different step methods

How? Don't they always require interleaving steps (i.e. conditioned on a valid state)?

lucianopaz · 2024-09-23T11:40:45Z

Hi Luciano, had a quick glance. Looks alright but I wonder if we can make it a tad simpler by using dataclasses instead of reinventing something that seems similar?

I left other questions as comments.

Thanks for the initiative, making step methods less opaque will be great for customizing/controlling sampling

Thanks for the review @ricardoV94! I am using dataclass. The original approach was to add the @dataclass decorator to every state class. The problem there was that __eq__ didn't work well with numpy arrays or with random number generators. That's why my second approach was to have a base class (DataClassState) that was a dataclass with an __eq__ method. The problem with that approach was that I also had to add the @dataclass(eq=False) decorator to every state class that inherited from that common base. While working on this, I frequently forgot to add either the decorator or the eq=False, leading to failures down the line. That's why, in the final and current approach, I have a metaclass that will call dataclass(eq=False)(cls) on any class that inherits from the DataClassState base class. That way, I can ensure that all child classes are handled as dataclasses, and also use the base class __eq__ that handles the weird array and random generator types. One last thing was that I also had to add kw_only to the dataclass, to enable safe inheritance between the classes. If I didn't, some classes that had default values for some fields could not be used as ancestors because the auto generated __init__ would put those before other mandatory positional arguments.

lucianopaz · 2024-09-23T13:31:03Z

to ensure reproducibility, one doesn't need to affect the global state, which might have consequences on other packages that are out of pymc's scope

This is a desired feature not a problem. There's an old issue that you can link to/ close if we get rid of the global RNG with this PR

Ouch, it looks like I'm a bit of a broken record... I opened #5797 more than 2 years ago.

samplers could run concurrently if they use different step methods

How? Don't they always require interleaving steps (i.e. conditioned on a valid state)?

At the moment, step methods need their own global random state to be able to run concurrently. That means that the samplers were limited to using different processes with their own random state to work. I'm not sure how fork or forkserver work with numpy's global numpy.random.mtrand._rand state, but if they somehow share it and somehow use locks to ensure that they don't break the state with race conditions, the results of sampling wouldn't be deterministic based on the seed, because one chain could draw a sample faster than another at some times and slower than another at other times, making it use different random states to generate samples from. But even if fork and forkserver multiprocessing produce copies of the global state that are unique to each process, if there is some concurrent thread in the process that touches upon the global random state, it would affect the potential draws from the step method. And likewise, the steps from pymc would affect the global state, indirectly affecting other things that might rely on it. Just to make things clear, none of this means that pymc or other concurrent stuff would be breaking the sampling, I just mean to say that the sampling results could be affected because the global random state could be changed in the middle of sampling by anyone. What I did with this PR was to isolate the step methods from anything else, ensuring that they won't interact or interfere with other things that we don't control and that our users might not even be aware of.
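As a minimal illustration of the interference described above (not code from this PR): any call that touches numpy's global state changes the global draws that follow, while a dedicated Generator is unaffected. The seed and the `np.random.uniform()` call standing in for unrelated third-party code are assumptions made for the example.

```python
import numpy as np

# Draw from the global state, with and without an unrelated call in between.
np.random.seed(0)
global_clean = np.random.normal()

np.random.seed(0)
np.random.uniform()  # stand-in for other code touching the global state
global_disturbed = np.random.normal()

# Repeat the experiment with a dedicated Generator.
rng = np.random.default_rng(0)
local_clean = rng.normal()

rng = np.random.default_rng(0)
np.random.uniform()  # the same disturbance no longer matters
local_disturbed = rng.normal()
```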

ricardoV94 · 2024-09-23T13:48:43Z

The problem there was that __eq__ didn't work well with numpy arrays or with random number generators.

Why do we need __eq__?

lucianopaz · 2024-09-23T14:19:07Z

Why do we need __eq__?

For convenience. If we need to assert equality, it’s much better to have this method

ricardoV94 · 2024-09-23T14:29:45Z

Why do we need __eq__?

For convenience. If we need to assert equality, it’s much better to have this method

Why not wait until we see a need then?

lucianopaz · 2024-09-23T14:35:31Z

Why do we need __eq__?

For convenience. If we need to assert equality, it’s much better to have this method

Why not wait until we see a need then?

I did need it in all of the tests I wrote

ricardoV94 · 2024-09-23T15:08:11Z

Why do we need __eq__?

For convenience. If we need to assert equality, it’s much better to have this method

Why not wait until we see a need then?

I did need it in all of the tests I wrote

That's more an argument for a test utility than code we need to strictly maintain. Checking numpy array and random generator equality shows up in other scenarios
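A detached comparison helper along these lines could look roughly like the sketch below. `equal_values` is a hypothetical name and this is not the utility that ended up in the PR; it assumes that two generators count as equal when their bit generator states match.

```python
import numpy as np


def equal_values(v1, v2):
    """Recursively compare values that may contain arrays or generators."""
    if isinstance(v1, np.random.Generator) and isinstance(v2, np.random.Generator):
        # Generators are compared through the state of their bit generators.
        return equal_values(v1.bit_generator.state, v2.bit_generator.state)
    if isinstance(v1, dict) and isinstance(v2, dict):
        return v1.keys() == v2.keys() and all(equal_values(v1[k], v2[k]) for k in v1)
    if isinstance(v1, (list, tuple)) and isinstance(v2, (list, tuple)):
        return len(v1) == len(v2) and all(equal_values(a, b) for a, b in zip(v1, v2))
    if isinstance(v1, np.ndarray) or isinstance(v2, np.ndarray):
        # array_equal returns a single bool instead of an elementwise array.
        return bool(np.array_equal(v1, v2))
    return v1 == v2


rng_a = np.random.default_rng(1)
rng_b = np.random.default_rng(1)
same_before = equal_values(rng_a, rng_b)
rng_b.random()  # advance only one of the two generators
same_after = equal_values(rng_a, rng_b)
```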

michaelosthege

Great work! The refactoring of sampler RNGs could be extracted into its own PR and merged first?

Item 4. of your description sounds unrelated.

If I understand correctly, your approach with the mixin class has the nice benefit of not adding overhead to every iteration!

Regarding item 2. (where to dump traces) I previously pointed at the stats, because they already contain some/many state fields. In McBackend the stats can be sparse, so one could emit a sampler_state state every 100 iterations or so.

Last time I checked ArviZ/xarray didn't support saving sparse arrays to disk. But that's for sure workaroundable at the ArviZ level.
ClickHouseBackend can persist sparse stats already.

michaelosthege · 2024-09-23T21:04:42Z

tests/step_methods/test_metropolis.py

@@ -292,7 +295,7 @@ def test_step_discrete(self):
        unc = np.diag(C) ** 0.5
        check = (("x", np.mean, mu, unc / 10.0), ("x", np.std, unc, unc / 10.0))
        with model:
-            step = Metropolis(S=C, proposal_dist=MultivariateNormalProposal)
+            step = Metropolis(S=C, proposal_dist=MultivariateNormalProposal, rng=123456)


Why is this seed different?

Seeding these tests was a PITA. I kept running into sporadic errors and flakiness while I was polishing the step method detachment from the global random state, and some rngs were left with the weird intermediate seeds.

michaelosthege · 2024-09-23T21:05:28Z

tests/step_methods/test_metropolis.py

@@ -36,6 +36,8 @@
 from tests.helpers import RVsAssignmentStepsTester, StepMethodTester
 from tests.models import mv_simple, mv_simple_discrete, simple_categorical

+SEED = sum(ord(c) for c in "test_metropolis")


but why 😂

Same as above. I can only answer with an argentinian meme

michaelosthege · 2024-09-23T21:07:00Z

pymc/step_methods/state.py

+        this_fields = set([f.name for f in fields(self)])
+        other_fields = set([f.name for f in fields(other)])


inner list comprehension is unnecessary

Why not? The fields function will return Field objects, that have a bunch of extra dataclass specific attributes (e.g. type, default, default_factory). I just want to check that the names are the same and use those names later for getattr.

Oh! I think I understand what you're saying. A set of Field objects should already be enough to test if this_fields == other_fields because all of the other attributes should also match.

actually that was not my point (but you might be right about it)

set(generator) aka set(a for a in "ABCD") works. You can leave out creating the inner list (and then iterating it again when creating the set)
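For reference, the three spellings produce the same set; the generator and set-comprehension forms skip the intermediate list. `Example` is a hypothetical dataclass used only to have some fields to iterate over.

```python
from dataclasses import dataclass, fields


@dataclass
class Example:
    a: int
    b: float


names_via_list = set([f.name for f in fields(Example)])  # builds a list first
names_via_generator = set(f.name for f in fields(Example))  # no intermediate list
names_via_comprehension = {f.name for f in fields(Example)}  # set comprehension
```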

michaelosthege · 2024-09-23T21:23:11Z

pymc/step_methods/state.py

+            return v1 == v2
+
+
+class WithSamplingState:


Can you add docstrings for the three new classes to explain how they fit together?

For WithSamplingState I understand that it's a mixin adding a sampling_state property which, upon access, returns a new container object of a DataClassState subtype. This container holds copies of field values of the WithSamplingState object. (?)

That's exactly right. I'll add comments and docstrings.

ricardoV94 · 2024-09-24T03:49:24Z

If I understand correctly, your approach with the mixin class has the nice benefit of not adding overhead to every iteration!

What overhead? There's nothing computed every iteration

lucianopaz · 2024-09-24T12:54:07Z

Great work! The refactoring of sampler RNGs could be extracted into its own PR and merged first?

Thanks @michaelosthege! I think that it could be extracted. I would need to add docstring entries for rng first though.

Regarding item 2. (where to dump traces) I previously pointed at the stats, because they already contain some/many state fields. In McBackend the stats can be sparse, so one could emit a sampler_state state every 100 iterations or so.
* Last time I checked ArviZ/xarray didn't support saving sparse arrays to disk. But that's for sure workaroundable at the ArviZ level.

* `ClickHouseBackend` can persist sparse stats already.

Thanks for the pointers. I'll try to use that once I arrive at point 2

lucianopaz · 2024-09-24T12:57:05Z

If I understand correctly, your approach with the mixin class has the nice benefit of not adding overhead to every iteration!

What overhead? There's nothing computed every iteration

I think that Michael means that I'm not building a StepMethodState at each step. The state is a property that can be built on demand, but there is no extra compute involved in regular sampling. The step method just chugs along until we eventually call step.sampling_state at some point during sampling. On the other hand, stats objects are built at each step, so they produce some overhead when compared to a situation in which only the samples got collected.

michaelosthege · 2024-09-24T14:50:37Z

On the other hand, stats objects are built at each step, so they produce some overhead when compared to a situation in which only the samples got collected.

The stats are just dicts and as far as I can tell you didn't change anything about how they are collected (alongside draws) in every iteration.

ricardoV94 · 2024-09-26T15:03:27Z

Sounds good to me, can you add a more informative PR title?

ricardoV94 · 2024-09-26T15:15:06Z

BTW I still favor going with dataclasses instead of the custom new classes and have the equality as a detached test utility. There is no functionality that depends on equality right now or in the foreseeable future, unless I missed something.

Complexity for testing purposes seems backwards to me.

ricardoV94 · 2024-09-26T15:25:06Z

pymc/step_methods/state.py

+            return False
+        if isinstance(v1, (list, tuple)):  # noqa: UP038
+            return len(v1) == len(v2) and all(
+                DataClassState.compare_values(v1i, v2i) for v1i, v2i in zip(v1, v2)


You could use zip(..., strict=True) and avoid explicitly comparing the length

This actually wasn’t good because strict=True raises a ValueError, and I just want it to return False. I’ll keep the length comparison as it was

lucianopaz · 2024-09-26T16:56:18Z

BTW I still favor going with dataclasses instead of the custom new classes and have the equality as a detached test utility. There is no functionality that depends on equality right now or in the foreseeable future, unless I missed something.

Complexity for testing purposes seems backwards to me.

I think that I can have a workaround for part of this, but it really depends on what you mean by dataclasses. Is the problem that I'm using a metaclass? Is it that I'm defining an __eq__? Is the problem that I'm using inheritance between DataClassState and some of its subclasses?

If your problem is that I'm relying on metaclasses, I think that I can do something differently to avoid them. If the problem is the __eq__, I think that I can move that to the tests as a function. What I don't want to change is the inheritance from DataClassState. That is important for static type analysis and also helped with reducing the amount of duplicate code.

ricardoV94 · 2024-09-26T17:02:39Z

Yes my problem is the implementation of __eq__ and metaclass. I don't see why we need this for what is basically a namedtuple. We hold RNG / np.ndarray in many kinds of objects and we don't usually go about implementing equality if we don't have to (we do for TensorConstants in PyTensor for example)

lucianopaz · 2024-09-27T04:23:17Z

Yes my problem is the implementation of __eq__ and metaclass. I don't see why we need this for what is basically a namedtuple. We hold RNG / np.ndarray in many kinds of objects and we don't usually go about implementing equality if we don't have to (we do for TensorConstants in PyTensor for example)

@ricardoV94, I changed the code to avoid the metaclass and detached the __eq__ from the code. I did have to leave a comparison utility function in the main codebase because I want to be able to compare frozen fields. Let me know if you think that's good enough. If it is, I'll clean up the last commit and we can merge. I chose to leave it dirty for now to make it easier to undo the change if we don't like it.

ricardoV94 · 2024-09-27T11:10:44Z

@lucianopaz sounds good to me. Besides my personal preference for methods vs setter, I have one last suggestion and one question.

Rename compare_dataclass_values and compare_states to equal_dataclass_values, equal_states (or whatever it was called, point being compare -> equal).

What's the deal with frozen fields? Why do we need them / to worry about them?

To be clear, I'm happy with the state and I am not blocking the merge after the rebase.

lucianopaz · 2024-09-27T12:40:47Z

Rename compare_dataclass_values and compare_states to equal_dataclass_values, equal_states (or whatever it was called, point being compare -> equal).

Good point, I'll do that.

What's the deal with frozen fields? Why do we need them / to worry about them?

The step methods have a bunch of extra information in them that gets set when they are created. My very first idea was to try to have some kind of pickle.dump approach where the entire step method would be stored. The problem with that was that the step methods have references to model variables and compiled functions. Serializing the step method would then force us to also serialize the whole model instance along with some compiled functions. This extra burden made me think that it would be better to only store and set some small bits of information from the steppers. Part of this information changes as samples are drawn, and other parts don't.

The long term goal was to be able to set the step method to a state where it could continue sampling as it had been doing before. Since I wouldn't be able to rebuild the full step method from what I save to disk, I needed to add some way to determine that the stored state was compatible with the step method that was being modified. That's why I decided to include some step information that doesn't change during sampling as frozen fields. If the saved state doesn't match with the stepper's frozen fields, that means that the state is not valid for the step method and an error should be raised.
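One way such a compatibility check could work is sketched below. Marking fields through `field(metadata={"frozen": True})` is an assumption made for the example, and `ToyState` and `ToyStep` are hypothetical; the actual mechanism in the PR may differ.

```python
from dataclasses import dataclass, field, fields


@dataclass
class ToyState:
    scaling: float  # changes while sampling
    n_dims: int = field(metadata={"frozen": True})  # fixed when the step is built


class ToyStep:
    def __init__(self, n_dims, scaling=1.0):
        self.n_dims = n_dims
        self.scaling = scaling

    @property
    def sampling_state(self):
        return ToyState(scaling=self.scaling, n_dims=self.n_dims)

    @sampling_state.setter
    def sampling_state(self, state):
        for f in fields(state):
            value = getattr(state, f.name)
            if f.metadata.get("frozen", False):
                # Frozen fields are only checked, never overwritten.
                if getattr(self, f.name) != value:
                    raise ValueError(f"Incompatible state: field {f.name!r} differs")
            else:
                setattr(self, f.name, value)


step = ToyStep(n_dims=3)
step.sampling_state = ToyState(scaling=0.25, n_dims=3)  # compatible state

try:
    step.sampling_state = ToyState(scaling=0.25, n_dims=5)  # built for another model
    rejected = False
except ValueError:
    rejected = True
```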

PRs pymc-devs#7508 and pymc-devs#7492 introduced incompatible changes but were not tested simultaneously. Deepcopying the steps in the tests leads to deepcopying the model which uses `clone_model`, which in turn does not support initvals.


lucianopaz added the enhancements, major (Include in major changes release notes section), and samplers labels Sep 18, 2024

lucianopaz force-pushed the checkpoints branch 5 times, most recently from 8782ed3 to 8c44fd0 Compare September 19, 2024 12:15

lucianopaz force-pushed the checkpoints branch from c84b4d1 to cac126e Compare September 23, 2024 10:03

ricardoV94 reviewed Sep 23, 2024


michaelosthege reviewed Sep 23, 2024


lucianopaz force-pushed the checkpoints branch from cac126e to fb80ed5 Compare September 24, 2024 07:20

lucianopaz force-pushed the checkpoints branch from fb80ed5 to d02a5b7 Compare September 25, 2024 08:29

Fix dangling step in test_population

131df71

lucianopaz force-pushed the checkpoints branch 2 times, most recently from aa4f007 to 92d0845 Compare September 25, 2024 12:19

lucianopaz marked this pull request as ready for review September 26, 2024 10:02

ricardoV94 reviewed Sep 26, 2024


lucianopaz changed the title ~~Checkpoints~~ Add step method state and make step results deterministic with respect to it Sep 26, 2024

lucianopaz force-pushed the checkpoints branch from 8df720e to 112862f Compare September 27, 2024 04:20

lucianopaz force-pushed the checkpoints branch from 112862f to de72375 Compare September 27, 2024 06:54

lucianopaz added 5 commits September 27, 2024 14:49

Add sampling state base classes

c399241

Add step method state

5f6ac33

Add metropolis sampling state

ca2c60b

Add slice sampling state

04fbe64

Add HMC sampling state

af74f2c

lucianopaz force-pushed the checkpoints branch from de72375 to af74f2c Compare September 27, 2024 12:51

ricardoV94 approved these changes Oct 2, 2024


lucianopaz merged commit 465d8ac into pymc-devs:main Oct 7, 2024
20 checks passed

lucianopaz deleted the checkpoints branch October 7, 2024 08:00

ricardoV94 mentioned this pull request Oct 8, 2024

Do not use initval in test model #7529

Merged

lucianopaz mentioned this pull request Oct 16, 2024

Add ZarrTrace #7540

Merged

19 tasks

ricardoV94 mentioned this pull request Dec 5, 2024

Bump numpy version due to use of Generator.spawn only available in >=1.25 #7607

Merged

ricardoV94 mentioned this pull request Dec 13, 2024

Sampling with generators as seeding no longer deterministic #7612

Closed
