Implement Bayesian Structural Time Series (BSTS) #473

cetagostini · 2025-05-24T12:22:55Z

PR Description:

This PR introduces Bayesian Structural Time Series (BSTS) modeling to CausalPy, enabling advanced time series analysis, forecasting, and causal impact estimation. Inspired by principles from structural time series modeling basic with only trend/seasonal components (e.g., TensorFlow Probability's STS first example), this work decomposes time series into trend, seasonality, and optional regressor components within a Bayesian framework.

Key Changes:

BayesianStructuralTimeSeries Model (causalpy/pymc_models.py):
- New PyMC model for BSTS. Defaults to pymc-marketing components (LinearTrend, YearlyFourier), supports custom components, and optional exogenous regressors (X).
- Adapted _data_setter, predict, score, and fit for time-dependent forecasting and ensuring mu (sum of components) is sampled.
StructuredTimeSeries Experiment (causalpy/experiments/structured_time_series.py):
- New experiment class for causal inference with BSTS.
- Handles DatetimeIndex data, treatment_time, and patsy formulas for exogenous regressors.
  - Intercept Logic: Correctly manages formulas like y ~ 0 or y ~ 1 by ensuring the BSTS model's trend component provides the baseline, avoiding redundant patsy-generated intercepts for exogenous regressors.
- Provides summary(), plot() (3-panel: fit/counterfactual, pointwise impact, cumulative impact), and get_plot_data().
Testing & Integration:
- Added comprehensive integration tests (test_bayesian_structural_time_series) covering various scenarios, including custom components and error handling.
- Fixed a plotting issue in plot_xY (causalpy/plot_utils.py).
API & Docs:
- StructuredTimeSeries available as causalpy.StructuredTimeSeries.
- Added backward-compatible wrapper in causalpy.pymc_experiments.py.
- Updated relevant docstrings.

📚 Documentation preview 📚: https://causalpy--473.org.readthedocs.build/en/473/

review-notebook-app · 2025-05-24T12:23:00Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

cetagostini · 2025-05-24T12:43:26Z

Thinking to bring HSGP to here, and be able to replicate the second example.

drbenvincent · 2025-05-24T14:56:38Z

WOAH!

Ultra quick review / questions until I've got some more time:

Is this actually BSTS? It doesn't seem the use the BSTS stuff from @jessegrabowski in pymc-extras. If it's leaning heavily on linear trend and Fourier bases (from pymc-marketing, is this more like a prophet approach? No autoregressive components as far as I can tell? Still would be very very useful and an advancement on what we have so far.
Not able to take a deep dive at this point, but my hope in terms of the API would be that we could use the existing InterruptedTimeSeries experiment class and simply inject a new time series based pymc model class. Will aim to look into this in more detail - if there was a reason for a new experiment class then we can look into whether we can adapt it to make it more general.
Need to edit docs/source/notebooks/index.md to get the new notebook to render in the docs.
The new notebook is a great start. Could be good to compare the previous approach "y ~ 1 + t + C(month)" in its_pymc.ipynb to see how things differ.

Sorry for the relatively superficial review at this point - family weekend time. Will enjoy diving into the details soon.

cetagostini · 2025-05-24T18:39:12Z

Hey @drbenvincent

Still BSTS, the Structural time series (STS) models are a family of probability models for time series that includes and generalizes many standard time-series modeling ideas, including:

autoregressive processes,
moving averages,
local linear trends,
seasonality, and regression and variable selection on external covariates (other time series potentially related to the series of interest).

Definitely, it's handy and popular to talk about state-space, but my understanding is that BSTS in the broad sense does not require a true state‐space; it merely requires a Bayesian, additive decomposition into trend, seasonality, and optional regressors. Unless you insist on the classic DLM/state‐space definition from a Kalman‐Filter sense.

In the Google blog the first model its very similar to this, the only difference is that the slope from the linear trend is "evolving slowly over time following a random walk".

Thanks to the HSGP class, we can add something similar if we want to.

cetagostini · 2025-05-24T18:57:41Z

Not able to take a deep dive at this point, but my hope in terms of the API would be that we could use the existing InterruptedTimeSeries experiment class and simply inject a new time series based pymc model class. Will aim to look into this in more detail - if there was a reason for a new experiment class then we can look into whether we can adapt it to make it more general.

Happy to do it, my only opinion is that the API for the new class is quite different and allow for arbitrary components for trend and seasonality. IF they follow pymc-marketing protocols.

I'm doing that because will help me to easily input HSGP and replace this fourier bases approach (just as example), for the same structural time series. Could probably help to just replace by the state-space.

I'll be working on that as well :) I feel should be easy to do.

cetagostini · 2025-05-24T19:00:19Z

The new notebook is a great start. Could be good to compare the previous approach "y ~ 1 + t + C(month)" in its_pymc.ipynb to see how things differ.

Almost the same. The bsts fits more during training (high R2) and has less variance (lower sd).

drbenvincent · 2025-05-24T21:11:26Z

Nice. So I am a beginner in time series modelling - which is why I've not given this a go yet. But that's a good clarification BSTS != state space.

Do you think your implementation leaves the door open for extension into autoregressive components etc?

cetagostini · 2025-05-24T22:16:45Z

Nice. So I am a beginner in time series modelling - which is why I've not given this a go yet. But that's a good clarification BSTS != state space.

Do you think your implementation leaves the door open for extension into autoregressive components etc?

Indeed, I'll try to bring an example tomorrow!

drbenvincent

Firstly, this is very cool! Will probably need a few rounds of review to get this merged, but it will happen :)

Let's try to make it fit into the current docs and codebase as seamlessly as possible. So far we've got a nice distinction between "experiments" and "models".

The new model class BayesianStructuralTimeSeries fits in with the current structure very nicely. Given the range of experiments we've got so far the obvious candidate is to work with the interrupted time series experiment, and maybe multi-observation differences in differences. So the natural way to make this work would be to make the InterruptedTimeSeries class accept your new BayesianStructuralTimeSeries class. That would be in an ideal world, but let's see what we can do quickly to move in that direction.

The API differences between InterruptedTimeSeries and StructuredTimeSeries are minimal. Although obviously what is done with the provided data diverges a lot. Trying to resolve this in one go might be a bit much, but at the moment I've got these concrete suggestions:

Move the new StructuredTimeSeries class into interrupted_time_series.py and delete structured_time_series.py.
I relatively strongly think we should avoid the duplication of the plotting code. Like I say, we're trying to make the interrupted time series experiment work with a BSTS model, I don't think there is any reason why any custom plot logic is needed. We could extract the plot code into a mixin class (called something like ITSPlotter) which can be injected into both StructuredTimeSeries and InterruptedTimeSeries.

I think this already goes a long way towards keeping the nice distinction between experiment classes and model classes. The PR is already not far off that - the main issue I see is that StructuredTimeSeries is currently treated like a new experiment when in fact it's exactly the same as interrupted time series, but just separate in order to handle the different model input.

The new notebook is clearly applied to the interrupted time series experiment, so I think the notebook contents can be appended to the existing its_pymc.ipynb. That could be quite nice - the notebook would then be a kind of "how to do ITS from simple linear model to proper time series modeling".

AlexAndorra · 2025-05-26T14:11:10Z

This looks really cool and useful!!
Just dropping in as I see you guys are thinking of making this broader, with HSGP and broader TS decomposition (which I love!): any reason you're not plugging into pymc-extras.statespace? It should already have everything you need, and you can offload the model dev to it

drbenvincent · 2025-05-26T14:19:22Z

any reason you're not plugging into pymc-extras.statespace

Thanks @AlexAndorra - this is the direction I was hoping we'd go in. I've done some basic experimentation with it, but I'm a time series newbie and am not confident enough to know when I need to use what component, how to deal with non-stationarity etc. There's a lot more lower hanging fruit for me to work on in CausalPy, so I'm happy that this has been dropped on my lap by @cetagostini :)

I have a very crude understanding of how BSTS and state space approaches relate to each other (though @cetagostini explained it a bit above).

Would it make sense to end up having both this kind of BSTS model class and (eventually) the pymc-extras state space stuff? If that is silly then I guess the proposed implementation could be over-written by a state space implementation? Very happy to take guidance on this.

drbenvincent · 2025-05-26T14:25:48Z

@cetagostini The BayesianStructuralTimeSeries.build_model method seems like a reasonably simple bit of pymc code. Though as far as I can tell, this seems to be build as an out of the box solution. Nothing wrong with that, but the TFP blog post you linked has a very modular API where users can build their own model from components.

I don't think having a fixed out of the box BSTS model is bad, and in some ways it is good because it is what it is and doesn't require a user to meddle much. That said, it could be pretty cool to think about an API where users can modularly build up a pymc model to pass into the experiment class. Though I imagine this would probably be a considerable bit more work?

AlexAndorra · 2025-05-26T14:46:56Z

Yeah, the statespace module has a submodule unlocking STS. I think it's doing everything you guys are trying to do here. The one thing we're missing (and actively working on; almost done!) is vectorizing the Kalman filter, to allow for batched time series

cetagostini · 2025-05-26T14:56:21Z

This looks really cool and useful!!
Just dropping in as I see you guys are thinking of making this broader, with HSGP and broader TS decomposition (which I love!): any reason you're not plugging into pymc-extras.statespace? It should already have everything you need, and you can offload the model dev to it

I'll take a look here! @AlexAndorra Thanks for jumping!

@cetagostini The BayesianStructuralTimeSeries.build_model method seems like a reasonably simple bit of pymc code. Though as far as I can tell, this seems to be build as an out of the box solution. Nothing wrong with that, but the TFP blog post you linked has a very modular API where users can build their own model from components.

I'm already thinking no this, thats why I left the parameters trend_component and seasonal_component with a simple wrapper with a similar pymc-marketing signature, then should be easy to add them and replace by a state-space trend or seasonal.

That said, it could be pretty cool to think about an API where users can modularly build up a pymc model to pass into the experiment class

Thats cool, and seeing signatures, I feel could be. But not sure if for a first iteration? My guess, better to make a class that can manage different stuff like non state-spaces, and state spaces type of models, then we can move forward in order PR and say, lets bring any pymc time series model, and thats it. What do you think @drbenvincent ?

drbenvincent · 2025-05-26T15:03:39Z

Sure @cetagostini I'm happy with an iterative approach. Happy to proceed along the lines in my first review.

cetagostini · 2025-05-26T21:52:31Z

@drbenvincent

Apply the changes:

Under experiements, change Interrupted Time Series name to Structural Time Series, more generic and adhoc to get Interrupted time and basis expansion time series. This will help to hold the same structural time series class able to get StateSpaceTimeSeries, BasisExpansionTimeSeries or InterruptedTimeSeries. All under same family.
Keep interrupted time series signatures to backward compatibility.
Move the example under the same ITS notebook.
Avoid duplication of plotting and stuff because we rely on the same class.

cetagostini · 2025-05-26T23:55:04Z

@drbenvincent Implementing the StateSpace it's taking more effort than expected, will continue during week... But I think, it requires probably a following PR to not make this massive, instead of all in one.. Not sure, take a look and give me feedback. I already add a class draft in the notebook but need more work to be able to be used by the new experiment STS class.

cetagostini · 2025-05-28T00:23:00Z

@drbenvincent

integration with state space ready, took more than expected because the state-spaces needed a few wrappers.... Can you modify and give me hand with the docs? All in my local looks okay, not sure whats missing here!

Currently:

Test are done.
Backward compatible.
Support for BSTS (state and non state spaces)

Important

We are using y_hat and mu in different places, I think, y_hat should be the one for the plots, right?

drbenvincent

Thanks @cetagostini.

I can see that we've got 2 new examples in the its_pymc.ipynb notebook. That looks cool.

Though it's making me think that we should make the synthetic data a bit more complex - at the moment it's just got a simple linear trend and annual seasonality. We should find some classic time series dataset suitable for interrupted time series. That way, the new stuff you've done should be better than the pre-existing y ~ 1 + t + C(month) linear model, which would better sell the whole thing :)

I'm also seeing the state space model fit doesn't look too great. Maybe we could work on that, possibly getting some input from the state space guru himself?

The changes to the experiment classes isn't quite what I was thinking - though it's close. I was hoping to keep the class name InterruptedTimeSeries because it's most obviously the name of what we're doing in terms of a quasi experiment. Can we move the entire contents of structural_time_series.py and put it in interrupted_time_series.py, and use the InterruptedTimeSeries name for the experiment class rather than StructuralTimeSeries? Happy to jump on a call about it.

In terms of plots, it maybe depends. But for causal impact stuff we want mu. A frequentist approach focusses on the model expectation (mu), so in the Bayesian case we just have a posterior distribution over that expectation.

With the state space stuff, I see you've got some checks for the presence of pymc-extras. Maybe we should add some info in the ITS docs notebook to tell the reader to install it, maybe pointing them to the pymc-extras install instructions?

Though this is looking much closer to done now :)

cetagostini · 2025-06-09T18:18:46Z

@drbenvincent

Sure, I'll make the changes.

The issue with state space it's observe even in the examples in BSTS from pycm-extras.

You can see that the series shift a bit from the data.

I'll keep the name InterruptedTimeSeries, agree with you. Regarding the data, I'll need to build something, how urgent it is?

Finally, why so many things with conflicts? 😅

drbenvincent · 2025-06-09T18:26:41Z

I'll take a look at the conflicts when I'm back from a work trip.

Just wanted to keep up the momentum but I know you've got other demands on your time.

…s not StructuralTimeSeries class

drbenvincent · 2025-06-20T11:37:30Z

@cetagostini I've now resolved all the conflicts. There were some major changes that got merged into main.

Note, the way how we build docs has now slightly changed. It should be identical to how you do it in pymc-marketing now, but see here if in doubt

There is still a failing doctest for the new class though. I'll leave that to you :)

jessegrabowski · 2025-07-22T07:55:50Z

The issue with state space it's observe even in the examples in BSTS from pycm-extras.

I talked to Carlos about this in private, but just so that it's repeated here: there's no bug in extras, only in that plotting code.

drbenvincent · 2025-07-22T14:34:28Z

Anything needed from me at this point? Even if it's just to coordinate and provide extra motivational pizazz?

…odules from pytest config

Resolved conflicts by: - Keeping BSTS method signatures in pymc_models.py - Combining BSTS tests with new multi-unit synthetic control tests - Updating interrogate badge to latest coverage (95.6%) - Taking main branch version of notebook to avoid JSON conflicts This merge adds both BSTS functionality and multi-unit synthetic control features.

cetagostini · 2025-07-28T20:49:45Z

Would be to much to ask to keep by now? cp.StructuralTimeSeries and cp.InterruptedTimeSeries for a bit? I made both to be equivalent, after a few PRs, interrupted change quite a bit and they are not similar any more, and looks like a bit of work to go over all changes to release.

I feel better to keep this separate, merge, and make another just to kill StructuralTimeSeries (Try to tackle all in this will make it as stuck as power analysis at some point). Probably, would be nice, if I can get a grasp of the recent changes.

I applied, @jessegrabowski explanation and everything looks nicer on the plot now :)

drbenvincent · 2025-07-29T19:12:15Z

Dealt with some merge conflicts. So you'll need to pull recent changes @cetagostini
~~We need to manually merge your changes (in its_pymc_copy.ipynb) into the recently changed its_pymc.ipynb).~~ (see below)
Just thinking again - do we want to make pymc-marketing a dependency? Or shall we just ask people to manually install pymc-marketing into their environment if they are to use this new functionality? I think we did something like that with nutpie in one of the notebooks by @NathanielF. Thoughts on this @NathanielF ?

Notes

Just some quick notes because I've got a small window now to re-familiarise myself.

We've got TWO new models, BayesianBasisExpansionTimeSeries and BayesianBasisExpansionTimeSeries.
We've got ONE new experiment class StructuralTimeSeries. From memory, this is very similar to the InterruptedTimeSeries, in that it's functionally the same, but just has some changes to make it work with the new models? In an ideal world we would just have the InterruptedTimeSeries experiment class and make it such that we don't need another class. Not sure how much work this would take, but I'm sure that between Claude 4 and myself we could figure something out.
Docs wise, we've got pretty terse examples added to the basic interrupted time series example page. This is fine, though that runs off very simple synthetic data and I'm wondering if that is causing issues. For example, the pre-intervention $R^2$ for ssts_result is 1. I'm thinking that we could instead (or additionally) update the docs for the interrupted time series notebook using real covid deaths data. That should pretty much be a cut/paste job?
We do want some additional explanatory/introductory information in the interrupted time series notebook which introduced the new functionality. My slight preference is to do this now in this PR, but if it's going to be a big bottleneck, we can always add this later - hopefully very soon?

Will come back again and do a deeper dive on this soon. But what are your thoughts (and availability) on making some changes in the mean time @cetagostini?

Tagging @ErikRingen and @JakePiekarski314 on this PR because it's a big shiny new feature. If you have time it would be good to get your thoughts.

drbenvincent · 2025-07-29T19:16:26Z

Doc test failing for StructuralTimeSeries. Currently that docstring uses the LinearRegression model, which I guess is not a doable experiment/model combination? Do just need to replace with the right model I assume?

drbenvincent · 2025-07-29T19:19:35Z

causalpy/pymc_models.py

+        \text{seasonality} &\sim \text{YearlyFourier}(...) \\
+        \beta &\sim \mathrm{Normal}(0, \sigma_{\beta}) \quad \text{(if X is provided)} \\
+        \sigma &\sim \mathrm{HalfNormal}(\sigma_{err}) \\
+        \mu &= \text{trend_component} + \text{seasonality_component} [+ X \cdot \beta] \\


This line in the docstring is confusing. That + in the square brackets.

drbenvincent

Could add pymc-extras to the autodoc_mock_imports in conf.py. Should help with autodoc when we refer to the time series stuff in pymc-extras. But consider as optional - can come back to polish this in another small and focussed PR.
Mentioned elsewhere that it would be super cool to get more intro information in the docs about time series modelling. Maybe it would be a lower bar to just expand on this in the docstrings of the new classes?

NathanielF · 2025-07-29T19:29:07Z

Don't have complete familiarity with this PR, but just on the dependency question. For the IV experiment class, we recommended posterior predictive sampling with numpyro rather than the baseline pymc because MvNormal ppc was so slow.

I think recommending a sampler as an optional dependency seems more reasonable than a whole package....

Not saying we shouldn't but pymc-marketing is huge. Would avoid inflating dependencies if we could avoid it...

drbenvincent · 2025-07-29T19:44:43Z

On the desired merge `StructuralTimeSeries` into `InterruptedTimeSeries`

Bear in mind that since this PR was started all experiment classes (including InterruptedTimeSeries) has embraced using xarray data structures, rather than numpy. So at the very least, we need to embrace that change. Not looked in detail, but that might also eliminate some of the difference between the two classes? Could be much easier with some LLM help on that front. I can give it a go if you're not keen @cetagostini?

Looking at the diff between the two, I think it's doable. Most of the differences related to the numpy -> xarray change.

Implement Bayesian Structural Time Series (BSTS)

ecc143b

cetagostini requested a review from drbenvincent May 24, 2025 12:22

cetagostini self-assigned this May 24, 2025

cetagostini added documentation Improvements or additions to documentation enhancement New feature or request labels May 24, 2025

Dependency

c49a179

docstring

22c2eba

drbenvincent requested changes May 26, 2025

View reviewed changes

Massive set of changes

858db9c

cetagostini added 3 commits May 27, 2025 01:04

Namings mix

7ead617

Fucking gpt

64c1e60

draft adding state space

da47d90

Integration with state space

f2579f1

cetagostini requested a review from drbenvincent May 28, 2025 00:23

drbenvincent requested changes May 29, 2025

View reviewed changes

drbenvincent added 2 commits June 20, 2025 12:31

resolve conflicts with main

c48dd97

fix mistake in ITS tests - should still be InterruptedTimeSeries clas…

f4da3fb

…s not StructuralTimeSeries class

cetagostini added 7 commits July 28, 2025 23:09

Adding BSTS changes and notebook updates

0aa7942

Resolve merge conflicts: fix __init__.py imports and remove doctest-m…

7c71519

…odules from pytest config

Add notebook copy file

3acc970

Remove leftover notebook copy file

256d30c

Update notebook execution outputs

aab6310

Making its work better

31ee9b9

cetagostini and others added 2 commits July 28, 2025 23:54

Delete the dot model.

13c0a62

Merge branch 'main' into cetagostini/adding_bsts_to_causalpy

6fd62ce

drbenvincent reviewed Jul 29, 2025

View reviewed changes

drbenvincent requested changes Jul 29, 2025

View reviewed changes

Implement Bayesian Structural Time Series (BSTS) #473

Are you sure you want to change the base?

Implement Bayesian Structural Time Series (BSTS) #473

Conversation

cetagostini commented May 24, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

review-notebook-app bot commented May 24, 2025

Uh oh!

cetagostini commented May 24, 2025

Uh oh!

drbenvincent commented May 24, 2025

Uh oh!

cetagostini commented May 24, 2025

Uh oh!

cetagostini commented May 24, 2025

Uh oh!

cetagostini commented May 24, 2025

Uh oh!

drbenvincent commented May 24, 2025

Uh oh!

cetagostini commented May 24, 2025

Uh oh!

drbenvincent left a comment

Choose a reason for hiding this comment

Uh oh!

AlexAndorra commented May 26, 2025

Uh oh!

drbenvincent commented May 26, 2025

Uh oh!

drbenvincent commented May 26, 2025

Uh oh!

AlexAndorra commented May 26, 2025

Uh oh!

cetagostini commented May 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

drbenvincent commented May 26, 2025

Uh oh!

cetagostini commented May 26, 2025

Uh oh!

cetagostini commented May 26, 2025

Uh oh!

cetagostini commented May 28, 2025

Uh oh!

drbenvincent left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

cetagostini commented Jun 9, 2025

Uh oh!

drbenvincent commented Jun 9, 2025

Uh oh!

drbenvincent commented Jun 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jessegrabowski commented Jul 22, 2025

Uh oh!

drbenvincent commented Jul 22, 2025

Uh oh!

cetagostini commented Jul 28, 2025

Uh oh!

drbenvincent commented Jul 29, 2025

Notes

Uh oh!

drbenvincent commented Jul 29, 2025

Uh oh!

drbenvincent Jul 29, 2025

Choose a reason for hiding this comment

Uh oh!

drbenvincent left a comment

Choose a reason for hiding this comment

Uh oh!

NathanielF commented Jul 29, 2025

Uh oh!

drbenvincent commented Jul 29, 2025

On the desired merge StructuralTimeSeries into InterruptedTimeSeries

Uh oh!

Uh oh!

cetagostini commented May 24, 2025 •

edited by github-actions bot

Loading

cetagostini commented May 26, 2025 •

edited

Loading

drbenvincent left a comment •

edited

Loading

drbenvincent commented Jun 20, 2025 •

edited

Loading

On the desired merge `StructuralTimeSeries` into `InterruptedTimeSeries`