Pseudo-Observation Parametrisations #121
base: master
Conversation
Codecov Report
@@ Coverage Diff @@
## master #121 +/- ##
==========================================
+ Coverage 93.61% 94.41% +0.79%
==========================================
Files 5 5
Lines 329 376 +47
==========================================
+ Hits 308 355 +47
Misses 21 21
Continue to review full report at Codecov.
# pseudo-observation covariance matrix `Ŝ` such that ``\hat{C} = C - C (C + \hat{S})^{-1} C``.
#
# However, this is not necessarily a problem: if the likelihood used in the model is
# log-concave then the optimal choice for `Ĉ` can always be represented using this
do we have a citation for this? 🤔
I was wondering about that. I've definitely seen the result floating around (and it's easy enough to prove) -- will have a hunt.
if it's that easy to prove, just do it here in the docs 😂
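For reference, the algebraic half of the claim is easy to check: by the matrix inversion lemma, ``\hat{C} = C - C (C + \hat{S})^{-1} C`` is the same as ``\hat{C}^{-1} = C^{-1} + \hat{S}^{-1}``, i.e. the parametrisation adds a positive-definite term to the prior precision. Below is a minimal numerical sketch (plain `LinearAlgebra`, not part of the PR); the step that would still want a citation in the docs is that a log-concave likelihood makes the optimal precision exceed the prior precision by exactly such a PSD term.

```julia
# Numerical sanity check (sketch, not PR code): the pseudo-observation form of
# the covariance equals "prior precision plus an SPD term".
using LinearAlgebra, Random

Random.seed!(0)
n = 4
A = randn(n, n); C = A * A' + I    # prior covariance at the pseudo-points (SPD)
B = randn(n, n); Ŝ = B * B' + I    # pseudo-observation covariance (SPD)

Ĉ_pseudo_obs = C - C * ((C + Ŝ) \ C)    # the form used in the docs
Ĉ_precision  = inv(inv(C) + inv(Ŝ))    # matrix-inversion-lemma equivalent

@assert isapprox(Ĉ_pseudo_obs, Ĉ_precision; rtol=1e-8)
```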
# The reason to think that this parametrisation will do something sensible is this property.
# Obviously when ``\mathbf{v} \neq \mathbf{x}`` the optimal approximate posterior cannot be
# recovered; however, the hope is that there exists a small pseudo-dataset which gets
# close to the optimum.
what's the advantage of this parametrisation?
)
    return AbstractGPs.elbo(sva, l_fx, ys; kwargs...)
end

_get_prior(approx::SparseVariationalApproximation) = approx.fz.f
How about just calling it `prior`?

Suggested change:
- _get_prior(approx::SparseVariationalApproximation) = approx.fz.f
+ prior(approx::SparseVariationalApproximation) = approx.fz.f

I think it shows up in a bunch more places (within AbstractGPs' VFE/DTC code too)...
PseudoObsSparseVariationalApproximation(
    f::AbstractGP,
    z::AbstractVector,
    S::AbstractMatrix{<:Real},
Hm, it's got to be a symmetric, pos-def matrix, right?
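For reference, one common way to enforce that constraint in an implementation (a sketch with hypothetical names, not what the PR does) is to optimise an unconstrained lower-triangular factor and construct `S` from it:

```julia
# Sketch (hypothetical, not the PR's code): parametrise S via an unconstrained
# lower-triangular factor so it is symmetric positive-definite by construction.
using LinearAlgebra

n = 3
L_free = LowerTriangular(randn(n, n))   # unconstrained parameters
# make the diagonal strictly positive, then form S = L Lᵀ (+ jitter)
L = L_free - Diagonal(diag(L_free)) + Diagonal(exp.(diag(L_free)))
S = L * L' + 1e-12I

@assert isposdef(Symmetric(S))
```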
    S::AbstractMatrix{<:Real}, v::AbstractVector, y::AbstractVector{<:Real}
)

Chooses `likelihood(u) = N(y; f(v), S)` where `length(y)` need not be equal to the number
I don't understand how this makes any sense. On the left-hand side you have `u`, on the right-hand side you don't. How are the two coupled?
Ah, so it's implicit via `f`. `u := f(z)`, so by making noisy observations of `f(v)`, you learn something about `f(z)`. Not clear from what I wrote though.
....so then the inducing points are actually `v`? Why are `u`/`z` needed then?
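For reference, a toy sketch of the coupling being described (plain AbstractGPs with made-up inputs, not the PR's types): noisy observations of `f(v)` move the posterior over `f(z)` through the prior correlations, which is how a likelihood stated in terms of `f(v)` ends up determining `q(u)` with `u := f(z)`.

```julia
# Toy sketch (not PR code): conditioning on N(y; f(v), S) changes the marginal
# over u := f(z) via the prior covariance between f(v) and f(z).
using AbstractGPs, LinearAlgebra

f = GP(SEKernel())
v = [0.0, 0.5, 1.0]                 # pseudo-input locations
z = [0.25, 0.75]                    # inducing-point locations, u := f(z)
y = [0.3, -0.1, 0.2]                # pseudo-observations
S = Diagonal(fill(0.1, length(v)))  # pseudo-observation covariance

post = posterior(f(v, S), y)        # exact conditioning on the pseudo-observations

@show mean(post(z)) mean(f(z))      # the marginal over f(z) has moved off the prior
```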
    y = approx.likelihood.y
    S = approx.likelihood.S
    v = approx.likelihood.v
    return posterior(AbstractGPs.VFE(f(z, 1e-9)), f(v, S), y)
Could we name all these magic constants floating around? Should they be consistent? Should they be configurable? Here it's 1e-9, above it's zero, below it's 1e-18....
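One way to address this (a sketch with hypothetical names; `default_jitter` and the `jitter` keyword are not in the PR) would be to give the constant a single named, overridable default:

```julia
# Sketch (hypothetical names, not the PR's code): a single named jitter with an
# overridable keyword, instead of scattered literals like 1e-9 / 1e-12.
using AbstractGPs

const default_jitter = 1e-9

function pseudo_obs_posterior(f, z, v, S, y; jitter=default_jitter)
    return posterior(AbstractGPs.VFE(f(z, jitter)), f(v, S), y)
end
```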
approx_centered = SparseVariationalApproximation(
    Centered(), f(z, 1e-12), qu
)
approx_post_centered = posterior(approx_centered)
approx_centered = SparseVariationalApproximation(
    Centered(), f(z, 1e-12), qu
)
Suggested change:
- approx_centered = SparseVariationalApproximation(
-     Centered(), f(z, 1e-12), qu
- )
- approx_post_centered = posterior(approx_centered)
- approx_centered = SparseVariationalApproximation(
-     Centered(), f(z, 1e-12), qu
- )
+ approx_centered = SparseVariationalApproximation(
+     Centered(), f(z, 1e-12), qu
+ )
+ approx_post_centered = posterior(approx_centered)
approx_centered = SparseVariationalApproximation(
    Centered(), f(z, 1e-12), qu
)
Suggested change:
- approx_centered = SparseVariationalApproximation(
-     Centered(), f(z, 1e-12), qu
- )
I'm rather more hesitant about it, because any added code incurs a maintenance burden throughout its future lifetime... so if you'd like to add these, I'd like to be more convinced that they're genuinely useful. Are they easier/faster to optimise? What do you gain from using them?
Whether or not we end up adding these specific parametrisations, I think it would still be good to add the
As discussed on the call the other day, I've implemented a couple of pseudo-observation parametrisations.
The first has been used in a few places now, so I really think that we should support it. The thing I refer to as the "decoupled pseudo-observation" parametrisation is quite non-standard (I've not actually seen it used anywhere before, just been using it in my own work), but it's a really obvious extension, so I figure why not?
I've had to add an additional abstract type to make it possible to have different fields from those in `SparseVariationalApproximation`, which is why there are quite a lot of LOC changes.
Also, I've expanded on the parametrisation explanations, and provided some code examples that actually run and are now CI'd, so they won't go stale.
Keen to know what people make of this.