Rework of how transformations are handled #575

torfjelde · 2024-01-31T18:09:56Z

While working on #555, I ran into the pain that is our current way of handling transformations.

This is partially because we've been slowly adding support for things that wasn't support back in the day, while trying to preserve functionality.

I think it's now time rip out quite a bit of the historical artifacts and rework this.

In DynamicPPL.jl we effectively have a very annoying problem where we can have three different "representation" of a realization:

The "internal representation", i.e. how a AbstractVarInfo stores the realization internally.
The "linked internal representation", i.e. how a AbstractVarInfo stores a "linked" realization internally (this might differ from (1) in dimensionality, e.g. VarInfo, but also type, e.g. SimpleVarInfo).
The "model representation", i.e. how a realization of a variable is expected to be represented when working with it in the model body; this is always decided by the Distribution from which the variable is sampled.

Back in the day when everything was easier, we could define the map from_linked_internal (needed in getindex, assume, etc.) as a simple composition of from_internal (aka reconstruct) and invlink, and similarly for to_linked_internal (needed in setindex!, push!, etc.). But this resulting in multiple bugs once we started working with less convenient distributions, e.g. LKJCholesky where the linked representation is a Vector but the model representation is a Cholesky.

So these days we can't really think about from_linked_internal as a simple composition, but instead has to think about it as an "independent" mapping that can sometimes be represented in the old way not always.

To handle this, we made reconstruct, a function which was meant to take us from "internal representation" to "model representation", also accept the linking transformation, and attempts to construct whatever representation is needed for the pair (invlink_transform(dist), dist). This is okay, but because we have different implementations of AbstractVarInfo, e.g. VarInfo and SimpleVarInfo, each of which have different internal representations, the reconstruct has this behavior where it sometimes does what you expect and sometimes doesn't do anything, and it's difficult to determine exactly when what happens.

This PR removes reconstruct completely, in addition to other related methods, and effectively boils the entire transformation handling down to two mappings:

from_internal_transform(varinfo, vn, dist): construct a transformation that takes us from the internal representation to the model representation.
~~from_internal_transformation(varinfo, vn, dist): construct a transformation that takes us from the internal representation to the model representation.~~
from_linked_internal_transform(varinfo, vn, dist): construct a transformation that takes us from the linked internal representation to the model representation. (correction by @yebai)

Notice that these methods construct a transformation, and we want these transformations to also define InverseFunctions.inverse and ChangesOfVariables.with_logabsdet_jacobian so that we can easily define the to_* mappings and obtain the log-abs-det-jacobian corrections for the transformations easily.

But that's really all we need + now everything is very explicit and clear.

There is a longer exposition on the topic in the docs accompanying this PR (see the docs preview under the "internals" tab) where I discuss why we need certain methods that might at a first glance now seem necessary.

Note that there is one drawback with this approach: we might run into type-instabilities since we're constructing a transform. One would hope that union splitting saves us; indeed that seems to work nicely for our test-suite on more recent Julia versions but not on 1.6. I'm not sure how much we should care about this though because once we move away from all the Selector stuff + old Gibbs sampler #573 , there's really no reason why anyone would every link only a subset of a varinfo, at which point we should always use the immutable link method and make whether or not a varinfo is linked available at compile-time, not runtime.

@devmotion @yebai

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

…ector`

…process

…o this

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

torfjelde · 2024-08-06T21:41:12Z

Aight, so it seems we weren't really testing cases such as

product_distribution(fill(Dirichlet(ones(4)), 2, 3))

(which might be because we didn't have support for this in Bijectors.jl until recently), and so I just added a test for this and now things are failiing 🙃

It turns out there's a bug in Bijectors.logpdf_with_trans for these distributions, so we need to fix this in Bijecotrs.jl. Will open an PR.

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

…ations

test/model.jl

Co-authored-by: Markus Hauru <[email protected]>

torfjelde · 2024-08-14T13:36:02Z

Thanks @mhauru

torfjelde · 2024-08-15T11:14:49Z

Looks like everything is passing nicely:)

mhauru

My only two unresolved comments are very minor, so I'm happy to approve. Likewise @yebai has a few unresolved comments, but he approved as well, so I assume he considers them non-blocking. @torfjelde, good to merge on your side?

mhauru · 2024-08-15T11:42:17Z

src/varinfo.jl

@@ -2005,7 +2026,39 @@ end

 function values_from_metadata(md::Metadata)


Having read the code more carefully, I think I agree with from_internal_transform being the right thing. Could still have a docstring, but optional.

shravanngoswamii · 2024-08-15T11:49:22Z

I will open another PR for converting diagrams images to Mermaid!

mhauru · 2024-08-21T09:26:09Z

The API docs index.html had grown to exceed the threshold size allowed by Documenter by a few kilobytes. I doubled the threshold to 400 KiB. There doesn't seem to be anything wrong with the API page, it's just plain long.

torfjelde · 2024-08-21T09:40:06Z

Does this count the images? Or is it just the HTML?

mhauru · 2024-08-21T10:00:51Z

I'm not sure if it would count the images, but they are small, and not on the page that exceeds the threshold.

mhauru · 2024-08-21T10:08:36Z

Hurray! Great stuff @torfjelde!

torfjelde and others added 30 commits October 31, 2023 23:10

initial implementation of VarNameVector

5af1afa

added some hacky getval and getdist get things to work for VarInfo

8ce53f7

Apply suggestions from code review

fc6a051

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

added arbitrary metadata field as discussed

7cd599d

renamed idcs to varname_to_index

ed0a757

renamed vns to varnames for VarNameVector

4ebd252

added keys impl for Metadata

9f12c9a

added push! and update! for VarNameVector

5a15121

added getindex_raw! and setindex_raw! for VarNameVector

edde2c1

added iterate and convert (for AbstractDict) impls for VarNameV…

ed46002

…ector`

make the key and eltype part of the VarNameVector type

5b00059

added more tests for VarNameVector

bef7e0a

formatting

006ee8d

more testing for VarNameVector

9802811

minor changes to some comments

88b1721

added a bunch more tests for VarNameVector + several bugfixes in the …

ca7b173

…process

formatting

fb01b94

added similar implementation for VarNameVector

9634839

formatting

5179f6f

removed debug statement

9f632bb

made VarInfo slighly more generic wrt. underlying metadata

3c210f7

Merge branch 'master' into torfjelde/varnamevector

8bf6589

fixed incorrect behavior in keys for Metadata

8b2720f

minor style changes to VarNameVector tests

9fa6446

style

0900c57

added testing of update! with smaller sizes and fixed bug related t…

1f7e633

…o this

formatting

8d05586

move functionality related to push! for VarNameVector into push!

7801fe1

Update src/varnamevector.jl

cdc2373

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

Merge branch 'master' into torfjelde/varnamevector

d2d776d

Addeed tests for product of distributions with dynamic support

a7673fd

Apply suggestions from code review

e8d4c96

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

torfjelde mentioned this pull request Aug 6, 2024

Fix for product of Dirichlet TuringLang/Bijectors.jl#322

Merged

mhauru mentioned this pull request Aug 8, 2024

More work on VarNameVector #637

Merged

torfjelde self-assigned this Aug 12, 2024

mhauru added 2 commits August 14, 2024 13:49

Empty commit to trigger CI

d7c224e

Merge remote-tracking branch 'origin/master' into torfjelde/transform…

951ffd5

…ations

mhauru reviewed Aug 14, 2024

View reviewed changes

test/model.jl Outdated Show resolved Hide resolved

Update test/model.jl

a0a8761

Co-authored-by: Markus Hauru <[email protected]>

mhauru approved these changes Aug 15, 2024

View reviewed changes

torfjelde enabled auto-merge August 21, 2024 08:48

torfjelde added this pull request to the merge queue Aug 21, 2024

github-merge-queue bot removed this pull request from the merge queue due to failed status checks Aug 21, 2024

Increase HTML page size threshold for docs

a8812e9

mhauru enabled auto-merge August 21, 2024 09:26

mhauru added this pull request to the merge queue Aug 21, 2024

Merged via the queue into master with commit 138bd40 Aug 21, 2024
11 of 12 checks passed

mhauru deleted the torfjelde/transformations branch August 21, 2024 10:04

yebai mentioned this pull request Aug 21, 2024

Convert doc diagrams to Mermaid #642

Closed

mhauru mentioned this pull request Sep 12, 2024

Mark istrans as inactive #658

Merged

penelopeysm mentioned this pull request Sep 26, 2024

DynamicPPL -> 0.29; Julia -> 1.10; Tapir -> Mooncake TuringLang/Turing.jl#2341

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Rework of how transformations are handled #575

Rework of how transformations are handled #575

torfjelde commented Jan 31, 2024 •

edited by yebai

Loading

torfjelde commented Aug 6, 2024

torfjelde commented Aug 14, 2024

torfjelde commented Aug 15, 2024

mhauru left a comment

mhauru Aug 15, 2024

shravanngoswamii commented Aug 15, 2024

mhauru commented Aug 21, 2024

torfjelde commented Aug 21, 2024

mhauru commented Aug 21, 2024

mhauru commented Aug 21, 2024

		@@ -2005,7 +2026,39 @@ end

		function values_from_metadata(md::Metadata)

Rework of how transformations are handled #575

Rework of how transformations are handled #575

Conversation

torfjelde commented Jan 31, 2024 • edited by yebai Loading

torfjelde commented Aug 6, 2024

torfjelde commented Aug 14, 2024

torfjelde commented Aug 15, 2024

mhauru left a comment

Choose a reason for hiding this comment

mhauru Aug 15, 2024

Choose a reason for hiding this comment

shravanngoswamii commented Aug 15, 2024

mhauru commented Aug 21, 2024

torfjelde commented Aug 21, 2024

mhauru commented Aug 21, 2024

mhauru commented Aug 21, 2024

torfjelde commented Jan 31, 2024 •

edited by yebai

Loading