Reverse-over-forward AD #162

Open. Wants to merge 28 commits into base: master.
Conversation

@jrmaddison (Contributor) commented Jul 10, 2024

Reverse-over-forward AD.

The usage

u = [initialize forward variable]
u.block_variable.tlm_value = [initialize tangent-linear variable]

continue_annotation()
continue_reverse_over_forward()
...
J = [functional]
pause_annotation()
pause_reverse_over_forward()

leads to tangent-linear operations being recorded on the tape, allowing a higher-order adjoint calculation via, e.g.,

hessian_action = compute_gradient(J.block_variable.tlm_value, Control(u))

The primary advantage is that this allows checkpointing at higher order. The primary disadvantage is that computing multiple Hessian actions requires rerunning the forward and first-order adjoint calculations.
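For intuition, here is a minimal self-contained sketch of the reverse-over-forward idea using a toy scalar tape (this is not pyadjoint; `Var` and `backward` are invented for illustration). The tangent-linear computation is itself built from taped operations, so running reverse-mode AD over the tangent output yields a Hessian action:

```python
# Toy sketch of reverse-over-forward AD (hand-rolled tape, not pyadjoint).

class Var:
    """A taped scalar: records the operations that produced it."""
    def __init__(self, value, parents=(), grads=()):
        self.value = value
        self.parents = parents  # input Vars
        self.grads = grads      # local partial derivatives w.r.t. parents
        self.adj = 0.0          # adjoint (reverse-mode) accumulator

    def __mul__(self, other):
        other = other if isinstance(other, Var) else Var(other)
        return Var(self.value * other.value,
                   parents=(self, other),
                   grads=(other.value, self.value))
    __rmul__ = __mul__

def backward(out):
    """Reverse pass: propagate adjoints from `out` back to the leaves."""
    order, seen = [], set()
    def visit(v):
        if id(v) not in seen:
            seen.add(id(v))
            for p in v.parents:
                visit(p)
            order.append(v)
    visit(out)
    out.adj = 1.0
    for v in reversed(order):
        for p, g in zip(v.parents, v.grads):
            p.adj += v.adj * g

u = Var(2.0)       # forward variable
u_dot = Var(1.0)   # tangent-linear direction (tlm_value, in pyadjoint terms)

# Forward: J = u**3. Tangent-linear: J_dot = 3*u**2*u_dot, written here by
# hand but built from *taped* multiplications, which is what
# reverse-over-forward records automatically.
J_dot = 3.0 * u * u * u_dot

backward(J_dot)    # reverse over the tangent-linear computation
# u.adj now holds the Hessian action d/du (dJ/du . u_dot) = 6*u*u_dot = 12.0
```

Here a single reverse pass over the recorded tangent-linear operations gives one Hessian action; a second action in a new direction would require rebuilding the tangent tape, matching the disadvantage noted above.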

API changes:

  • Add reverse-over-forward controls reverse_over_forward_enabled, no_reverse_over_forward (decorator), stop_reverse_over_forward (context manager), pause_reverse_over_forward, and continue_reverse_over_forward.
  • Add optional n_outputs argument to the Block constructor. This is currently required only for reverse-over-forward AD, and is used to trigger tangent-linear operations.
  • Add Block.solve_tlm method, for performing differentiable tangent-linear operations.
  • Add OverloadedType._ad_assign for in-place assignment. This is used to temporarily reset forward variables to the required (input) values before performing tangent-linear operations.
  • Add PosBlock, used by AdjFloat.__pos__.
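To illustrate the `solve_tlm` and `n_outputs` additions, here is a hypothetical, heavily simplified block (the names loosely follow this PR's description; the real pyadjoint signatures may differ). The key property is that the tangent output is produced using the same style of differentiable operations as the forward model, so it can itself be recorded on the tape:

```python
# Hypothetical sketch only: real pyadjoint Block signatures differ.

class SquareBlock:
    # Per the PR, blocks declare an output count so the tape knows when
    # to trigger tangent-linear operations.
    n_outputs = 1

    def __init__(self, x):
        self.x = x

    def solve(self):
        # Forward operation: y = x**2
        return self.x ** 2

    def solve_tlm(self, x_dot):
        # Tangent-linear operation: y_dot = 2*x*x_dot. If x and x_dot were
        # overloaded types, this line would itself be recorded on the tape,
        # which is what makes reverse-over-forward possible.
        return 2 * self.x * x_dot

block = SquareBlock(3.0)
y = block.solve()             # 9.0
y_dot = block.solve_tlm(0.5)  # directional derivative: 2*3*0.5 = 3.0
```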

Limitations:

  • Not added for NumpyArraySliceBlock, as that block lacks tangent-linear methods.
  • In principle, higher-order calculations with all tangent-linear directions equal are possible via this approach, but this would need to be used quite carefully. Avoiding inefficiency in the (likely much more common) second-order case would also add significant complexity. Higher-order taping is therefore disabled (around the solve_tlm call in Block.add_output). Higher-order calculations with different directions would require multiple tangent-linear values (multiple values of tlm_value for a single BlockVariable) and would likely require much more extensive changes.
  • Functionality has been added for AdjFloat operations, but this can lead to a large number of operations appearing on the tape (the usual symbolic-differentiation scaling problem, here appearing on the pyadjoint tape). The complexity could perhaps be moved from the tape into SymPy expressions (e.g. via a port of the tlm_adjoint FloatEquation tape object).
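A rough illustration of the float-operation scaling concern (a toy tape, not pyadjoint): taping each elementary operation individually records one tangent entry per primal entry, whereas consolidating a whole expression into a single tape object (as tlm_adjoint's FloatEquation does symbolically) records just one entry.

```python
# Toy illustration of tape growth under reverse-over-forward for float ops.

per_op_tape = []

def taped_mul(a, b):
    per_op_tape.append("mul")      # primal operation recorded on the tape
    per_op_tape.append("mul_tlm")  # its tangent-linear counterpart
    return a * b

x = 2.0
y = x
for _ in range(10):
    y = taped_mul(y, x)            # y = x**11, built from 10 multiplications

# Op-by-op taping: 10 primal + 10 tangent entries.
# A consolidated symbolic approach would record a single tape entry instead:
consolidated_tape = ["FloatEquation(y = x**11)"]  # hypothetical single entry
```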

@jrmaddison jrmaddison marked this pull request as ready for review July 11, 2024 17:11