
Separate forward/backward pass #72

Open
Ebanflo42 opened this issue Apr 4, 2024 · 2 comments

Comments

@Ebanflo42
Contributor

We need to carefully design the API such that the user has access to both an executable that returns the desired diff calls (for training) and an executable that returns everything except that (for testing).

This is part of a larger array of issues that will emerge from the need to embed contexts in other contexts (for example, separating the optimizer step, or designing recurrent architectures). In this case it might make sense to let the user build a forward-pass context that takes neither labels nor output gradients, then clone that context and recover all desired node identifiers in order to build a second context that takes both inputs and labels and outputs predictions, loss, and gradients. Both executables can then be used separately.
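The cloning pattern described above can be sketched with a toy graph context. Everything here is hypothetical (`Context`, `clone`, `compile`, and the node-id handling are illustrative stand-ins, not this project's actual API); the point is that node identifiers from the forward context remain valid in the clone, so the training context can extend the same graph with loss and gradient nodes.

```python
from copy import deepcopy

class Context:
    """Toy computation-graph context; all names are hypothetical."""
    def __init__(self):
        self.nodes = {}   # node id -> (op, input ids)
        self.counter = 0

    def add(self, op, *inputs):
        nid = self.counter
        self.counter += 1
        self.nodes[nid] = (op, inputs)
        return nid

    def clone(self):
        c = Context()
        c.nodes = deepcopy(self.nodes)
        c.counter = self.counter
        return c

    def compile(self, outputs):
        # Return an "executable" closing over the requested output ids.
        def executable(feed):
            cache = dict(feed)
            def eval_node(nid):
                if nid in cache:
                    return cache[nid]
                op, inputs = self.nodes[nid]
                cache[nid] = op(*(eval_node(i) for i in inputs))
                return cache[nid]
            return [eval_node(o) for o in outputs]
        return executable

# Forward-pass context: no labels, no output gradients.
fwd = Context()
x = fwd.add(lambda: None)              # input placeholder
w = fwd.add(lambda: None)              # parameter placeholder
pred = fwd.add(lambda a, b: a * b, x, w)
infer = fwd.compile([pred])            # testing executable

# Clone it, then extend with loss and a gradient node
# (standing in for the diff calls).
train_ctx = fwd.clone()
y = train_ctx.add(lambda: None)        # label placeholder
loss = train_ctx.add(lambda p, t: (p - t) ** 2, pred, y)
grad_w = train_ctx.add(lambda p, t, a: 2 * (p - t) * a, pred, y, x)
train = train_ctx.compile([pred, loss, grad_w])  # training executable

print(infer({x: 3.0, w: 2.0}))             # [6.0]
print(train({x: 3.0, w: 2.0, y: 5.0}))     # [6.0, 1.0, 6.0]
```

Because the clone copies the node table, the forward executable is unaffected by anything added to the training context afterwards.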

@Ebanflo42
Contributor Author

It would also make sense to add gradient clipping as part of the solution to this issue.

@Ebanflo42
Contributor Author

Scratch that: gradient clipping should be a separate feature in the core autodiff engine. I don't know why I was thinking these things are related.
