Schedule callbacks to run asynchronously and in parallel #85

lorenzoh · 2021-07-14T20:10:17Z

Using the callback dependency graph, it's possible to determine which callbacks access what state and which callbacks need to run before.

Hence it should be possible to run callbacks that access unrelated state in parallel or asynchronously (or both). For example, you may have an integration with a experiment tracking backend like Weights and Biases which needs to perform network requests. These could be run in the background as not to slow down the training loop.

In practice, with the information from the dependency graph and whether state access are read/write a Dagger.jl DAG could be constructed and run asynchronously. Callbacks that write state, like ToGPU would still be run synchronously.

There already exists an extension interface for how callbacks are executed in FluxTraining.CallbackExecutor. The default (and so for only) implementation performs a topological sort and executes the callbacks serially.

The text was updated successfully, but these errors were encountered:

lorenzoh added the enhancement New feature or request label Jul 14, 2021

lorenzoh mentioned this issue Jul 14, 2021

Training loop profiler #86

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Schedule callbacks to run asynchronously and in parallel #85

Schedule callbacks to run asynchronously and in parallel #85

lorenzoh commented Jul 14, 2021 •

edited

Loading

Schedule callbacks to run asynchronously and in parallel #85

Schedule callbacks to run asynchronously and in parallel #85

Comments

lorenzoh commented Jul 14, 2021 • edited Loading

lorenzoh commented Jul 14, 2021 •

edited

Loading