Skip to content

Commit

Permalink
Update rfc/system/5598-deterministic-errors-distributed-training.md
Browse files Browse the repository at this point in the history
Signed-off-by: Fabio M. Graetz, Ph.D. <[email protected]>
  • Loading branch information
fg91 authored Jul 29, 2024
1 parent 68a623a commit b3add89
Showing 1 changed file with 1 addition and 1 deletion.
Original file line number Diff line number Diff line change
Expand Up @@ -78,7 +78,7 @@ We don't see any drawbacks to making the error handling of distributed training

## 6 Alternatives

*What are other ways of achieving the same outcome?*
A poor man's version would be to not override the error file if it already exists. While this is a worse solution than proposed above as there still is a race condition, this would still better than the current behavior because at least we would *favor* earlier errors instead of later ones.

## 7 Potential Impact and Dependencies

Expand Down

0 comments on commit b3add89

Please sign in to comment.