Commit
Add attention_mask argument to loss_fn() and lm_cross_entropy_loss() and adjust the cross entropy calculation to ignore masked (padding) tokens.
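
For illustration, here is a minimal, framework-free sketch of what ignoring masked tokens in a next-token cross entropy could look like. It is not the code from this commit: the function name `masked_lm_cross_entropy`, the single-sequence list inputs, and the rule that a prediction counts only when both the current and the next position are unmasked are all assumptions made for this example.

```python
import math


def log_softmax(row):
    """Numerically stable log-softmax over one row of logits."""
    m = max(row)
    log_z = m + math.log(sum(math.exp(x - m) for x in row))
    return [x - log_z for x in row]


def masked_lm_cross_entropy(logits, tokens, attention_mask):
    """Mean next-token cross entropy over unmasked positions (hypothetical helper).

    logits:         list of [vocab] rows, one per position
    tokens:         list of token ids, one per position
    attention_mask: list of 0/1 flags, 0 marking padding

    Position i predicts token i + 1. As an assumption of this sketch,
    a prediction is counted only if positions i and i + 1 are both unmasked.
    """
    total = 0.0
    n_counted = 0
    for pos in range(len(tokens) - 1):
        if attention_mask[pos] and attention_mask[pos + 1]:
            total -= log_softmax(logits[pos])[tokens[pos + 1]]
            n_counted += 1
    # Average over counted predictions only, so padding does not dilute the loss.
    return total / n_counted if n_counted else 0.0


# One sequence of three real tokens followed by one padding token.
tokens = [0, 1, 2, 0]
attention_mask = [1, 1, 1, 0]
logits = [
    [2.0, 0.0, 0.0],
    [0.0, 2.0, 0.0],
    [0.0, 0.0, 2.0],
    [5.0, 5.0, 5.0],  # logits at the padding position
]
loss = masked_lm_cross_entropy(logits, tokens, attention_mask)
```

In this sketch only the predictions at positions 0 and 1 contribute: the prediction at position 2 targets the padding token and is skipped, and the sum is divided by the number of counted predictions rather than by the full sequence length.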