Added fertility as a further type of attention coverage.
Added an option for training to keep the initializations of the model via --keep-initializations. When set, the trainer will avoid deleting the params file for the first checkpoint, no matter what --keep-last-params is set to.
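The interaction between --keep-initializations and --keep-last-params can be sketched as a small cleanup rule. This is only an illustration, not the trainer's actual code: the function `params_to_delete` is hypothetical, the "first checkpoint" is assumed to be the lowest-numbered one, and a non-positive `keep_last_params` is assumed to mean "keep everything".

```python
def params_to_delete(checkpoints, keep_last_params, keep_initializations=False):
    """Return the checkpoint numbers whose params files may be removed.

    Hypothetical helper for illustration; names and semantics are assumptions.
    """
    if keep_last_params <= 0:
        # Assumption: a non-positive value disables cleanup entirely.
        return []
    ordered = sorted(checkpoints)
    # Everything except the most recent `keep_last_params` checkpoints is stale.
    stale = ordered[:-keep_last_params]
    if keep_initializations and ordered:
        # Protect the first checkpoint (the initial parameters) from deletion.
        first = ordered[0]
        stale = [c for c in stale if c != first]
    return stale
```

With checkpoints 0 to 4 and `keep_last_params=2`, the sketch marks 0, 1 and 2 for deletion by default, but only 1 and 2 once `keep_initializations` is set.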
[1.18.66]
Fixed
Fixed the set of argument names that are allowed to differ when resuming training.
[1.18.65]
Changed
More informative error message about inconsistent --shared-vocab setting.
[1.18.64]
Added
Added translation sampling via --sample [N]. With this option, the decoder samples the next word of each hypothesis from the target
distribution at each timestep. The optional value N restricts sampling to the top N vocabulary items for each hypothesis at each timestep
(the default is 0, which samples from the entire vocabulary).
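As a rough illustration of what top-N sampling means for a single hypothesis at one timestep, the sketch below draws a vocabulary index from a probability list. It is not the decoder's implementation: `sample_next_token` and its parameters are hypothetical names, and the probabilities are plain Python floats rather than model outputs.

```python
import random


def sample_next_token(probs, top_n=0, rng=random):
    """Sample a vocabulary index from `probs`.

    top_n == 0 samples from the whole vocabulary; top_n > 0 restricts
    sampling to the top_n most probable items. Hypothetical helper.
    """
    candidates = list(range(len(probs)))
    if top_n > 0:
        # Keep only the top_n most probable vocabulary items.
        candidates = sorted(candidates, key=lambda i: probs[i], reverse=True)[:top_n]
    weights = [probs[i] for i in candidates]
    # random.choices treats the weights as relative, so the truncated
    # distribution does not need to be renormalized explicitly.
    return rng.choices(candidates, weights=weights, k=1)[0]
```

Note that with `top_n=1` the sketch always returns the most probable item, so sampling degenerates to a greedy choice.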
[1.18.63]
Changed
The checkpoint decoder and nvidia-smi subprocess are now launched from a forkserver, allowing for a better separation between processes.
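For readers unfamiliar with the forkserver start method, the following is a minimal sketch using Python's standard `multiprocessing` module. It is not the project's launcher code: `square`, `get_forkserver_context` and `run_in_forkserver` are hypothetical names, `square` stands in for real work, and the forkserver method is assumed to be available (it exists only on Unix-like platforms).

```python
import multiprocessing as mp


def square(x, queue):
    # Stand-in for real work such as decoding a checkpoint.
    queue.put(x * x)


def get_forkserver_context():
    # Assumption: running on a platform that supports "forkserver".
    return mp.get_context("forkserver")


def run_in_forkserver(value):
    ctx = get_forkserver_context()
    queue = ctx.Queue()
    # The child is forked from a clean server process, not from the
    # (possibly large, multi-threaded) parent process.
    proc = ctx.Process(target=square, args=(value, queue))
    proc.start()
    result = queue.get()
    proc.join()
    return result


if __name__ == "__main__":
    print(run_in_forkserver(7))
```

Because children are forked from the server process rather than from the parent, they do not inherit the parent's current memory state or threads, which is the kind of separation the entry above refers to.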
[1.18.62]
Added
Added an option to create TranslatorInputs directly from a dict.