Coverage vector #3

Waino · 2016-11-25T14:10:38Z

Implement coverage in the attention mechanism, following [1].

[1] Tu, Zhaopeng, et al. "Coverage-based Neural Machine Translation." arXiv preprint arXiv:1601.04811 (2016).
http://arxiv.org/pdf/1601.04811

robertostling · 2016-12-07T10:17:22Z

Perhaps a better (= quicker to implement) start would be to implement the decoding-time coverage normalization method from section 7 of the Google NMT paper. This would only require changing the HNMT code so that it returns attention predictions, and then modifying the beam search code in BNAS to use it.

Waino added the enhancement label Nov 25, 2016

Waino self-assigned this Jan 11, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Coverage vector #3

Coverage vector #3

Waino commented Nov 25, 2016

robertostling commented Dec 7, 2016

Coverage vector #3

Coverage vector #3

Comments

Waino commented Nov 25, 2016

robertostling commented Dec 7, 2016