
Reproducing the paper's results #8

Open
vibhavagarwal5 opened this issue May 15, 2020 · 8 comments

@vibhavagarwal5

Hi,

I'm not able to reproduce the numbers reported in the paper for the Predict&Explain model, even when using the same parameters: I get ~76% label accuracy while the paper reports ~84%. Any idea what I might be missing?

Also, the expl_to_label model (predicting the label from the explanation) is not training: it produces a flat accuracy curve of ~33% and just early-stops after epoch 7. Can you help me out with this as well?

@vibhavagarwal5
Author

I'm also getting an out-of-memory error when training the attention model. How much GPU memory did you train with, @OanaMariaCamburu? I'm trying on a 12GB GPU and it doesn't fit.

@OanaMariaCamburu
Owner

Hi,

Sorry to hear that. I also used a 12GB GPU; it filled the memory close to the limit but didn't crash.

Are you able to reproduce the results when you use the py2 version? Maybe there were too many changes at once when porting it to py3. There's clearly a bug somewhere, since you're essentially getting random accuracy (one in three classes) for Expl2Label.

@vibhavagarwal5
Author

I see. I couldn't check with py2 because py2 and the older PyTorch version are now obsolete 😞, which is why I had to move to py3. I did double-check against the py2 code, and my changes seemed independent of the py2/py3 differences, but I'll check again anyway. :/

@vibhavagarwal5
Author

I just checked again; the logic is the same. Beyond that, I don't understand why the expl_to_label model isn't training: it's essentially the same InferSent-style encoder with the same classifier on top. :/
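For reference, the setup being described could be sketched roughly as follows: an InferSent-style BiLSTM encoder with max-pooling over time, followed by an MLP over the three SNLI labels. The class name and dimensions here are illustrative, not the repo's actual code.

```python
import torch
import torch.nn as nn

class Expl2LabelSketch(nn.Module):
    """Hypothetical sketch of an InferSent-style classifier: a BiLSTM
    encoder with max-pooling over time, then an MLP over 3 SNLI labels."""

    def __init__(self, vocab_size, emb_dim=300, hidden_dim=512, n_labels=3):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.encoder = nn.LSTM(emb_dim, hidden_dim, bidirectional=True,
                               batch_first=True)
        self.classifier = nn.Sequential(
            nn.Linear(2 * hidden_dim, hidden_dim),
            nn.Tanh(),
            nn.Linear(hidden_dim, n_labels),
        )

    def forward(self, tokens):                    # tokens: (batch, seq_len)
        h, _ = self.encoder(self.embed(tokens))   # (batch, seq_len, 2*hidden)
        pooled, _ = h.max(dim=1)                  # max-pool over time steps
        return self.classifier(pooled)            # (batch, n_labels) logits

model = Expl2LabelSketch(vocab_size=100)
logits = model(torch.randint(0, 100, (4, 12)))
print(logits.shape)  # torch.Size([4, 3])
```

With three balanced classes, a model that never learns sits at ~33% accuracy, which matches the flat curve reported above.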

@vibhavagarwal5
Author

I suspect something is wrong with the decoder architecture. The explanation loss in the predict2expl model drops for a few epochs and then flattens out, and the generated explanations aren't great either. I may be missing something, but I just ran the code as given.

@OanaMariaCamburu
Owner

The architecture is very straightforward: the encoder is a BiLSTM, the decoder is an LSTM (at training time you feed in the ground-truth tokens, i.e. teacher forcing; at test time you feed back the previously generated tokens), and the loss is the standard cross-entropy. You could try coding it from scratch and only consult the repo for details. I'd suggest starting with the Expl2Label model: given the form of the explanations, it should clearly train to very high accuracy (I got ~97%), so getting 33% on it indicates a fundamental problem. Hope this helps.
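The encoder-decoder just described could be sketched in PyTorch as below. This is a minimal, hypothetical illustration of a BiLSTM encoder, an LSTM decoder trained with teacher forcing, and token-level cross-entropy; the names, bridge layers, and sizes are assumptions, not the repo's actual implementation.

```python
import torch
import torch.nn as nn

class Seq2ExplSketch(nn.Module):
    """Illustrative BiLSTM encoder -> LSTM decoder with teacher forcing."""

    def __init__(self, vocab_size, emb_dim=300, hidden_dim=512):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.encoder = nn.LSTM(emb_dim, hidden_dim, bidirectional=True,
                               batch_first=True)
        # project the concatenated bidirectional states to the decoder size
        self.bridge_h = nn.Linear(2 * hidden_dim, hidden_dim)
        self.bridge_c = nn.Linear(2 * hidden_dim, hidden_dim)
        self.decoder = nn.LSTM(emb_dim, hidden_dim, batch_first=True)
        self.out = nn.Linear(hidden_dim, vocab_size)

    def forward(self, src_tokens, expl_in):
        # encode; h/c have shape (2, batch, hidden) for one BiLSTM layer
        _, (h, c) = self.encoder(self.embed(src_tokens))
        h = torch.tanh(self.bridge_h(torch.cat([h[0], h[1]], -1))).unsqueeze(0)
        c = torch.tanh(self.bridge_c(torch.cat([c[0], c[1]], -1))).unsqueeze(0)
        # teacher forcing: decoder sees the gold explanation shifted right
        dec_out, _ = self.decoder(self.embed(expl_in), (h, c))
        return self.out(dec_out)                  # (batch, seq_len, vocab)

vocab = 100
model = Seq2ExplSketch(vocab)
src = torch.randint(0, vocab, (4, 15))            # premise+hypothesis tokens
expl_in = torch.randint(0, vocab, (4, 10))        # gold tokens, shifted right
expl_tgt = torch.randint(0, vocab, (4, 10))       # gold target tokens
logits = model(src, expl_in)
loss = nn.CrossEntropyLoss()(logits.reshape(-1, vocab), expl_tgt.reshape(-1))
```

At test time the decoder would instead be unrolled step by step, feeding each sampled token back in as the next input.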

@vibhavagarwal5
Author

vibhavagarwal5 commented May 17, 2020

I see. Thanks for the help @OanaMariaCamburu! 😄

@OanaMariaCamburu
Owner

No worries, please reach out if you are unsure about any details in the architecture.
