right-branching result do not match with that presented in the paper #3

SyahX · 2019-01-26T09:03:44Z

Hi,
I am trying to compute right-branching and upper bound baselines on the WSJ10 dataset. When I use the code in prpn to evaluation, I get 56.68 (right-branching) and 84.06 (upper bound), different from 61.7(RBranch) 88.1(upper bound) in the paper. But when I use EvalB to do the evaluation, I get 61.7(RBranch) 88.1(upper bound).
So is that mean the way to evaluate the right-branching structure is different from prpn model? Could you please show the right way to compute right-branching and upper bound baselines on the WSJ10 dataset?

Thanks,
Yunfan

yoonkim · 2019-02-08T22:51:57Z

I think the discrepancy is due to sentence-level F1 (adopted by PRPN) vs corpus-level F1 (adopted by EVALB and previous works). Thus the numbers are not exactly comparable, though they are in the general ballpark. There does seem to be a lack of consistency across grammar induction papers (preprocessing, evaluation metric, including sentence-level span vs not, etc.) to make inter-paper comparison difficult to say the least.

Quick question, what did you get for the right branching baselines on the entire dataset?

SyahX · 2019-02-16T06:30:55Z

40.348, right branching baseline for WSJ40

yikangshen · 2019-02-17T21:53:07Z

Thanks for pointing this out. I am using the baseline results from previous papers. Could you push the code for evaluation with EvalB?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

right-branching result do not match with that presented in the paper #3

right-branching result do not match with that presented in the paper #3

SyahX commented Jan 26, 2019

yoonkim commented Feb 8, 2019

SyahX commented Feb 16, 2019

yikangshen commented Feb 17, 2019

right-branching result do not match with that presented in the paper #3

right-branching result do not match with that presented in the paper #3

Comments

SyahX commented Jan 26, 2019

yoonkim commented Feb 8, 2019

SyahX commented Feb 16, 2019

yikangshen commented Feb 17, 2019