remove the nltk POS tagger from convert_rst_discourse_tb.py #23

mheilman · 2014-07-24T01:48:20Z

Currently, convert_rst_discourse_tb.py uses NLTK's POS tagger to create flat trees for sentences that are in the RST treebank but not the Penn Treebank. This dependency should eventually be removed and replaced with ZPar.
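For readers unfamiliar with the term, a "flat tree" here presumably means a bracketed parse in which every POS-tagged token hangs directly off a single root node. The sketch below is only an illustration of that idea, not the script's actual code: the function name `flat_tree`, the root label `S`, and the sample tagged tokens are all assumptions. The tagger is deliberately left out, so any source of (word, tag) pairs, NLTK or otherwise, could feed it.

```python
def flat_tree(tagged_tokens, root_label="S"):
    """Build a flat PTB-style bracketed tree from (word, tag) pairs.

    Hypothetical helper for illustration; the root label is an assumption.
    """
    # Escape literal parentheses PTB-style so the bracketing stays balanced.
    escapes = {"(": "-LRB-", ")": "-RRB-"}
    leaves = " ".join(
        "({} {})".format(escapes.get(tag, tag), escapes.get(word, word))
        for word, tag in tagged_tokens
    )
    # Every token is a direct child of the root: no internal phrase structure.
    return "({} {})".format(root_label, leaves)


# Made-up tagger output, standing in for whatever POS tagger is used.
tagged = [("Prices", "NNS"), ("fell", "VBD"), (".", ".")]
print(flat_tree(tagged))
```

Because the helper only consumes (word, tag) pairs, swapping one tagger for another would not change the tree-building step.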

YTZ01 · 2023-10-28T01:57:08Z

Hello, I have a problem running this command: `convert_rst_discourse_tb ~/corpora/rst_discourse_treebank ~/corpora/treebank_3`. Is the PDTB dataset in your setup PDTB-v1 (2019) or PDTB-v2 (2020)? I ask because I downloaded the dataset from LDC, but it doesn't have a 'parsed' file under it, only data, docs, tools, and index.html. Have you run into this issue? @mheilman

mheilman added the enhancement label Jul 24, 2014
