python tags_spider.py --start 1 --end 5
to spider tags from page 1 to 5 in stackoverflow ranked by popularity.python .\corenlp_chunk_candidate_relations_triples.py
to spider data from tags' page, preprocess text(resolve coreference) and chunk candidate relation triples- After above 2 steps,
- In txts folder you can get
tags.txt
generated by spider tag process - In csvs folder you can get
input_sentence.csv
generated by preprocessing text andcandidate_relation_triples.csv
generated by chunk candidate relation triples
- In txts folder you can get
HDSKG-Chunking
Folders and files
Name | Name | Last commit date | ||
---|---|---|---|---|
parent directory.. | ||||