HDSKG-Chunking

The implementation of HDSKG Chunking

Usage

  1. Run python tags_spider.py --start 1 --end 5 to crawl the Stack Overflow tags, ranked by popularity, from page 1 to page 5.
  2. Run python .\corenlp_chunk_candidate_relations_triples.py to crawl the text from each tag's page, preprocess it (resolve coreferences), and chunk candidate relation triples.
  3. After these two steps:
    • The txts folder contains tags.txt, generated by the tag crawl.
    • The csvs folder contains input_sentence.csv, generated by the text preprocessing, and candidate_relation_triples.csv, generated by the candidate relation triple chunking.
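
To check the generated files quickly, a small script along these lines may help. It is only a sketch and not part of this repository: the helper names read_tags and preview_csv are hypothetical, tags.txt is assumed to hold one tag per line, and no column names are assumed for the CSV files (rows are returned as plain lists).

```python
import csv
from pathlib import Path


def read_tags(path):
    """Return the non-empty lines of a tags file.

    Assumption: the file holds one tag per line.
    """
    with open(path, encoding="utf-8") as handle:
        return [line.strip() for line in handle if line.strip()]


def preview_csv(path, limit=5):
    """Return up to `limit` rows of a CSV file as lists of strings.

    No column layout is assumed; the first row may or may not be a header.
    """
    rows = []
    with open(path, newline="", encoding="utf-8") as handle:
        for row in csv.reader(handle):
            if len(rows) >= limit:
                break
            rows.append(row)
    return rows


if __name__ == "__main__":
    # Folder and file names are the ones listed in the Usage section.
    tags_file = Path("txts") / "tags.txt"
    if tags_file.exists():
        print(len(read_tags(tags_file)), "tags")

    for name in ("input_sentence.csv", "candidate_relation_triples.csv"):
        csv_file = Path("csvs") / name
        if csv_file.exists():
            print(name)
            for row in preview_csv(csv_file):
                print("  ", row)
```

Run it from the repository root after both steps have finished; files that do not exist yet are skipped silently.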