This repository contains code and data for the paper: WebKE: Knowledge Triple Extraction from Semi-structured Web with Pre-trained Markup Language Models. Chenhao Xie, Wenhao Huang, Jiaqing Liang, Chengsong Huang and Yanghua Xiao. CIKM. 2021. [doi] [pdf]
pretrained_model/
contains the pretrained model HTMLBERT.
webke/
contains all code and data.
@inproceedings{xie2021webke,
title={WebKE: Knowledge Triple Extraction from Semi-structured Web with Pre-trained Markup Language Models.},
author={Xie, Chenhao and Huang, Wenhao and Liang, Jiaqing and Huang, Chengsong and Xiao, Yanghua},
booktitle={Proceedings of the 30th ACM International Conference on Information and Knowledge Management (CIKM)},
year={2021}
}