Link: https://cs.stanford.edu/~srijan/hoax/
Paper link: Disinformation on the Web: Impact, Characteristics, and Detection of Wikipedia Hoaxes
Task:
- binary classification - hoax or nonhoax, according to folder structure
This dataset contains publicly available hoax articles and non-hoax articles that were created on Wikipedia.
Important note: Data of this dataset are HTML files with pages from Wikipedia, so they are not analysed as other datasets.