Claim Verification Using Generated Data by ChatGPT and Wikipedia

Claim verification is the task of predicting a text's truthfulness. Verification models are often trained on manually created datasets which are expensive and time consuming to create. In this project, a verification model was trained on generated data. The data was generated by feeding scraped Wikipedia articles together with instructions to generate a true or false statement in a prompt to ChatGPT 3 turbo (via API). The articles were used as evidence, providing a source of truth for the model. The ChatGPT response was used as claims, either a true statement meaning it aligns with the evidence, or a false statement, meaning it contradicts the evidence. The evidence and claim were then embedded by using BERT-small. A neural network containing bidirectional LSTM layers was trained to distinguish between false and true claims. The model was able to classify the validation samples with a macro average F1-score of $0.80$. After evaluating the model using custom inputs it was found that it is sensitive to the term 'not'. The reason for this could be due to an overrepresentation of the term 'not' in the false claims.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
1. Scrape wikipedia		1. Scrape wikipedia
2. Generate statements		2. Generate statements
3. Fact verification model		3. Fact verification model
4. Evaluation		4. Evaluation
Claim Verification Using Generated Data by ChatGPT and Wikipedia.pdf		Claim Verification Using Generated Data by ChatGPT and Wikipedia.pdf
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Claim Verification Using Generated Data by ChatGPT and Wikipedia

About

Releases

Packages

Languages

Leonnorblad/fact-verification

Folders and files

Latest commit

History

Repository files navigation

Claim Verification Using Generated Data by ChatGPT and Wikipedia

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages