Controversy Detection using LLMs

data.csv contains the handpicked training examples. There are 142 examples in total: 71 controversial and 71 non-controversial.
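
To confirm the class balance described above, something like the following sketch could be used. It assumes the labels live in a column named "label"; that column name and the sample rows are hypothetical, since the actual layout of data.csv is not described here.

```python
import csv
import io
from collections import Counter


def count_labels(csv_file, label_column="label"):
    """Count how many rows carry each value of the label column.

    `label_column` is an assumed name; pass the real column name
    used in data.csv.
    """
    reader = csv.DictReader(csv_file)
    return Counter(row[label_column] for row in reader)


# Tiny in-memory stand-in for data.csv (made-up rows, hypothetical columns).
sample = io.StringIO("title,label\nA,1\nB,0\nC,1\nD,0\n")
counts = count_labels(sample)
print(counts)
```

To run it on the real file, open it with open("data.csv", newline="") and pass the file object along with the actual label column name. A balanced dataset should report 71 rows for each class.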

Instructions to run:

Open your terminal and cd to this directory
Run 'pip install -r requirements.txt'
Add your OpenAI API key to constants.py
Choose which model to run by setting choice to the number of the model (for example, choice = 2 for Naive Bayes):
    1: Logistic Regression on Edit Counts
    2: Naive Bayes
    3: Logistic Regression on Edit Counts and Content
    4: Using Controversy Score
    5: Using Similarity-Based LLMs
Add the input path (path to data.csv) and output path (path where the wordcloud will be downloaded)
If you wish to generate the wordcloud, set WORDCLOUD = True, otherwise False
Run 'python3 models.py'
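
The settings above could look roughly like the sketch below. Only choice and WORDCLOUD are names taken from the instructions; OPENAI_API_KEY, INPUT_PATH, OUTPUT_PATH, MODELS and validate_choice are hypothetical names used for illustration, and the real constants.py may be organised differently.

```python
# Hypothetical sketch of the configuration; the real constants.py may differ.

OPENAI_API_KEY = "your-api-key-here"  # assumed name; replace with your own key

# Model numbers and names as listed in the instructions.
MODELS = {
    1: "Logistic Regression on Edit Counts",
    2: "Naive Bayes",
    3: "Logistic Regression on Edit Counts and Content",
    4: "Using Controversy Score",
    5: "Using Similarity-Based LLMs",
}

choice = 2                      # number of the model to run
INPUT_PATH = "data.csv"         # assumed name: path to data.csv
OUTPUT_PATH = "wordcloud.png"   # assumed name: where the wordcloud is saved
WORDCLOUD = False               # set to True to generate the wordcloud


def validate_choice(value):
    """Return the model name for `value`, or raise if it is not 1 to 5."""
    if value not in MODELS:
        raise ValueError(f"choice must be one of {sorted(MODELS)}, got {value!r}")
    return MODELS[value]


print(f"Selected model: {validate_choice(choice)}")
```

Checking the choice before running avoids starting a model run with a number that does not correspond to any of the five options.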
