data.csv contains the hand-picked training examples: 142 in total, of which 71 are controversial and 71 are non-controversial.
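To confirm the 71/71 split before training, a quick count of the labels can help. This is only a sketch: the column name `label` and the helper `count_labels` are assumptions, not taken from the repository, so adjust them to match the actual header row in data.csv.

```python
import csv
from collections import Counter


def count_labels(csv_file, label_column="label"):
    """Count how many rows carry each label.

    csv_file is any open text file (or file-like object) with a header row.
    label_column is a hypothetical column name; change it to the real one.
    """
    reader = csv.DictReader(csv_file)
    counts = Counter(row[label_column] for row in reader)
    return dict(counts)
```

For example, `with open("data.csv", newline="") as f: print(count_labels(f))` should report two classes of equal size if the file matches the description above.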
- Open your terminal and cd to this directory
- Run 'pip install -r requirements.txt'
- Add your OpenAI API key to constants.py
- Choose which model to run by setting choice to the number of the model, e.g. choice = 2 for Naive Bayes:
  - 1: Logistic Regression on Edit Counts
  - 2: Naive Bayes
  - 3: Logistic Regression on Edit Counts and Content
  - 4: Using Controversy Score
  - 5: Using Similarity-Based LLMs
- Set the input path (the path to data.csv) and the output path (where the wordcloud will be saved)
- To generate the wordcloud, set WORDCLOUD = True; otherwise set it to False
- Run 'python3 models.py'
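The settings described in the steps above could look roughly like the following. This is a hedged sketch, not the project's actual code: `choice` and `WORDCLOUD` are named in this README, while `MODELS`, `INPUT_PATH`, `OUTPUT_PATH`, the example paths, and `describe_choice` are hypothetical names used for illustration only.

```python
# Model numbers and names as listed in this README.
MODELS = {
    1: "Logistic Regression on Edit Counts",
    2: "Naive Bayes",
    3: "Logistic Regression on Edit Counts and Content",
    4: "Using Controversy Score",
    5: "Using Similarity-Based LLMs",
}

choice = 2                      # number of the model to run
INPUT_PATH = "data.csv"         # assumed name: path to the training data
OUTPUT_PATH = "wordcloud.png"   # assumed name: where the wordcloud is saved
WORDCLOUD = True                # set to False to skip the wordcloud


def describe_choice(choice):
    """Return the model name for a choice, or raise if it is not 1 to 5."""
    if choice not in MODELS:
        raise ValueError(f"choice must be one of {sorted(MODELS)}, got {choice!r}")
    return MODELS[choice]
```

Validating the number up front, as `describe_choice` does here, gives a clear error for a mistyped choice instead of a failure partway through a run.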