Women in Polictics and Misogynistic Twitter Mentions

Objective

Classify tweets containing explicit and implicit misogynistic content
Identify the themes underpinning this content

Methodology

Scrape tweets tagging women in Congress and Senate in the week leading up to the 2020 elections.
Use BERT to classify misogynistic and non-misogynistic twitter mentions
Compare both misogynistic and non-misogynistic tweets
Perform unsupervised classification (LDA Topic Model) to identify themes in non-misogynistic tweets
Communicate results in an interactive dashboard

Data

I scraped over 400,000 twitter mentions for 146 female senators. A sample of the scraped tweets can be found in "final_data.csv" and the larger file (which was too large to upload on github without git lfs) can be found at https://drive.google.com/file/d/1HLWQuaJzQwqlYYGOHjspKyiQwPAwQK-T/view?usp=sharing. We also used data from Congress.gov and GovTrack USA to conduct further analysis. All the data used for this project can be found under the "Data" folder, and the raw urls (manually compiled) used for getting twitter handles of female senators, and other data can be found under the folder "Scraping tweets -data+code". The trained topic models cab be found under "ldamodels".

Code

The code used for the models and data analysis and their respective folders can be found as mentioned below.

Scraping twitter mentions: Scraping_tweets.ipynb in the folder "Scraping tweets- data+code. The file might be slightly different from what was shown in the video to increase readability. However, only markdowns and comments were changed, and not the actual code.
BERT model : BERT_Classifier_Final.ipynb
LDA Topic Model : Topic_Model_Final.ipynb
Metadata Cleaning, Word cloud, and Linear Regression : Wordcloud_Regression.ipynb
Dashboard and data visualization : Visualization.ipynb
Functions used in Wordcloud_Regression.ipynb : util.py
Files for Heroku App hosting : requirements.txt, app.py, runtime.txt, Procfile, .gitignore, assets

Visualization Link

https://twittermisogyny.herokuapp.com/ (currently under construction)

Packages

Packages required to run each notebook.

Scraping data from Twitter: (Twint, 2.1.21), (pandas, 1.2.4). In order to be able to scrape data for more than one day using twint, one would have to uncomment line 92 from the url.py file of the downloaded package. This can be done by typing "pip3 show twint" in the terminal, changing the directory to that path, and editing the url.py file using nano, vim or any suitable text editor.
BERT : (pandas, 1.3.5), (numpy, 1.21.5), (keras, 2.8.0), (tensorflow, 2.8.0), (nltk, 3.2.5), (sklearn, 1.0.2)
Topic Model: (gensim, 4.1.2), (sklearn, 0.23.2), (numpy, 1.19.2), (pandas, 1.1.3), (nltk, 3.5), (wordcloud, 1.8.1), (pyLDAvis, 3.3.1), (statsmodels, 0.13.2), (nltk, 3.6.7),
Metdata cleaning, Word Cloud and Linear Regression: (pandas, 1.4.1), (seaborn, 0.11.2), (matplotlib, 3.5.1), (gensim, 4.1.2), sklearn(1.0.2), (numpy, 1.21.5)
Data Visualization : (pandas, 1.4.1), (numpy, 1.20.3), (dash, 2.2.0), (base64, 1.0.0), (plotly, 5.6.0), (dash-bootstrap-components, 1.0.3), (dash-core- components, 2.0.0), (dash-html-components, 2.0.0),(dash-table, 5.0.0)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Women in Polictics and Misogynistic Twitter Mentions

Objective

Methodology

Data

Code

Visualization Link

Packages

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
Data		Data
Scraping tweets- data+code		Scraping tweets- data+code
assets		assets
env		env
ldamodels		ldamodels
BERT_Classifier_Final.ipynb		BERT_Classifier_Final.ipynb
Procfile		Procfile
README.md		README.md
Topic_Model_Final.ipynb		Topic_Model_Final.ipynb
Visualization.ipynb		Visualization.ipynb
Wordcloud_Regression.ipynb		Wordcloud_Regression.ipynb
app.py		app.py
requirements.txt		requirements.txt
runtime.txt		runtime.txt
util.py		util.py

atowey-uchi/misogynistic_tweets

Folders and files

Latest commit

History

Repository files navigation

Women in Polictics and Misogynistic Twitter Mentions

Objective

Methodology

Data

Code

Visualization Link

Packages

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages