HKUST COMP4641 Course Project
Contributors: Haoran Liang Yu Man Poon Chi Ho Wong
Script use to fix the bad lines in the tweet csv.
Script use to fix the bad lines in the retweet csv data.
txt file use to store keywords. Feel free to play with it by adding new keywords.
Two versions of script to filter out English data in tweet csv.
Script use to filter out keywords from the English tweet csv. Don't use KeyDetect.py, some bugs are unresolved.
Some statistics with the dataset and some bugs. Don't use that.
Fixed bugs from previous retweet network ipynb. Add weights to edges.
Generate co-retweeted network by number of common hashtags.
Generate co-mentioned network by number of common mentions.