Using Spark Streaming and Spark MLlib (via PySpark) to perform Sentiment Analysis on a dataset of Tweets. Done as part of the Big Data (UE19CS322) course at PES University.
- Logistic Regression
- Naive Bayes
- Linear SVM
- K-Means Clustering
The above 4 models have also been implemented with incremental learning via scikit-learn.