This project is a collaborative effort by three students from FEUP for the curricular unit CAC (Machine Learning Complements). It applies natural language processing (NLP) and time series analysis to tweets related to COVID-19.
The goal is to analyze and forecast trends in these tweets. The work involves extracting and preprocessing tweet data, performing sentiment analysis, and applying time series forecasting models to predict future trends.
- Collection and preprocessing of tweet data
- Sentiment analysis on tweets
- Time series analysis of tweet trends
- Forecasting future tweet trends related to COVID-19
- Visualization of analysis results
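
The steps listed above can be illustrated with a minimal, standard-library-only sketch. It is not the project's actual pipeline: the sample tweets, the tiny word lists, and the function names (`clean`, `sentiment`, `daily_series`, `moving_average_forecast`) are all hypothetical, and the moving average is a deliberately simple stand-in for the forecasting models the project may use.

```python
import re
from collections import defaultdict
from datetime import date

# Hypothetical sample data; the project's real dataset and schema are not shown here.
TWEETS = [
    (date(2020, 3, 1), "COVID-19 cases rising, this is terrible https://t.co/abc"),
    (date(2020, 3, 1), "@who Great news on the vaccine trial! #covid19"),
    (date(2020, 3, 2), "Lockdown again... bad day"),
    (date(2020, 3, 3), "Feeling good, stay safe everyone #StaySafe"),
    (date(2020, 3, 3), "Terrible queue at the pharmacy"),
    (date(2020, 3, 3), "Good to see recoveries going up"),
]

# Toy lexicon standing in for a real sentiment model (assumption for illustration).
POSITIVE = {"great", "good", "safe"}
NEGATIVE = {"terrible", "bad"}


def clean(text):
    """Lowercase the tweet, strip URLs and @mentions, keep only letter tokens."""
    text = re.sub(r"https?://\S+", " ", text.lower())
    text = re.sub(r"@\w+", " ", text)
    text = re.sub(r"[^a-z\s]", " ", text)  # also drops '#', digits, punctuation
    return text.split()


def sentiment(tokens):
    """Net score: number of positive tokens minus number of negative tokens."""
    return sum(t in POSITIVE for t in tokens) - sum(t in NEGATIVE for t in tokens)


def daily_series(tweets):
    """Aggregate tweets per day into (days, tweet counts, net sentiment scores)."""
    counts = defaultdict(int)
    scores = defaultdict(int)
    for day, text in tweets:
        counts[day] += 1
        scores[day] += sentiment(clean(text))
    days = sorted(counts)
    return days, [counts[d] for d in days], [scores[d] for d in days]


def moving_average_forecast(values, window=2, steps=2):
    """Forecast `steps` points ahead, each as the mean of the last `window` values."""
    if not values:
        raise ValueError("values must not be empty")
    history = list(values)
    forecasts = []
    for _ in range(steps):
        recent = history[-window:]
        nxt = sum(recent) / len(recent)
        forecasts.append(nxt)
        history.append(nxt)  # feed the forecast back in for multi-step prediction
    return forecasts


if __name__ == "__main__":
    days, counts, scores = daily_series(TWEETS)
    print("daily tweet counts:", counts)
    print("daily net sentiment:", scores)
    print("forecast of next two daily counts:", moving_average_forecast(counts))
```

In practice the toy lexicon would likely be replaced by a proper sentiment model and the moving average by a statistical or neural forecasting model, but the data flow (clean, score, aggregate by day, forecast) stays the same.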
- Python
- Pandas
- NumPy
- Scikit-learn
- NLTK (Natural Language Toolkit)
- TensorFlow/Keras
- Matplotlib/Seaborn
- Statsmodels
- Clone the repository:
git clone https://github.com/jogp10/nlp-time-series-covid19.git
- Install the required dependencies:
pip install -r requirements.txt
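
After installing, a quick check like the following can confirm that the libraries are importable. This is only a sketch: the module names are assumed from the technologies listed in this README, and the actual `requirements.txt` may pin different or additional packages. The helper name `check_modules` is hypothetical.

```python
from importlib.util import find_spec

# Import names assumed from the technologies listed in this README.
MODULES = [
    "pandas", "numpy", "sklearn", "nltk",
    "tensorflow", "matplotlib", "seaborn", "statsmodels",
]


def check_modules(names=MODULES):
    """Return {module_name: bool} by locating each module without importing it."""
    return {name: find_spec(name) is not None for name in names}


if __name__ == "__main__":
    for name, ok in check_modules().items():
        print(f"{name:12s} {'ok' if ok else 'MISSING'}")
```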
This project is licensed under the MIT License.