This project was undertaken as part of the course DDA4260: Networked Life during Term 2, 2022/2023 at The Chinese University of Hong Kong, Shenzhen (CUHK-SZ).
Student ID | Student Name | |
---|---|---|
120040022 | Darren Boesono | [email protected] |
120040002 | Joseph Ariel Christopher Teja | [email protected] |
├── LICENSE
├── Makefile <- Makefile with commands like `make data` or `make train`
├── README.md <- The top-level README for developers using this project.
├── data
│ ├── external <- Data from third party sources.
│ ├── interim <- Intermediate data that has been transformed.
│ ├── processed <- The final, canonical data sets for modeling.
│ └── raw <- The original, immutable data dump.
│
├── docs <- A default Sphinx project; see sphinx-doc.org for details
│
├── models <- Trained and serialized models, model predictions, or model summaries
│
├── notebooks <- Jupyter notebooks. Naming convention is a number (for ordering),
│ the creator's initials, and a short `-` delimited description, e.g.
│ `1.0-jqp-initial-data-exploration`.
│
├── references <- Data dictionaries, manuals, and all other explanatory materials.
│
├── reports <- Generated analysis as HTML, PDF, LaTeX, etc.
│ └── figures <- Generated graphics and figures to be used in reporting
│
├── requirements.txt <- The requirements file for reproducing the analysis environment, e.g.
│ generated with `pip freeze > requirements.txt`
│
├── setup.py <- makes project pip installable (pip install -e .) so src can be imported
├── src <- Source code for use in this project.
│ ├── __init__.py <- Makes src a Python module
│ │
│ ├── data <- Scripts to download or generate data
│ │ └── make_dataset.py
│ │
│ ├── features <- Scripts to turn raw data into features for modeling
│ │ └── build_features.py
│ │
│ ├── models <- Scripts to train models and then use trained models to make
│ │ │ predictions
│ │ ├── predict_model.py
│ │ └── train_model.py
│ │
│ └── visualization <- Scripts to create exploratory and results oriented visualizations
│ └── visualize.py
│
└── tox.ini <- tox file with settings for running tox; see tox.readthedocs.io
Movie recommendation systems are a crucial part of modern streaming platforms like Netflix, Hulu, and Amazon Prime. They help in enhancing the user experience by suggesting movies based on their preferences. This project aims to build a robust recommendation system using two different models:
- Restricted Boltzmann Machine (RBM): A type of artificial neural network used for unsupervised learning. It learns the probability distribution over its set of inputs. The RBM model in this project is visualized below.
-
Baseline Predictor Model: This model predicts a movie's rating based on the global average rating, the movie's average rating, and the user's average rating. The mathematical formula used for prediction is:
$$\hat{r}_{ui} = \mu + b_u + b_i$$ Where:
$\hat{r}_{ui}$ is the the predicted rating for user$\mu$ and movie$i$ .$\mu$ is the global average rating.$b_u$ is the bias associated with user$\mu$ .$b_i$ is the bias associated with movie$i$ .
Ensure you have Python 3 installed on your machine.
-
Clone the repository:
git clone https://github.com/dboesono/Recommendation-Movie.git
-
Navigate to the project directory:
cd Recommendation-Movie
-
Create a virtual environment using the command:
python -m venv env
-
Activate the virtual environment according to your operating system:
-
On Windows:
.\env\Scripts\activate
-
On Unix or MacOS:
source env/bin/activate
-
-
Install the required packages:
pip install -r requirements.txt
After installing the necessary packages, you can run the app using:
streamlit run app.py
This will launch the web app in your default browser. Rate some movies and get personalized recommendations!
Project based on the cookiecutter data science project template. #cookiecutterdatascience