The project has been conducted in two steps:
- Performed data mining, data analysis and feature engineering to design a strategy for dealing with missing values and outliers
- Used Spark ML with decision trees model to perform predictions on the rent prices of the houses
Data: https://www.kaggle.com/austinreese/usa-housing-listings
Authors: Célina KHALFAT, Inas KACI