The processed dataset can be found at Big Five Personality Test. If you put the dataset under the working folder, please add it to .gitignore. Alternatively, the CSV file is available on DBFS at "/FileStore/tables/data_final-1.csv".
LR&RF.ipynb: Basic data exploration. The main tasks are logistic regression and random forest.
LR_KMcluster.ipynb: K-means clustering.
preprocess_data.ipynb: Preprocessing of the original .csv data.
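
Before running the notebooks, it may help to confirm that the CSV is readable from wherever you placed it. The sketch below is not part of the notebooks; it uses only the Python standard library to detect the delimiter and list the column names. The helper name `read_header` is hypothetical, and the `/dbfs` prefix assumes the cluster exposes DBFS through the local file API, so adjust the path if your setup differs.

```python
import csv
from pathlib import Path

# DBFS location quoted in this README. The "/dbfs" prefix is an assumption:
# it is how DBFS is usually reachable through ordinary Python file APIs.
DBFS_CSV = Path("/dbfs/FileStore/tables/data_final-1.csv")


def read_header(csv_path, sample_chars=4096):
    """Return (delimiter, column_names) for a delimited text file.

    Hypothetical helper for a quick sanity check; it reads only the
    start of the file, so it is cheap even for a large dataset.
    """
    with open(csv_path, newline="", encoding="utf-8") as f:
        sample = f.read(sample_chars)
        f.seek(0)
        # Guess the delimiter from a sample instead of assuming commas.
        dialect = csv.Sniffer().sniff(sample, delimiters=",\t;")
        header = next(csv.reader(f, dialect))
    return dialect.delimiter, header
```

For a local copy, call `read_header("data_final-1.csv")` (or pass whatever path you saved the file under); on the cluster, call `read_header(DBFS_CSV)`.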