parkingSegmentation

parking users segmentation from the parking system operational log

Preprocess.py Adds new variables to original log dataset Input file: history_origin.csv Output file: history_2022_02_preprocessed_.csv
Exploratory_history_22_02.py Cleans the data and creates Users dataset Input file: history_2022_02_preprocessed_.csv Output files: users_2022_02.csv + history_2022_02_to_powerbi.csv
Exploratory_users_22_02.py Cleans the data and does some exploratory analysis Overwrites the file users_2022_02.csv Input file: users_2022_02.csv
scale_and_pca.py Scales data and applies Principal Component Analysis Input file: users_2022_02.csv Output file: explained_var.csv
kmeans.py Clustering with KMeans from sklearn.cluster package Input file: users_feb_pca.csv Output files: users_2022_2_labeled.csv + centers_kmeans6.csv
hierarchical.py Clustering with AgglomerativeClustering from sklearn.cluster package Input file: users_2022_2_labeled.csv Output file: users_2022_2_labeled.csv
som.py Clustering with minisom from MiniSom package Input file: users_2022_2_labeled.csv Output files: users_2022_2_labeled.csv + centers_som7.csv
fraud.py Anomaly detection using SOM Maps Input file: users_2022_2_labeled.csv Output file: users_fraud_20.csv

Also uploaded DBSCAN and Gaussian Mixture models code, although they don't fit the data

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
data		data
DBScan.py		DBScan.py
LICENSE		LICENSE
README.md		README.md
exploratory_history_22_02.py		exploratory_history_22_02.py
exploratory_users_22_02.py		exploratory_users_22_02.py
fraud.py		fraud.py
gaussian mixture.py		gaussian mixture.py
hierarchical.py		hierarchical.py
kmeans.py		kmeans.py
preprocess.py		preprocess.py
requirements.txt		requirements.txt
scale and pca.py		scale and pca.py
som.py		som.py

Provide feedback