SOM Clustering with KNN Data Imputation

Source code for my undergraduate thesis — Region Grouping in East Java based on Person with Social Welfare Problems using Self-Organizing Maps Algorithm and K-Nearest Neighbors Missing-Value Imputation.

Prerequisites

Git
A *.csv file to be clustered
Conda (This project using Conda as an environment)
A cup of coffee ☕

Set-up app into your machine

Clone this repository into your machine

git clone https://github.com/desenfirman/som-clustering-knn-imputation.git
cd som-clustering-knn-imputation

Set up conda environment for this project.
```
conda env create -f environment.yml
```
Wait for download and installation package completed. Drink-your-coffee. . . ☕
After installation completed, run this command to start a Flask webserver.
```
python runwebserver.py
```
Access localhost:8000 to your browser and you're ready to use this app.

How to use

Open localhost:8000 from your browser
Select your *.csv file that you want to be clustered.

Input a algorithm parameter. In this app you need to input following parameter:

K               = Don't use KNN or use KNN with K = 1 till 7 (recommended value)
Alpha           = 0.1 till 1 (recommended value)
Eta             = 0.1 till 1 (recommended value)
Epoch           = minimum 30 is recommended
Neuron Size     = 3x3, 4x4, 5x5 etc

After all parameter input is filled, click 'Mulai Clustering' to start clustering process.

The result?

As you can see, the app show clustering progress and report alongside cluster visualization from epoch through epoch.

When clustering process is complete, you can see overall Silhouette Coefficient alongside with all member Silhouette Coefficient.

Credit(s)

I don't built a webserver, built an array transformation algorithm or any code that doesn't relevant in my undergraduate thesis from scratch. You can check environment.yml to see what packages I used for this project.

Name		Name	Last commit message	Last commit date
Latest commit History 52 Commits
.vscode		.vscode
application		application
dataset_used		dataset_used
main_algorithm		main_algorithm
tests		tests
.gitignore		.gitignore
.gitlab-ci.yml		.gitlab-ci.yml
LICENSE		LICENSE
README.md		README.md
environment.yml		environment.yml
runserver.py		runserver.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SOM Clustering with KNN Data Imputation

Prerequisites

Set-up app into your machine

How to use

The result?

Credit(s)

About

Releases

Packages

Languages

License

desenfirman/som-clustering-knn-imputation

Folders and files

Latest commit

History

Repository files navigation

SOM Clustering with KNN Data Imputation

Prerequisites

Set-up app into your machine

How to use

The result?

Credit(s)

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages