Skip to content

loftiskg/distributed_spectral_clustering

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Distributed Spectral Cluster with PySpark

Run python main.py at the terminal to apply clustering to both testing datasets int ./data. This will output a scatter plot into ./figures/spectral_cluster_test.png which will show a scatter plot of the test datasets after clustering.

To regenerate the test dataset run python generate_test_data.py from the commandline.

The code for the clustering algorithm lives in spectral.py.

About

A distributed implementation of Spectral clustering in PySpark

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages