Skip to content

Sanikommus/Iris_flower_Dataset

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Iris_flower_Dataset

Given with Iris flower dataset file. The file Iris.csv consists of 150, 4-dimensional data which includes 50 samples from each of the three species of Iris (Iris setose, Iris virginica and Iris versicolor). Column 1 to 4 of the given file are the four features (attributes) that were measured from each sample: the length and the width of the sepals and petals (in centimetres) respectively. Column 5 is the class label (species name) associated with each of the samples of Iris flower.

We had to reduce the data into 2-dimensional data using PCA and then partition (cluster) the reduced dimensional data using different clustering techniques. While performing the PCA, ignore the target column.

Target Attribute was to be used to calculate the Purity Score.

The code:

  • Loads the dataset into the Spyder Enviornment.
  • Imports the Unsupervised Learning Models like k-means, GMM and DBSCAN.
  • We form Clusters and then we label them for each of the model seperately and check the purity score for each model.

Input Dataset

https://www.kaggle.com/datasets/uciml/iris

image

Output

PCA on the dataset :

image

image

k-Means with number of clusters(K) = 3 :

image

k-Means for Different values of K :

image

image

GMM with number of clusters(K) = 3 :

image

GMM with different values of K :

image

image

DBSCAN :

image

image

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages