A few puzzles for unsupervised learning / structure discovery

This is a warm-up assignment. It has two purposes: it gives you an idea of how it is to work with unlabeled data, and it gives me an idea on what I can (realistically) expected from the participants.

For this assignment, you will use four artificially generated data sets, each stored as a CSV file.

Each row in the data files corresponds to a data point, and each column corresponds to a feature.

Your task is to discover the structure in these data sets. Each data set contains 2000 data points with an underlying structure. Particularly, the data in all data sets come from multiple multi-variate random variables. In other words, the underlying structure suggests that each data point belongs to a group which is not indicated in the data. You are free to use any method, including plotting and eyeballing the data. If you do not use an automated (machine learning) method for your solution, however, describe which method(s) would be useful, for example, to assign each data point to a sensible group or cluster.

Provide your answer by editing this file, and briefly explaining your solution for each data set. Figures and/or other visual material are welcome. You can provide short code segments inline below, or check in the code as separate files in your repository.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
LICENSE		LICENSE
README.md		README.md
data-set1.csv		data-set1.csv
data-set2.csv		data-set2.csv
data-set3.csv		data-set3.csv
data-set4.csv		data-set4.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

A few puzzles for unsupervised learning / structure discovery

Your solutions

Data set 1

Data set 2

Data set 3

Data set 4

About

Releases

Packages

License

SfS-unsupervisedCL/warmup-puzzle

Folders and files

Latest commit

History

Repository files navigation

A few puzzles for unsupervised learning / structure discovery

Your solutions

Data set 1

Data set 2

Data set 3

Data set 4

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Packages