Lab | Unsupervised Learning

Introduction

Some people think unsupervised learning is boring because there isn't any specific output to predict or evaluate. Other people, especially the machine learning experts, consider unsupervised learning the future of data science because it resembles how human beings learn. Think how a kid learns what a dog is. Dad and mom don't show 100K animals and tell her which ones are dogs. Rather, the kid will keep encountering dogs in her daily experience and after a number of encounterings she will extract the common features of dogs and recognize new ones.

In unsupervised learning, the classic task is cluster analysis in which you find hidden patterns or groups in data. At most times unsupervised learning tasks are open-ended and you will need to make sense of the data without any clear-defined pathways. But if you keep training yourself and eventually become good at finding pathways out of nowhere, you'll be an established data scientist. This is why you should take unsupervised learning serious.

In this lab, we will present you an unsupervised learning problem without clearly defined goals. Your general objective is to cluster the data and see if you can extract useful insights. But of course we will provide the necessary instructions to help you get started.

Getting Started

Open the main.ipynb file in the your-code directory. Follow the instructions and add your code and explanations as necessary. By the end of this lab, you will have learned how to prepare a dataset for most scikit-learn algorithms.

Deliverables

main.ipynb with your responses.

Submission

Upon completion, add your deliverables to git. Then commit git and push your branch to the remote.

Resources

DBSCAN

The DBSCAN Paper

sklearn.datasets.make_circles

sklearn.datasets.make_moons

Wholesale Customers

The Pareto Principle

Scikit-Learn Standard Scaling

Cluster Analysis External Evaluation

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
your-code		your-code
.DS_Store		.DS_Store
README.md		README.md
Wholesale customers data.csv		Wholesale customers data.csv
acs2015_county_data.csv		acs2015_county_data.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Lab | Unsupervised Learning

Introduction

Getting Started

Deliverables

Submission

Resources

About

Releases

Packages

Languages

Ironhack-Data-Madrid-Abril-2021/lab_unsupervised_learning

Folders and files

Latest commit

History

Repository files navigation

Lab | Unsupervised Learning

Introduction

Getting Started

Deliverables

Submission

Resources

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages