ETL-Project - Monash Data Analytics Bootcamp

Background

We are interested in looking at the specific locations of fast food chains in the US and various socio-economic measures like median level of income, unemployment rate etc. in those very locations. By transforming two different datasets that have a common point: Zip Code Tabulation Area (ZCTA), we hope that our database will allow analysts to draw insights on the potential link between low-socio economic status communities, the location of fast food chains as well as obesity levels across the US.

Project Report

For further details in any of the following steps and our project potentials and limitations, please refer to our report and the notebooks for each part.

Extract

1/ US Census Bureau Demographic Data

Use census API wrapper to retrieve data from the American Community Survey 5-Year Data (2009-2018) based on zip code tabulation area (zcta). Please refer to our notebook.

2/ Fast Food Restaurants Across America

This dataset was extracted from Kaggle and it came in the form of a downloadable CSV. Please refer to our notebook.

3/ Zip Code to ZCTA Cross Walk

This dataset was extracted from UDS Mapper and it came in the form of a downloadable CSV. Please refer to our notebook.

All of the input csv files can be found here.

Transform

Please refer to the following notebooks:

After our data analysis and transformation, we come up with this ERD and schema before loading data to the PostgreSQL database.

Load

Please refer to our notebook.

We make no claims as the ownership of the data. Hence, please do what you'd love with the data but credit the appropriate people.

Name		Name	Last commit message	Last commit date
Latest commit History 93 Commits
00_input		00_input
01_extract_census		01_extract_census
01_extract_restaurant		01_extract_restaurant
01_extract_zip_zcta		01_extract_zip_zcta
02_transform_census		02_transform_census
02_transform_restaurant		02_transform_restaurant
02_transform_zip_zcta		02_transform_zip_zcta
03_load		03_load
report		report
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ETL-Project - Monash Data Analytics Bootcamp

Background

Project Report

Extract

Transform

Load

About

Releases

Packages

Contributors 4

Languages

poojaisabelle/ETL-Project

Folders and files

Latest commit

History

Repository files navigation

ETL-Project - Monash Data Analytics Bootcamp

Background

Project Report

Extract

Transform

Load

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages