AWS Data Engineering

Introduction

This repository is an educational example of how to use AWS S3 for staging and storage and how to create a Redshift cluster on AWS. From there, a star schema is built through an ETL process that loads data from S3, mimicking a real data pipeline.

All examples below use the Pagila sample database as the source data, which the ETL process remodels into a star schema.
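To make the idea of a star schema concrete, here is a minimal sketch that uses Python's built-in `sqlite3` module in place of Redshift. The table names (`fact_sales`, `dim_date`, `dim_movie`), their columns, and the sample rows are illustrative assumptions, not the exact schema used in the notebooks.

```python
import sqlite3

# In-memory database standing in for the warehouse (assumption: SQLite, not Redshift).
conn = sqlite3.connect(":memory:")

# Hypothetical, simplified star schema: one fact table referencing two dimensions.
conn.executescript("""
CREATE TABLE dim_date (
    date_key INTEGER PRIMARY KEY,   -- e.g. 20170124
    year     INTEGER NOT NULL,
    month    INTEGER NOT NULL
);
CREATE TABLE dim_movie (
    movie_key INTEGER PRIMARY KEY,
    title     TEXT NOT NULL
);
CREATE TABLE fact_sales (
    sales_key    INTEGER PRIMARY KEY,
    date_key     INTEGER NOT NULL REFERENCES dim_date(date_key),
    movie_key    INTEGER NOT NULL REFERENCES dim_movie(movie_key),
    sales_amount REAL NOT NULL
);
""")

# Made-up sample rows, only for illustration.
conn.executemany("INSERT INTO dim_date VALUES (?, ?, ?)",
                 [(20170124, 2017, 1), (20170215, 2017, 2)])
conn.executemany("INSERT INTO dim_movie VALUES (?, ?)",
                 [(1, "Movie A"), (2, "Movie B")])
conn.executemany(
    "INSERT INTO fact_sales (date_key, movie_key, sales_amount) VALUES (?, ?, ?)",
    [(20170124, 1, 2.99), (20170124, 2, 4.99), (20170215, 1, 0.99)],
)


def revenue_by_month(connection):
    """Aggregate the fact table by joining it to the date dimension."""
    return connection.execute("""
        SELECT d.year, d.month, ROUND(SUM(f.sales_amount), 2)
        FROM fact_sales AS f
        JOIN dim_date AS d ON f.date_key = d.date_key
        GROUP BY d.year, d.month
        ORDER BY d.year, d.month
    """).fetchall()


monthly = revenue_by_month(conn)
print(monthly)
```

The point of the layout is that analytical queries need only one join per dimension, instead of the chain of joins a normalized source schema would require.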

Star Schema Example

Designing OLAP Cubes

Column vs Row Storage

Infrastructure as Code on AWS - Create a Redshift Cluster

Parallel ETL - Load Data to Cluster

Table Design and Optimization
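As a rough illustration of the S3-to-Redshift load step, the sketch below builds a Redshift `COPY` statement as a string. The helper `build_copy_statement`, the table name, the bucket path, and the IAM role ARN are hypothetical placeholders; the notebooks may structure this differently. Pointing `COPY` at a key prefix rather than a single file lets Redshift load the matching files in parallel.

```python
def build_copy_statement(table, s3_prefix, iam_role_arn,
                         region="us-west-2", delimiter=";", gzip=True):
    """Return a Redshift COPY statement that loads every file under an S3 prefix.

    All argument values are supplied by the caller; nothing here talks to AWS.
    """
    options = [
        f"IAM_ROLE '{iam_role_arn}'",
        f"DELIMITER '{delimiter}'",
        f"REGION '{region}'",
    ]
    if gzip:
        # Tell Redshift the input files are gzip-compressed.
        options.append("GZIP")
    return f"COPY {table} FROM '{s3_prefix}'\n" + "\n".join(options) + ";"


# Placeholder values (assumptions), not real resources.
statement = build_copy_statement(
    table="staging_example",
    s3_prefix="s3://example-bucket/data/part",
    iam_role_arn="arn:aws:iam::123456789012:role/exampleRedshiftRole",
)
print(statement)
```

The resulting string would then be executed against the cluster through a SQL connection, for example from a notebook cell.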

  • All projects were sourced from and completed as part of the Udacity Data Engineering Nanodegree.