AWS Data Engineering

Introduction

This repository is an educational example of how to use AWS S3 for staging/storage and creatinga Redshift cluster on AWS. From here, a STAR schema is designed by using an ETL process from S3 to mimick a real data pipleine.

All examples below use the Pagilla Schema to model the STAR schema using an ETL process.

Star Schema example

Designing OLAP Cubes

Column vs Row Storage

Infastructure as Code on AWS - Create a Redshift Cluster

Parallel ETL - Load Data to Cluster

Table Design and Optimization

All projects were sourced and completed under Udacity Dats Engineering Nanodegree

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

AWS Data Engineering

Introduction

Files

README.md

Latest commit

History

README.md

File metadata and controls

AWS Data Engineering

Introduction