Skip to content

Latest commit

 

History

History
23 lines (19 loc) · 1.87 KB

README.md

File metadata and controls

23 lines (19 loc) · 1.87 KB

Data Engineer Ass1_Postgres

The Assignment requires a postgres database sparkifydb for a music app, Sparkify. The database is a start schame that has 1 fact table and 4 dimension tables

Fact Table

songplays - records in log data associated with song plays i.e. records with page NextSong
songplay_id, start_time, user_id, level, song_id, artist_id, session_id, location, user_agent

Dimension Tables

users - users in the app
user_id, first_name, last_name, gender, level
songs - songs in music database
song_id, title, artist_id, year, duration
artists - artists in music database
artist_id, name, location, latitude, longitude
time - timestamps of records in songplays broken down into specific units
start_time, hour, day, week, month, year, weekday

Run Python Scripts

  • Create tables python create_table.py

  • Execute ETL python etl.py