Skip to content

Latest commit

 

History

History
34 lines (26 loc) · 1.33 KB

README.md

File metadata and controls

34 lines (26 loc) · 1.33 KB

CITS3401 Project · Python 3.7.2 MSSQL

Authors

Overview

This project is written for CITS3401 Data Warehousing, Semester 1 2019. It is a two part project, Data Warehousing (40 marks) and Data Mining (20 marks), with 2 bonus marks in Data Mining.

Kaggle's PetFinder Adoption Prediction data set is used for the project. ETL is performed using Python, and concept heirarchies are generated using draw.io.

Execute ETL and Association

$ cd submission/etl
$ python etl.py
$ cd submission/associations
$ python build_associations.py

Dependencies

  • Microsoft SQL Server
  • Microsoft Visual Studio SSDT
  • Microsoft SQL Server Management Studio
  • Python 3
  • Weka
  • PowerBI
  • draw.io

Discretion

These files should be used as a guideline to understanding the project. Copying the code in any way risks Academic Misconduct at the University of Western Australia. Discretion is advised.