This project is carried out as part of the B.Sc. Data Science degree programme at the Fachhochschule Nordwestschweiz.
The following students are working on this project:
Name | GitHub Handle |
---|---|
Patrick Schürmann | @patschue |
Gabriel Torres Gamez | @gabrieltorresgamez |
Project task board with all tasks and issues:
Our Zotero group:
The 15-minute dataset: Link to the dataset
Note: Some notebooks download the dataset automatically.
Our DagsHub organization:
- Clone the repo.
- Run `make reqs` to install the required Python packages.
- Set up the DVC credentials using DagsHub.
- Run `make pull` to pull the data from DagsHub.
- You're ready to start developing!
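After pulling, a quick check can confirm that the data folders are present. The following is a minimal sketch, not part of the repo: the helper name `missing_data_dirs` is hypothetical, and it assumes that the pull populates the `data/raw` and `data/processed` folders listed in the directory layout of this README.

```python
from pathlib import Path

# Folder names taken from the directory layout in this README.
# Assumption: the data pull fills exactly these two folders.
EXPECTED_DATA_DIRS = ("data/raw", "data/processed")


def missing_data_dirs(repo_root="."):
    """Return the expected data folders that do not exist under repo_root."""
    root = Path(repo_root)
    return [d for d in EXPECTED_DATA_DIRS if not (root / d).is_dir()]


if __name__ == "__main__":
    missing = missing_data_dirs()
    if missing:
        print("Missing data folders:", ", ".join(missing))
    else:
        print("All expected data folders are present.")
```

An empty result only shows that the folders exist, not that their contents are complete or up to date.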
The data in this repo is managed via DVC. Here are some useful commands:
- `make pull`: Pulls the data from DagsHub.
- `make relink`: After changes in the DVC folders (data), relinks the files in the repo.
- `make push`: Pushes the data to DagsHub.
Before committing changes to the data folder, make sure to run `make relink` to update the links to the data.
Afterwards, add, commit and push the changes to the repo (using git).
Finally, run `make push` to push the data to DagsHub.
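The order of these steps matters, so it can help to script them. The following is a minimal sketch rather than a tool shipped with this repo: the function names are hypothetical, and it assumes that `data.dvc` is the file git needs to stage after relinking. By default it only prints the commands instead of running them.

```python
import subprocess


def data_update_commands(commit_message):
    """Return the data update commands in the order described above."""
    return [
        ["make", "relink"],
        # Assumption: data.dvc is the file that changes after relinking.
        ["git", "add", "data.dvc"],
        ["git", "commit", "-m", commit_message],
        ["git", "push"],
        ["make", "push"],
    ]


def run_data_update(commit_message, dry_run=True):
    """Print the commands (dry run) or execute them, stopping on the first failure."""
    commands = data_update_commands(commit_message)
    for command in commands:
        if dry_run:
            print(" ".join(command))
        else:
            # check=True raises CalledProcessError if a step fails,
            # so later steps are not run after an error.
            subprocess.run(command, check=True)
    return commands


if __name__ == "__main__":
    run_data_update("Update data")
```

Keeping `dry_run=True` as the default makes it easy to review the sequence before anything is committed or pushed.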
├── .dvc <- DVC Settings, don't touch.
├── data
│ ├── processed <- The final, canonical data sets for modeling.
│ └── raw <- The original, immutable data dump.
├── notebooks <- Jupyter notebooks. Naming convention is a number (for ordering)
│ and a short `-` delimited description, e.g.
│ `01-initial-data-exploration`.
├── references <- Data dictionaries, manuals, and all other explanatory materials.
├── slurm <- Slurm scripts for running the code on the i4Ds cluster.
├── src <- Source code for use in this project.
├── .dvcignore <- Files and directories to ignore by DVC.
├── .gitignore <- Files and directories to ignore by Git.
├── data.dvc <- DVC data/ folder registry.
├── LICENSE <- GNU General Public License v3.0.
├── Makefile <- Makefile with commands.
├── params.yml <- The parameters for the data pipeline.
├── README.md <- The top-level README for developers using this project.
└── requirements.txt <- The requirements file for reproducing the analysis environment.
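
The notebook naming convention noted in the layout above can be checked with a small script. This is a sketch under assumptions, not a rule enforced by the repo: it reads "a number" as one or more digits and the description as lowercase letters or digits separated by single hyphens. The name `is_valid_notebook_name` is hypothetical.

```python
import re

# Assumption: a numeric prefix followed by one or more "-word" parts,
# where each word consists of lowercase letters or digits.
NOTEBOOK_NAME = re.compile(r"\d+(-[a-z0-9]+)+")


def is_valid_notebook_name(stem):
    """Check a notebook name (without the .ipynb extension) against the convention."""
    return NOTEBOOK_NAME.fullmatch(stem) is not None


if __name__ == "__main__":
    for name in ["01-initial-data-exploration", "initial-data-exploration"]:
        print(name, is_valid_notebook_name(name))
```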