This is a template for Python Projects in the Goldenberg Lab. There are three folders:
- data All of the data stored in this repo should be located in this folder. There are subfolders for raw data and processed data.
- processing All of the code designed for processing should be saved in this folder.
- analysis should be used for analysis of processed data.
Your workflow should be to:
- Place raw data in
data/raw
. Use.gitignore
to exclude all data, raw and processed, from Github! You should never upload any remotely sensitive data on Github. - Write and run code in
processing
that takes in this raw data, does all necessary cleaning, combining, and processing, and writes processed data file(s) todata/processed
. Again, this file should be ignored by Git. - Conduct your analysis, ideally in a Jupyter notebook, in
analysis
, reading in the processed data file. Visualizations should go inanalysis/img
, if you'd like to store them in the Github repo. (You may not need to include the images themselves, but code to produce any visualizations should be included in your analysis.)
This is a template for Python Projects in Goldenberg Lab. To use this template, please:
- Install Python and a version of conda (Anaconda or Miniconda). We recommend starting with the latest versions of each.
- You should run all of your code in a dedicated conda environment -- see this guide on using conda! When you're done, include the
environment.yml
in the root of this repo for others to use.
Use this template for the first time (if you are not replicating/ adding on to an existing analysis)
- Choose the repository's username/organization name.
- Please set the owner of your analysis template to our lab (
GoldenbergLab
). - All analysis repositories should start as public, unless indicated otherwise.
- Please set the owner of your analysis template to our lab (
- Name the repository following the lab naming convention. Full guide on repository naming conventions can be found here. In short, github repositories are following this naming convention:
project-name-analysis
. So if your project is about counting kittens, your repository name iscounting-kittens-analysis
. - Add a description of your project. Please include:
- Project Name
- Date of repository creation
- Your name, and the names of other who worked on it
- The purpose of the project and the main question you asked
- The source of the data for the analysis (Prolific, MTURK, Qualtrics, etc.)