This is a repository created by Norman Lee for the Fall 2024 semester of Wildlife 553, taught by Prof. Simona Picardi.
Date created: 2024-09-03
The main goals of this repository are:
- connect several datasets related to e-resource usage:
- e-resource usage
- enrollment by college
- research outputs produced by college
- grant expenditures by college
- e-resource college relevance
- make those datasets ready for future analysis
- connect them together via a combination of two keys: (college + fiscal year AND college relevance)
- clean the data:
- convert so that all data is at college level and fiscal year
- generate primary id's
- generate/simulate data as needed
- record a reproducible workflow for generating a clean datasets
- begin some forms of data exploration, a priori power analysis