Skip to content

bbuck1/clio2-eda

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Exploratory data analysis assignment

For this assignment you should follow these steps.

  1. Fork this repository on GitHub, then clone it to your own computer.

  2. Find a suitable quantitative dataset to analyze. This should be a primarily quantitative dataset. It would be best if you used a dataset that you cared about, but some suggested datasets include these:

  1. Add your data to this repository as a CSV file. If it is more than one file, it should probably go in a data/ directory.

  2. Create an RMarkdown file in this directory in which you will do your analysis. Name it something sensible like analysis.Rmd. This file should read in your data using the read_csv() function in the readr package.

  3. Using both prose and code, create an exploratory data analysis of your dataset. Use the techniques in Grolemund and Wickham as well as in Peng to figure out what the dataset is, what could be learned from it, and what potential pitfalls there are in the data.

  4. Edit your document to get rid of visualizations and prose that proved not to be useful.

  5. Knit the document to HTML.

  6. Submit a URL to your HTML document online as well as the URL to your GitHub repository.

About

Exploratory data analysis

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • HTML 100.0%