Skip to content

fcorowe/sl

Repository files navigation

Introduction to Statistical Learning

DOI

Francisco Rowe [@fcorowe]1*

1 Geographic Data Science Lab, University of Liverpool, Liverpool, United Kingdom

* Correspondence: [email protected]

This site introduces the course Introduction to Statistical Learning in R. The course provides an introduction to statistics and probability covering essential topics in descriptive and inferential statistics and supervised machine learning. It adopts a problem-to-solution teaching approach, defining a practical problem and illustrating how statistics can enable understanding to make critically informed decisions about a population by examining a random sample. It uses a learning-by-doing approach based on real-world examples in various contexts. This also teaches how to conduct statistical data analysis in R. The course is organised around 6 sessions. Each session is designed to provide a combination of key statistical concepts and practical application through the use of R.

The course comprises three main components. The first component focuses on descriptive statistics, including descriptive statistics of different data types, common probability distributions and measures of centrality and dispersion. The second component involves inferential statistics covering hypothesis testing, confidence intervals, correlation, regression analysis, supervised machine learning approaches and cross-validation.

Learning outcomes

Having successfully completed this course, you will be able to:

  1. Conduct exploratory statistical data analysis.
  2. Have an understanding of elementary probability distributions and data types.
  3. Perform correlation and regression data analysis using real-world data.
  4. Assess the statistical significance between different data types.
  5. Carry out statistical data analysis in R.
  6. Have a basic understanding of supervised machine learning and cross-validation.

Schedule

The notes for each session are:

Citation

If you use the material, code or processed data, you can give appropriate attribution by using the following citation:

@article{rowe_slr20,
  author = {Francisco Rowe},
  title = {Introduction to Statistical Learning in R},
  year = 2020,
  url = {\url{https://fcorowe.github.io/sl/}},
  doi = {10.5281/zenodo.4007043},
}