- Table of contents
- Programming with Data
- Topics covered
- Assessment
- Module specification
- Syllabus
- Resources
This module will show you how to work with data: getting data from a variety of sources, visualising data in compelling, informative ways, processing data to make it useful and shareable, and reasoning with data to test hypotheses and make parameterised predictions. The module will also introduce you to a new language and programming environment that is well-adapted to languages for these applications.
- Setting up the programming environment
- Control structures, functions and comprehensions
- Data-driven programming
- Visualising data
- Descriptive statistics
- Getting data
- Processing data: cleaning, normalizing, and scaling
- Classification with K-nearest neighbours
- Bayes' theorem and naïve Bayes classification
- Clustering
One two hour unseen written examination and coursework (Type I)
- Foundations of Data Science: K-Means Clustering in Python - Coursera, by Dr Matthew Yee-King +3 more instructors.
- Learn Python, Data Viz, Pandas & More on Kaggle