Skip to content

Data Science 101 Prep

Sam Hopkins edited this page Oct 4, 2019 · 5 revisions

LDSSA Data Science Prep Course

The Prep course aims to equip you with the required knowledge for the Data Science 101 course. It is not required so if you are already comfortable with python and git, then don't worry about completing this!

What: A self-directed course covering useful materials for the Course (e.g. Python, numpy, Git, etc.). Everything is 100% online and remote.

Scope: Python Programming, Python Engineering, numpy, the *nix command line, Git Basics, Jupyter

Who: Anyone who wants to start learning Data Science but has no prior knowledge or experience in Python

When: Totally on your own time

How:

  • Learning materials: We will be using study materials provided by Dataquest, which teaches new concepts using interactive coding challenges

  • Slack Q&A: Ask questions on Slack! Although this prep course is not officially included in the course itself, we will do our best to support you in case you've got questions.

Investment: $29 (Dataquest's Basic 1-month plan, which is paid directly to Dataquest)

How will the course work

Our role

We, DareData, are playing the role of facilitators much more than teachers for this course. The contents and materials that need to be learned here are very commonly taught concepts and can be learned on your own through many of the excellent online-only courses.

You and DataQuest

DataQuest is where the vast majority of your learning should take place while we provide some deadlines. There are 11 of you going through this together, help each other out!

Deadlines and time management

DataQuest is organized into courses. Each of these courses is a self-contained unit that takes about 1.5 hours to complete. We have pre-selected 19 of them and for each of them assigned a week in which they should be done. Since the schedule is tight, you should do everything that you can in order to meet these deadlines but if you cannot, that is okay! These deadlines are suggestions to help you keep on track. If you fall a bit behind for whatever reason, do everything you can to get back on track!

Requirements:

  • A laptop, internet connection, and a web browser
  • The ability to pay $29 to Dataquest

Signup and billing

Since we did not build the material ourselves and rely on Dataquest's courses instead, you will have to sign up with and pay Dataquest all on your own. For your subscription plan and the course dates to correspond with each other, you need to follow a few key steps.

Step 1: Subscribe to Dataquest's Basic Plan on or after October 1st 2019. We know that the payment date is very exact, this is only for your own goods as we are planning it to keep your investment as low as possible.

Step 2: Take the most advantage of your subscription plan! We pre-selected 19 courses that are crucial for you to pass the entrance exam and to succeed in the academy. If after having completed those and still having time, you are very much encouraged to go through the rest of the courses that come along with Dataquest's Basic plan!

Step 3: Cancel your subscription before October 31st 2019. All Dataquest's subscriptions renew automatically, so if you forget to do this, you will be BILLED AGAIN. Yes you read it correctly. BILLED. AGAIN. So you'd better mark it to your calendar!

Schedule

As mentioned previously, this is our suggested schedule to help you keep track with your learning. That being said, feel free to skip certain topics you already knew. The other way around, take the time to study the materials that you are not comfortable with.

Below we listed the 19 learning units that you need to complete. We recommend you to give the highest priority to those first, and then if you still have time, feel free to go through the rest of the learning units.

Week 1: Python for Data Science: Fundamentals (Part I)

  1. Programming in Python
  2. Variables and Data Types
  3. Lists and For Loops
  4. Conditional Statements
  5. Dictionaries and Frequency Tables

Week 2: Python for Data Science: Fundamentals (Part II)

  1. Functions: Fundamentals
  2. Functions: Intermediate
  3. Project: Learn and Install Jupyter Notebook
  4. Guided Project: Profitable App Profiles for the App Store and Google Play Markets
  5. Exception raising: Materials to be provided

Week 3: Python for Data Science: Intermediate & Pandas and NumPy Fundamentals

  1. Cleaning and Preparing Data in Python
  2. Object-Oriented Python
  1. Introduction to NumPy
  2. Boolean Indexing with NumPy

Week 4: Command Line: Beginner & Git and Version Control

  1. Command line basics
  2. Challenge: Navigating the File System
  1. Introduction to Git
  2. Git Remotes
  3. Project: Git Installation and GitHub Integration

Slack Usage

The golden rule

There is one slack rule to rule them all: DON'T BE SHY! ASK QUESTIONS PUBLICLY

We know and totally understand that it can be intimidating to ask a question in front of a bunch of other people that you don't even know. We've all been there and can empathize. However, the rewards of overcoming this fear and asking a question in a slack channel that everyone can see are too great to ignore. The benefits, among others are:

  1. You will get an answer faster. If 15 people see your question, you will get an answer faster than if 1 person does.
  2. You will get a better answer. If 15 people see your question, you're likely get a better than if 1 person does.
  3. You may make a friend. It increases the chances of interacting with an interesting person that you may not have met otherwise. These types of peers can increase your chances of success immensely.

Practicalities

Here we will talk about each of the channels available in our slack channel and how they should be used. This is super important because it dictates how important information will be organized. While it may seem annoying at the moment to have to remember where to post a question, the future you will GREATLY appreciate it when you are looking for where that one clever answer to that one question is. Furthermore, you'll be very grateful not to have a question about a completely unrelated subject right next to your brilliant one. Trust us, the extra effort is worth it!

#dataquest-missions

If you have general questions about how dataquest works this is where you would ask them. For example: how do I track my progress in DataQuest? A question that you would not ask here is something very specific about one of the exercises.

#python-fundamentals

This is where you ask any questions that are specific to the Python for Data Science: Fundamentals course

#python-intermediate

This is where you ask any questions that are specific to the Python for Data Science: Intermediate course

#pandas-numpy-fundies

This is where you ask any questions that are specific to the Pandas and Numpy Fundamentals Course

#command-line-beginner

This is where you ask any questions that are specific to the Command Line: Beginner Course

#command-line-intermed

This is where you ask any questions that are specific to the Command Line: Intermediate Course

#git-and-vc

This is where you ask any questions that are specific to the Git and Version Control Course

How to ask for help

Again, always ask in a slack channel and DO NOT DM an instructor! For your questions to be answered in the shortest possible time, provide us with enough context to be able to answer it. Among others, you can attach a screenshot or copy-paste formatted code together with your questions. If you, as a student, see a question that you know the answer to, do it! Answer it!

Frequently Asked Questions

I just want to check that the whole course is 100% remote right?

Everything is 100% remote and online!

Can you please also confirm if the prep course classes will take place after work hours (18h)?

The Prep course does not occur at any specific time, except for the live sessions. We only care about you completing all the materials scheduled for that week. Apart from that, you can learn at anywhere and anytime at your own pace.