Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Content Request Data Manipulation Section #1169

Open
MatthijstenTije opened this issue Apr 25, 2024 · 2 comments
Open

Content Request Data Manipulation Section #1169

MatthijstenTije opened this issue Apr 25, 2024 · 2 comments
Assignees

Comments

@MatthijstenTije
Copy link
Contributor

Content Request

I want to start working on the data manipulation section of the website. Lachlan has sent me a cookbook for working with data in R, and I came across the following subjects I want to write an article about, as I think they would be useful for students. If there are other subjects that wants to be worked on, feel free to add.

Data Wrangling

Working with Factors in R

I want to start by explaining the factor() function and discuss an example, its structure, ordering levels, renaming of levels, and how to order categorical values. As a more advanced case, I want to highlight the forecasts package and show how to reorder factor levels by sorting along another variable within ggplot.

Data Transformations with dplyr

This is very common stuff, and I think every course teaches the dplyr package. However, I think a more in-depth look at the dplyr packages combined with R helper functions would add to most students' toolkit. I plan to work with several sections:

  • Subsetting, where filter and select are discussed along with their helper functions.
  • Aggregating, where summarize and group_by are discussed along with the helper functions.

Working With Character Data inside R

I want to discuss several functions that are useful when variables in a dataset are character-based and need to be transformed in some way. I want to focus on the following functions: paste(), paste0(), substr(), substring(), Regular Expressions, and lastly substitution using sub(), gsub(), grep(), and grepl().I

Sources

@lachlandeer
Copy link
Member

This looks like a good starting point!

Looking forward to see the new content. Do you have a rough timeline for completion?

@MatthijstenTije
Copy link
Contributor Author

Great! I waited for approval, so I am yet to start on it! somewhere in the summer (mid-July/begin august)? I have some vacations planned and work for Tilburg.ai.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants