Add few more parts to correlation lesson #3

Open
wants to merge 1 commit into
base: main

souravsingh
Contributor

Add a few more questions to the Correlation lesson plan. Need feedback.


- Class: cmd_question
Output: "Now let's create a scatter plot to find the relation between RNASeq Gene Expression and Microarray Gene Expression"

Expand on the purpose of doing this and what the student should look for in the ggplot: what kind of correlations will they see? Maybe explain the basics of scatterplots and what they are best used for.
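
For illustration, a scatter plot of this kind could be sketched as below. This is only a minimal sketch under stated assumptions: the data frame `expr` and its values are hypothetical stand-ins, not the lesson's data, and the ggplot2 package is assumed to be installed.

```r
library(ggplot2)

# Hypothetical paired measurements for one gene across six tumours
# (illustrative values only, not taken from the lesson's datasets).
expr <- data.frame(
  tumour = paste0("T", 1:6),
  rnaseq = c(10.2, 11.5, 9.8, 12.1, 10.9, 11.8),
  array  = c(8.1, 9.0, 7.9, 9.6, 8.5, 9.3)
)

# One point per tumour: RNA-seq expression on x, microarray expression on y.
p <- ggplot(expr, aes(x = rnaseq, y = array)) +
  geom_point() +
  labs(x = "RNA-seq gene expression", y = "Microarray gene expression")

p
```

Points lying close to a rising straight line would suggest a strong positive linear relationship between the two platforms, while a diffuse cloud would suggest a weak one.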

Output: For calculating correlations between two variables, we have the cor() function which takes as parameters the variables between which the correlations need to be calculated and the method of calculation for correlation.

- Class: cmd_question
Output: "Let us first calculate the correlation between RNAseq and microarray using the Pearson method."

Explain Pearson vs. Spearman correlation in more detail: what are the advantages of each method? Have the student compare the outputs of both and consider how the choice may affect their interpretation of the correlation.
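
As a rough sketch of the comparison a student could run, the example below calls base R's `cor()` with both methods on the same data. The vectors `x` and `y` are hypothetical and chosen so that the relationship is monotonic but not linear.

```r
# Hypothetical values: y increases with x, but not along a straight line.
x <- c(1, 2, 3, 4, 5, 6)
y <- x^3

# Pearson measures linear association on the raw values.
pearson <- cor(x, y, method = "pearson")

# Spearman is Pearson computed on the ranks, so it measures
# monotonic association instead.
spearman <- cor(x, y, method = "spearman")

print(c(pearson = pearson, spearman = spearman))
```

Here the Spearman coefficient is 1 because the ranks of `x` and `y` agree exactly, while the Pearson coefficient is lower because the relationship is curved. Pearson is also more sensitive to outliers, which is one reason the two methods can lead to different interpretations of the same data.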

Output: Now we compare ACTB gene expression between RNA-seq and microarray data. For this we first need to create a data frame, then create a scatterplot of RNASeq gene expression vs microarray gene expression.

- Class: cmd_question
Output: "Let's start by creating a data frame for Tumor vs ACTB gene expression. For this we need to create three columns (tumour, rnaseq and array) and obtain data for these columns from the rnaseq_wide and array_wide data for ACTB"

Put an overview of our datasets here, review what we've done to prepare each of them, and explain how the values will be mapped when we plot them.
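
One way the mapping might be shown to students is sketched below. The layout of `rnaseq_wide` and `array_wide` is an assumption here (one row per gene, a `gene` column, and one column per tumour); the toy tables and their values are hypothetical, and the real objects in the lesson may be shaped differently.

```r
# Toy stand-ins for rnaseq_wide and array_wide. ASSUMED layout: one row per
# gene, a `gene` column, and one column per tumour. The real tables may differ.
rnaseq_wide <- data.frame(
  gene = c("ACTB", "GAPDH"),
  T1 = c(12.1, 11.0), T2 = c(12.4, 10.7), T3 = c(11.9, 11.2)
)
array_wide <- data.frame(
  gene = c("ACTB", "GAPDH"),
  T1 = c(13.0, 12.2), T2 = c(13.3, 12.0), T3 = c(12.8, 12.5)
)

# Tumour columns present in both tables, so values are paired by tumour name.
tumours <- intersect(names(rnaseq_wide)[-1], names(array_wide)[-1])

# Pull the ACTB row from each table and line the values up by tumour.
actb <- data.frame(
  tumour = tumours,
  rnaseq = unlist(rnaseq_wide[rnaseq_wide$gene == "ACTB", tumours], use.names = FALSE),
  array  = unlist(array_wide[array_wide$gene == "ACTB", tumours], use.names = FALSE)
)

print(actb)
```

Each row of `actb` then corresponds to one tumour, with its RNA-seq value and its microarray value side by side, which is the shape a scatter plot or `cor()` call needs.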

lisancao left a comment

@souravsingh Great work! This is an awesome basis for this course. Because I anticipate this course being a little more difficult material-wise, I've asked you to flesh out the material in some places.

Please also add hints for every question.

3 participants