Reordering (everything) and typos
bhindle committed Jul 26, 2018
1 parent 136c8ae commit 7902ed2
Showing 20 changed files with 13 additions and 13 deletions.
2 changes: 1 addition & 1 deletion 1_00.Rmd
@@ -1,2 +1,2 @@
-# (PART) Collecting and using data {-}
+# (PART) Collecting and Using Data {-}

2 changes: 1 addition & 1 deletion 3_00.Rmd
@@ -1,2 +1,2 @@
-# (PART) Simple Parametric Statistics {-}
+# (PART) Simple Statistics {-}

2 changes: 1 addition & 1 deletion 4_00.Rmd
@@ -1,3 +1,3 @@
-# (PART) Categorical Data {-}
+# (PART) Regression and ANOVA {-}


File renamed without changes.
File renamed without changes.
@@ -2,7 +2,7 @@

## Introduction {#intro}

-The two-sample *t*-tests evaluate whether or not the mean of a numeric variable changes among two groups or experimental conditions. At the beginning of the [Relationships and regression] chapter we pointed out that the different groups/conditions can be encoded by a categorical variable. We pointed out that we could conceptualise these *t*-tests as evaluating a relationship between the numeric and categorical variable. The obvious question is, what happens if we need to evaluate differences among means of more than two groups? The 'obvious' thing to do might seem to be to test each pair of means using a *t*-test. However, this procedure is tedious and, most importantly, statistically flawed.
+The two-sample *t*-tests evaluate whether or not the mean of a numeric variable changes among two groups or experimental conditions, which can be encoded by a categorical variable. We pointed out that we could conceptualise these *t*-tests as evaluating a relationship between the numeric and categorical variable. The obvious question is, what happens if we need to evaluate differences among means of more than two groups? The 'obvious' thing to do might seem to be to test each pair of means using a *t*-test. However, this procedure is tedious and, most importantly, statistically flawed.
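
As an illustrative sketch (not part of this commit or the book's own text; the `growth` data frame, its columns, and the treatment names are invented), here is roughly how the two approaches compare in R: testing each pair of group means in turn versus fitting a single ANOVA with `aov()`, the method introduced just below.

```r
# Hypothetical data: plant biomass measured under three treatments
set.seed(42)
growth <- data.frame(
  treatment = rep(c("control", "low", "high"), each = 10),
  biomass   = c(rnorm(10, mean = 5), rnorm(10, mean = 6), rnorm(10, mean = 7))
)

# Testing each pair of group means in turn (no multiple-testing correction)
pairwise.t.test(growth$biomass, growth$treatment, p.adjust.method = "none")

# A single ANOVA evaluates differences among all three group means at once
summary(aov(biomass ~ treatment, data = growth))
```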

In this chapter we will introduce an alternative method that allows us to assess the statistical significance of differences among several means at the same time. This method is called **Analysis of Variance** (abbreviated to ANOVA). ANOVA is one of those statistical terms that unfortunately has two slightly different meanings:

File renamed without changes.
2 changes: 1 addition & 1 deletion 5_00.Rmd
@@ -1,3 +1,3 @@
-# (PART) Associations and Relationships {-}
+# (PART) Doing More with Models {-}


File renamed without changes.
File renamed without changes.
File renamed without changes.
2 changes: 1 addition & 1 deletion 6_00.Rmd
@@ -1,3 +1,3 @@
-# (PART) Experimental Design and ANOVA (I) {-}
+# (PART) Experimental Design {-}


File renamed without changes.
File renamed without changes.
2 changes: 1 addition & 1 deletion 7_00.Rmd
@@ -1,2 +1,2 @@
-# (PART) Experimental Design and ANOVA (II) {-}
+# (PART) Beyond Simple Models {-}

2 changes: 1 addition & 1 deletion 8_00.Rmd
@@ -1,2 +1,2 @@
-# (PART) Fixing Problems {-}
+# (PART) Frequency Data and Non-parametric Tests {-}

@@ -8,7 +8,7 @@ However, we sometimes find a situation in which the ‘measurement’ we are int

## A new kind of distribution

-There are a quite a few options for dealing with categorical data^[e.g. the 'log-linear model', 'Fisher's exact test', and the 'G-test'.]. We're just going to look at one option in this book: $\chi^2$ tests. This is pronounced, and sometimes written, 'chi-square'. The 'ch' is a hard 'ch', as in 'character'. This isn't necessarily the best approach for every problem, but $\chi^2$ tests are widely used in biology so they are a good place to start.
+There are quite a few options for dealing with categorical data^[e.g. the 'log-linear model', 'Fisher's exact test', and the 'G-test'.]. We're just going to look at one option in this book: $\chi^2$ tests. This is pronounced, and sometimes written, 'chi-square'. The 'ch' is a hard 'ch', as in 'character'. This isn't necessarily the best approach for every problem, but $\chi^2$ tests are widely used in biology so they are a good place to start.

```{block, type='do-something'}
It is not critical that you understand everything in this section. This material is here to help those who like to have a sense of how statistical tests work. You won't be assessed on it.
@@ -18,7 +18,7 @@ The $\chi^2$ tests that we're going to study borrow their name from a particular

1. The $\chi^2$ distribution pops up a lot in statistics. However, in contrast to the normal distribution, it isn't often used to model the distribution of a variable we've sampled (i.e. 'the data'). Instead, the $\chi^2$ distribution is often associated with a test statistic of some kind.

-2. The standard $\chi^2$ distribution is completely described by only one parameter, called the degrees of freedom. This is closely related to the degrees of freedom idea introduced in the last few chapters on *t*-tests.
+2. The standard $\chi^2$ distribution is completely described by only one parameter, called the degrees of freedom. This is closely related to the degrees of freedom idea introduced in the chapters on *t*-tests.

3. The $\chi^2$ distribution is appropriate for positive-valued numeric variables. Negative values can't be accommodated. This is because the distribution arises whenever we take one or more normally distributed variables, square these, and then add them up.
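
A small R simulation (an illustrative sketch, not taken from the book) makes this last point concrete: squaring and summing three standard normal variables produces values that follow a $\chi^2$ distribution with three degrees of freedom.

```r
set.seed(1)

# Draw many sets of three standard normal values, square them, and sum each set
z <- matrix(rnorm(3 * 10000), ncol = 3)
simulated <- rowSums(z^2)

# The simulated values match the theoretical chi-squared density with df = 3
hist(simulated, breaks = 50, freq = FALSE,
     main = "Sum of three squared normals", xlab = "Value")
curve(dchisq(x, df = 3), add = TRUE, col = "red", lwd = 2)
```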

@@ -89,7 +89,7 @@ Notice that we are not interested in judging whether the proportion of males, or

### The assumptions and requirements of $\chi^{2}$ tests

-It's important to realise that in terms of their assumptions, analysis of a contingency tables and goodness-of-fit tests aren't fundamentally different from one another. The difference between the two types lies in the type of hypothesis evaluated. When we carry out a goodness-of-fit test we have to supply the expected values, whereas the calculation of expected values is embedded in the formula used to carry out a contingency table test. That will make more sense once we've seen the two tests in action.
+It's important to realise that in terms of their assumptions, contingency tables and goodness-of-fit tests aren't fundamentally different from one another. The difference between the two types lies in the type of hypothesis evaluated. When we carry out a goodness-of-fit test we have to supply the expected values, whereas the calculation of expected values is embedded in the formula used to carry out a contingency table test. That will make more sense once we've seen the two tests in action.

$\chi^{2}$ tests are often characterised as **non-parametric** tests because they do not assume any particular form for the distribution of the data. In fact, as with any statistical test, there are some assumptions in play, but these are relatively mild:

2 changes: 1 addition & 1 deletion 4_02_chi_sqr_gof.Rmd → 8_02_chi_sqr_gof.Rmd
@@ -34,7 +34,7 @@ We want to test whether the ratio of male to female flowers differs significantl

**Step 3.** Compare the $\chi^{2}$ statistic to the theoretical predictions of the $\chi^{2}$ distribution to assess the statistical significance of the difference between observed and expected counts.

-The interpretation of this *p*-value in this test is the same as for any other kind of statistical test: it is probability we would see the observed frequencies, or more extreme values, under the null hypothesis.
+The interpretation of this *p*-value in this test is the same as for any other kind of statistical test: it is the probability we would see the observed frequencies, or more extreme values, under the null hypothesis.
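
As a rough sketch of how these steps look in R (not part of this commit; the flower counts below are made up and the 1:1 expected ratio is only an assumed example), the whole procedure can be run with `chisq.test()`:

```r
# Hypothetical observed counts of male and female flowers
observed <- c(male = 105, female = 87)

# Chi-squared goodness-of-fit test against an expected 1:1 sex ratio
gof_test <- chisq.test(observed, p = c(0.5, 0.5))

gof_test$expected   # expected counts under the null hypothesis
gof_test$statistic  # the chi-squared test statistic
gof_test$p.value    # probability of a discrepancy at least this large under the null
```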

### Assumptions of the chi-square goodness of fit test

@@ -17,7 +17,7 @@ Let's think about what these kinds of data look like. Here are the biology stude

This is called a two-way contingency table. It is a *two-way* contingency table because it summarises the frequency distribution of two categorical variables at the same time^[This is called their 'joint distribution', in case you were wondering.]. If we had measured three variables we would have ended up with a *three-way* contingency table (e.g. 2 x 2 x 2).
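
A short R sketch of how such a table can be built from raw data with `table()` (the `students` data frame and its variables are invented here, not the book's biology-student data):

```r
# Hypothetical raw data: categorical variables recorded for 120 students
set.seed(7)
students <- data.frame(
  sex       = sample(c("Female", "Male"), size = 120, replace = TRUE),
  eats_meat = sample(c("Yes", "No"),      size = 120, replace = TRUE),
  glasses   = sample(c("Yes", "No"),      size = 120, replace = TRUE)
)

# Two-way contingency table: joint frequency distribution of two variables
table(students$sex, students$eats_meat)

# Adding a third categorical variable gives a three-way (2 x 2 x 2) table
table(students$sex, students$eats_meat, students$glasses)
```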

-A contingency table takes its name from the fact that it captures the 'contingencies' among the categorical variables: it summarises how the frequencies of one categorical variable are associated with the categories of another. The term association is use here to describe the non-independence of categories among categorical variables. Other terms used to refer to the same idea include 'linkage', 'non-independence', and 'interaction'.
+A contingency table takes its name from the fact that it captures the 'contingencies' among the categorical variables: it summarises how the frequencies of one categorical variable are associated with the categories of another. The term association is used here to describe the non-independence of categories among categorical variables. Other terms used to refer to the same idea include 'linkage', 'non-independence', and 'interaction'.

Associations are evident when the proportions of objects in one set of categories (e.g. R1 and R2) depend on a second set of categories (e.g. C1 and C2). Here are two possibilities:

File renamed without changes.
