-
Notifications
You must be signed in to change notification settings - Fork 10
Statistical Recipes in Python
Maurice HT Ling edited this page Aug 14, 2021
·
42 revisions
Methods to summarize a given set of data.
- Arithmetic mean
- Geometric mean
- Harmonic mean
- Maximum
- Minimum
- Moment
- Kurtosis
- Sample standard deviation
- Sample standard error
- Sample variance
- Skew
Methods to examine whether a set of values are normally distributed.
- Kurtosis test: test whether kurtosis of data is normal
- Jarque-Bera test: normality test for large sample size (n > 2000)
- Shapiro-Wilk test for normality: normality test for small sample size
- Skew test: test whether the skew of data is normal
Methods to examine whether two or more sets of values have the same variance (or standard deviation).
Methods to examine the trend of 2 sets of values.
- Kendall's tau: correlation measure for ordinal data
- Pearson's correlation coefficient: correlation measure for normally distributed data
- Point biserial correlation coefficient: correlation measure between a binary variable and a continuous variable
- Somer's D: asymmetric measure of ordinal association
- Spearman's rank correlation coefficient: correlation measure for ranked data
Statistical tests assuming normality in data set(s).
- Alexander Govern test: test for equal means in 2 or more samples without assuming equal variances
- ANOVA - One-way: test for equal means in 2 or more samples assuming equal variances
- t-test - 2-samples (independent samples) assuming equal variance
- t-test - 2-samples (independent samples) assuming unequal variance
- t-test - paired (dependent samples)
Statistical tests without assuming normality in data set(s), also known as distribution-free tests.
- Brunner-Munzel test: non-parametric version of 2-samples (independent samples) t-test without assuming equal variances
- Chi-Square test: test whether 2 distributions are equal
- Cramér-von Mises test: test whether 2 distributions are equal
- Cressie-Read power divergence test: test whether 2 distributions are equal
- Epps-Singleton test: test whether 2 distributions are equal
- Freeman-Tukey test: test whether 2 distributions are equal
- G-test: test whether 2 distributions are equal
- Kolmogorov-Smirnov test: test whether 2 distributions are equal
- Kruskal-Wallis H-test: non-parametric version of ANOVA - One-way
- Mann-Whitney U test: non-parametric version of 2-samples (independent samples) t-test assuming equal variances
- Neyman's test of goodness of fit: test whether 2 distributions are equal
- Page's L test: measure of trend in observations between treatments
- Wilcoxon rank-sum test: non-parametric version of 2-samples (independent samples) t-test assuming equal variances
- Wilcoxon signed-rank test: non-parametric version of t-test - paired (dependent samples)
Methods to analyze 2x2 and MxN contingency tables.
- Barnard exact test: test whether variable for columns is independent to the variable for rows on 2x2 contingency table
- Boschloo’s exact test: test whether variable for columns is independent to the variable for rows on 2x2 contingency table
- Chi-square test of independence: test whether variable for columns is independent to the variable for rows on MxN contingency table
- Cramer’s V: measure the degree of association between two nominal variables on MxN contingency table
- Fisher exact test: test whether variable for columns is independent to the variable for rows on 2x2 contingency table
- G-test of independence: test whether variable for columns is independent to the variable for rows on MxN contingency table
- Pearson’s contingency coefficient: measure the degree of association between two nominal variables on MxN contingency table
- Relative risk: test whether exposure increases the risk of an outcome using 2x2 contingency table
- Odds ratio: test whether exposure increases the odds of an outcome using 2x2 contingency table
- Tschuprow’s T: measure the degree of association between two nominal variables on MxN contingency table
Methods to combine p-values from independent tests on the same hypothesis.
Copyright (c) 2008-2024, Maurice HT Ling
Refereed Publications and Technical Reports
Abstracts and Other Un-Refereed Works
Autobiographic Verses (Poems that I wrote) and My Sayings