Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Showing 1 changed file with 16 additions and 1 deletion.
@@ -1,4 +1,19 @@
# *Visualizing Multivariate Data and Models in R*
<img src="images/Viz-MLM-logo.jpg" style="float:right; height:200px;" />

This is the bare beginnings of my new book ...
This book is about graphical methods for multivariate data and their use in understanding relationships,
particularly when several aspects must be considered together in multiple-response models
such as multivariate analysis of variance and multivariate multiple regression.

It brings together a collection of novel techniques that I and others have developed over the past 15 years and implemented in mature R packages. Some key substantive features of the book are:

* Statistical data visualization is cast in a general framework organized by goal (see the data, visualize a model, diagnose problems) rather than by type of graphic. It is best informed by principles and goals
of communication: for example, making graphical comparison easy and ordering factors and variables according to what should be seen (_effect ordering_).

* Data visualization is seen as a combination of **exposure** (plotting the raw data) and **summarization** (plotting statistical summaries) to highlight what should be noticed. For example, data ellipses and confidence ellipses are widely used as simple, effective summaries of data and fitted model parameters. When the data is complex, the idea of **visual thinning** can be used to balance the tradeoff between the two.

* These graphical tools can be used to understand or explain a wide variety of statistical concepts, phenomena, and paradoxes, such as Simpson's paradox and the effects of measurement error.

* The HE ("hypothesis-error") plot framework provides a simple way to understand the results of statistical tests and the relations among response outcomes in the multivariate linear model.

* Dimension reduction techniques such as PCA and discriminant analysis are presented as "multivariate juicers," able to squeeze the important information in high-dimensional data into informative two-dimensional views.
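
As a rough illustration of the exposure-plus-summarization idea in the list above, the following base-R sketch overlays a data ellipse on a scatterplot of the raw points. It is not taken from the book: `data_ellipse()` is a hypothetical helper written for this example, and the 68% coverage level and the choice of two variables from the built-in `iris` data are assumptions. None of the R packages the book refers to are used here.

```r
# Sketch only: a data ellipse drawn over the raw points, using base R.
# `data_ellipse()` is a hypothetical helper, not a function from any package.
data_ellipse <- function(x, y, level = 0.68, n = 100) {
  xy     <- cbind(x, y)
  center <- colMeans(xy)   # centroid of the data
  S      <- cov(xy)        # sample covariance matrix

  # Radius chosen so the ellipse would cover `level` of a bivariate normal
  radius <- sqrt(qchisq(level, df = 2))

  # Points on the unit circle
  theta  <- seq(0, 2 * pi, length.out = n)
  circle <- cbind(cos(theta), sin(theta))

  # chol(S) returns upper-triangular R with t(R) %*% R == S, so multiplying
  # the circle by R stretches and rotates it to the shape of S
  pts <- sweep(radius * circle %*% chol(S), 2, center, "+")
  colnames(pts) <- c("x", "y")
  pts
}

# Assumed example data: two variables from the built-in iris data set
ell <- with(iris, data_ellipse(Sepal.Length, Petal.Length))

# Exposure: the raw points. Summarization: the ellipse and the centroid.
plot(Petal.Length ~ Sepal.Length, data = iris, pch = 16, col = "grey50")
lines(ell, col = "blue", lwd = 2)
points(mean(iris$Sepal.Length), mean(iris$Petal.Length),
       pch = 3, cex = 2, col = "blue")
```

Every point on the returned curve lies at the same Mahalanobis distance from the centroid, which is what makes the ellipse a compact summary of the means, variances, and correlation of the two variables.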