The goal of describe
is to make it easy to compute descriptive
statistics for non-normal distributions without creating custom
functions or spending hours researching stack overflow. This package
also provides flexible and intuitive univariate and multiple data
imputation and visualizations for exploratory data analysis.
You can install the development version of this package through the
devtools
package.
devtools::install_github("jpmonteagudo28/describe")
You can also install it from CRAN:
install.packages("describe")
describe
aims to help researchers and analysts during the exploratory
phase of their analysis. This package allows you to compute almost any
measure of central tendency, bootstrapped correlation confidence
intervals, impute missing values, visualize missing patterns and a few
other things.
The package makes it easy to generate correlated data with missing patterns according to all three missingness mechanisms, missing completely at random (MCAR), missing at random (MAR), and missing not at random (MNAR).
#library(describe)
# Generate random data with NAs
# data <- gen.mcar(50,rho = .467,sigma = c(1,1.3),n_vars = 2, na_prob = .10)
# Impute missing values using predictive mean matching
# imputed_data <- pmean.match(data, robust = TRUE, verbose = TRUE)
# summary(imputed_data)
# Some examples of bootstrapped correlation coefficients and geometric median
# Plot something here
Wulff, J. N., & Jeppesen, L. E. (2017). Multiple imputation by chained equations in praxis: Guidelines and review. Electronic Journal of Business Research Methods, 15(1), 41-56. http://www.ejbrm.com/volume15/issue1
Li, Peng, Elizabeth A. Stuart, and David B. Allison. 2015. “Multiple Imputation: A Flexible Tool for Handling Missing Data.” JAMA 314 (18): 1966. https://doi.org/10.1001/jama.2015.15281.
Byrne, John. 2005. “A New Short Exact Geometric Confidence Interval.” Australian & New Zealand Journal of Statistics 47 (4): 563–69. https://doi.org/10.1111/j.1467-842X.2005.00417.x.
Donders, A. Rogier T., Geert J.M.G. van der Heijden, Theo Stijnen, and Karel G.M. Moons. 2006. “Review: A Gentle Introduction to Imputation of Missing Values.” Journal of Clinical Epidemiology 59 (10): 1087–91. https://doi.org/10.1016/j.jclinepi.2006.01.014.
Elamir, Elsayed A H. 2015. “Tests and Confidence Intervals for Geometric Mean from One AND Two Populations: A Unified Approach.” Statistica & Applicazioni. Vol. XIII.
Yang, John Jeongseok, Yousun Chung, Hyungsuk Kim, Dae-Hyun Ko, Sang-Hyun Hwang, and Heung-Bum Oh. 2020. “Retrospection of Anti–Blood Group Antibody Proficiency Testing Data Using the Geometric Mean and Standard Deviation.” American Journal of Clinical Pathology 153 (4): 530–36. https://doi.org/10.1093/ajcp/aqz187.
van Buuren, S. (2018). Flexible Imputation of Missing Data, Second Edition (2nd ed.). Chapman and Hall/CRC. https://doi.org/10.1201/9780429492259