Skip to content

jpmonteagudo28/describe

Repository files navigation

describe describe website

stability-wip CRAN status

The goal of describe is to make it easy to compute descriptive statistics for non-normal distributions without creating custom functions or spending hours researching stack overflow. This package also provides flexible and intuitive univariate and multiple data imputation and visualizations for exploratory data analysis.

Installation

You can install the development version of this package through the devtools package.

devtools::install_github("jpmonteagudo28/describe")

You can also install it from CRAN:

install.packages("describe")

Usage

describe aims to help researchers and analysts during the exploratory phase of their analysis. This package allows you to compute almost any measure of central tendency, bootstrapped correlation confidence intervals, impute missing values, visualize missing patterns and a few other things.

The package makes it easy to generate correlated data with missing patterns according to all three missingness mechanisms, missing completely at random (MCAR), missing at random (MAR), and missing not at random (MNAR).

#library(describe)

# Generate random data with NAs
# data <- gen.mcar(50,rho = .467,sigma = c(1,1.3),n_vars = 2, na_prob = .10)

# Impute missing values using predictive mean matching
# imputed_data <- pmean.match(data, robust = TRUE, verbose = TRUE)

# summary(imputed_data)

Imputation

# Some examples of bootstrapped correlation coefficients and geometric median

Visualizations

# Plot something here

Further reading

Wulff, J. N., & Jeppesen, L. E. (2017). Multiple imputation by chained equations in praxis: Guidelines and review. Electronic Journal of Business Research Methods, 15(1), 41-56. http://www.ejbrm.com/volume15/issue1

Li, Peng, Elizabeth A. Stuart, and David B. Allison. 2015. “Multiple Imputation: A Flexible Tool for Handling Missing Data.” JAMA 314 (18): 1966. https://doi.org/10.1001/jama.2015.15281.

Byrne, John. 2005. “A New Short Exact Geometric Confidence Interval.” Australian & New Zealand Journal of Statistics 47 (4): 563–69. https://doi.org/10.1111/j.1467-842X.2005.00417.x.

Donders, A. Rogier T., Geert J.M.G. van der Heijden, Theo Stijnen, and Karel G.M. Moons. 2006. “Review: A Gentle Introduction to Imputation of Missing Values.” Journal of Clinical Epidemiology 59 (10): 1087–91. https://doi.org/10.1016/j.jclinepi.2006.01.014.

Elamir, Elsayed A H. 2015. “Tests and Confidence Intervals for Geometric Mean from One AND Two Populations: A Unified Approach.” Statistica & Applicazioni. Vol. XIII.

Yang, John Jeongseok, Yousun Chung, Hyungsuk Kim, Dae-Hyun Ko, Sang-Hyun Hwang, and Heung-Bum Oh. 2020. “Retrospection of Anti–Blood Group Antibody Proficiency Testing Data Using the Geometric Mean and Standard Deviation.” American Journal of Clinical Pathology 153 (4): 530–36. https://doi.org/10.1093/ajcp/aqz187.

van Buuren, S. (2018). Flexible Imputation of Missing Data, Second Edition (2nd ed.). Chapman and Hall/CRC. https://doi.org/10.1201/9780429492259

Releases

No releases published

Packages

No packages published

Languages