Learn R Programming

mStats (version 3.2.2)

summ: Number Summary for numerical data

Description

summ() generates summary statistics for numerical data as well as grouped summary measures.

Usage

summ(data, ..., by = NULL, na.rm = FALSE, rnd = 1)

Arguments

data

Dataset

...

Variable or multiple variables Colon separator : can be used to specify multiple variables.

by

Varaiable for cross-tabulation

na.rm

A logical value to specify missing values,

rnd

specify rounding of numbers. See round.

Value

summary measures as data.frame

Details

summ() reports seven number summary statistics, normality and other additional metadata.

summ(data, var1)

summ(data, var1, var2, var3:var5, var10)

summ(data)

Normality test is perfomed by Shapiro-Wilk Normality Test. See more at shapiro.test.

' ANNOTATIONS

Obs. = observation

NA. = missing data

Mean = Mean value

Std.Dev = Standard deviation

Median = Median value

Q1 = First quartile or percentile

Q3 = Third quartile or percentile

Min = Minimum value

Max = Maximum value

Normality = P-value from Shapiro-Wilk Normality Test

Grouped Summary Measures

If by is specified, grouped summary measures are calculated and produced five number summary, excluding minimum and maximum. In addition, if levels of by are more than 2, p-values from ANOVA and Kruskal Wallis tests are displayed. Otherwise, Student's t-test and Wilcoxon signed rank test are measured and their respective p-values are tabulated.

There are two parts of the final table. The first part tabulates grouped summary measures and second part tabulates one-variable summary measures for corresponding variables.

summ(data, var1, var2, by = var3)

summ(data, var1, var2, var3:var5, var10, by = var11)

summ(data, by = var11)

Using colon : spearator

Colon separator : can be used to indicate sequence of variables.

summ(data, var1, var2, var3:var5, var10)

References

Betty R. Kirkwood, Jonathan A.C. Sterne (2006, ISBN:978<U+2013>0<U+2013>86542<U+2013>871<U+2013>3)

Examples

Run this code
# NOT RUN {
## use iris dataset
data(iris)

summ(iris, Sepal.Length)
summ(iris, Sepal.Length:Petal.Width)

summ(iris)

# }

Run the code above in your browser using DataLab