summ()
generates summary statistics for numerical data as well as
grouped summary measures.
summ(data, ..., by = NULL, na.rm = FALSE, rnd = 1)
Dataset
Variable or multiple variables
Colon separator :
can be used to specify multiple variables.
Varaiable for cross-tabulation
A logical value to specify missing values,
specify rounding of numbers. See round
.
summary measures as data.frame
summ()
reports seven number summary statistics, normality and other additional
metadata.
summ(data, var1)
summ(data, var1, var2, var3:var5, var10)
summ(data)
Normality test is perfomed by Shapiro-Wilk Normality Test. See more at
shapiro.test
.
' ANNOTATIONS
Obs.
= observation
NA.
= missing data
Mean
= Mean value
Std.Dev
= Standard deviation
Median
= Median value
Q1
= First quartile or percentile
Q3
= Third quartile or percentile
Min
= Minimum value
Max
= Maximum value
Normality
= P-value from Shapiro-Wilk Normality Test
Grouped Summary Measures
If by
is specified, grouped summary measures are calculated and
produced five number summary, excluding minimum and maximum. In addition,
if levels of by
are more than 2, p-values from ANOVA and Kruskal Wallis tests
are displayed. Otherwise, Student's t-test and Wilcoxon signed rank test are
measured and their respective p-values are tabulated.
There are two parts of the final table. The first part tabulates grouped summary measures and second part tabulates one-variable summary measures for corresponding variables.
summ(data, var1, var2, by = var3)
summ(data, var1, var2, var3:var5, var10, by = var11)
summ(data, by = var11)
Using colon :
spearator
Colon separator :
can be used to indicate sequence of variables.
summ(data, var1, var2, var3:var5, var10)
Betty R. Kirkwood, Jonathan A.C. Sterne (2006, ISBN:978<U+2013>0<U+2013>86542<U+2013>871<U+2013>3)
# NOT RUN {
## use iris dataset
data(iris)
summ(iris, Sepal.Length)
summ(iris, Sepal.Length:Petal.Width)
summ(iris)
# }
Run the code above in your browser using DataLab