Learn R Programming

collapse (version 1.1.0)

A0-collapse-documentation: Collapse Documentation & Overview

Description

The following table fully summarizes the contents of collapse. The documentation follows a hierarchical structure: This is the main overview page, linking to topical overview pages and associated function pages (unless functions are documented on the topic page). Calling ?FUN brings up the documentation page for FUN, with links to associated topic pages and closely related functions.

Arguments

Topics and Functions

Topic Main Features / Keywords Functions

Fast Statistical Functions Fast (grouped and weighted) statistical functions for vector, matrix, data.frame and grouped_df (dplyr compatible). fsum, fprod, fmean, fmedian, fmode, fvar, fsd, fmin, fmax, ffirst, flast, fNobs, fNdistinct

Fast Grouping Fast (ordered or unordered) groupings from vectors, data.frames, lists. 'GRP' objects are extremely efficient inputs for collapse's fast functions (to optimize different / repeated computations over the same groups). GRP, as.factor.GRP, group_names.GRP, is.GRP

Quick Select and Replace Variables Quick and flexible select and replace (or add) variables from (to) data.frames / data.tables / tibbles etc... (speed about 2x '[' for selecting and 4x '[<-' for replacing). get_vars/gv, add_vars/av, num_vars/nv, cat_vars, char_vars, fact_vars, logi_vars, Date_vars

Quick Data Conversion Quick conversions: data.frame <> data.table | matrix <> list, data.frame, data.table | array > matrix, data.frame, data.table | list > data.frame, data.table | vector > factor, matrix, data.frame, data.table. qDF, qDT, qM, qF, qG, mrtl, mctl

Advanced Data Aggregation Fast and easy (weighted and parallelized) aggregation of multi-type data, with (multiple) functions applied to numeric and categorical columns. Also supports fully customized aggregation tasks mapping functions directly to columns. collap, collapv, collapg

Data Transformations Efficient row- and column- data-apply and Split-Apply-Combine computing. Fast (grouped and weighted) replacing and sweeping of statistics, scaling / standardizing, (higher-dimensional) within- and between-transformations (i.e. centering and averaging), complex linear prediction and partialling out. A fast F-test for linear models with (large) factors. Additional methods for grouped_df (dplyr) and pseries, pdata.frame (plm). dapply, BY, TRA, fscale/STD, fbetween/B, fwithin/W, fHDbetween/HDB, fHDwithin/HDW, fFtest

Time-Series and Panel-Series Fast (sequences of) lags / leads and (lagged / leaded and iterated) differences and growth rates / log-differences on (unordered) time-series and panel-data. Panel-data to (ts-)array conversions. Multivariate panel- auto-, partial- and cross- correlation functions. Additional methods for grouped_df (dplyr) and pseries, pdata.frame (plm). flag/L/F, fdiff/D, fgrowth/G, psmat, psacf, pspacf, psccf

List Processing (Recursive) list search and identification, search and extract list-elements / list-subsetting, rapply to lists of data.frame's / data objects, and (fast) generalized recursive row-binding / unlisting in 2-dimensions / to data.frame. is.regular, is.unlistable, ldepth, has_elem, get_elem, atomic_elem, list_elem, reg_elem, irreg_elem, rapply2d, unlist2d

Summary Statistics Extremely fast (one-pass, grouped and weighted), summary statistics for cross-sectional and complex multilevel / panel data, with additional methods for pseries and pdata.frame (plm). Efficient detailed description of data.frame. Pairwise correlations and covariances (with observation count, p-value and pretty printing), pairwise observation count. qsu, descr, pwcor, pwcov, pwNobs

Recode and Replace Values Recode multiple values (exact or regex matching) and replace NaN/Inf/-Inf and outliers (according to 1- or 2-sided threshold or column standard-deviation) in vectors, matrices or data.frames. Recode, replace_non_finite, replace_outliers

Small (Helper) Functions Set and extract variable labels, extract variable classes and C storage types, display variable names, labels and classes, add / remove prefix or postfix to / from column names, not-in operator, matching with error message for non-matched, faster nlevels for factors, fast unique vector elements, faster interactions, remove NA's from vector, insert NA's at random into matrix-like objects, check exact or near / numeric equality of multiple objects or of all elements in a list, seq_along rows or columns of matrix-like objects, return object with dimnames, row- or colnames set, identify categorical and date(-time) objects, convert factors (or all factors in a list) to numeric or character by converting levels. vlabels, vclasses, vtypes, namlab, add_stub, rm_stub, %!in%, ckmatch, fnlevels, funique, finteraction, na_rm, na_insert, all_identical, all_obj_equal, seq_row, seq_col, setDimnames, setRownames, setColnames, is.categorical, is.Date, as.numeric_factor, as.character_factor

Data and Global Macros Groningen Growth and Development Centre 10-Sector Database, World Bank World Development dataset, and some global macros containing links to the topical documentation pages (including this page), all exported objects (excluding exported S3 methods), all generic functions, the 2 datasets, all fast functions, all fast statistical (scalar-valued) functions, and all transformation operators / operator-like functions. GGDC10S, wlddev, .COLLAPSE_TOPICS, .COLLAPSE_ALL, .COLLAPSE_GENERIC, .COLLAPSE_DATA, .FAST_FUN, .FAST_STAT_FUN, .OPERATOR_FUN Topic Main Features / Keywords

Details

The added top-level documentation infrastructure in collapse allows you to effectively navigate the package (as in other commercial software documentations like Mathematica). Calling ?FUN brings up the documentation page documenting the function as in other R packages. You can also call topical documentation pages directly from the console. The links to these pages are contained in .COLLAPSE_TOPICS (i.e. calling help(.COLLAPSE_TOPICS[1]) brings up this page).

See Also

collapse-package