Learn R Programming

⚠️There's a newer version (2.1.1) of this package.Take me there.

collapse (version 1.8.9)

Advanced and Fast Data Transformation

Description

A C/C++ based package for advanced data transformation and statistical computing in R that is extremely fast, class-agnostic, and programmer friendly through a flexible and parsimonious syntax. It is well integrated with base R, 'dplyr' / (grouped) 'tibble', 'data.table', 'sf', 'plm' (panel-series and data frames), and non-destructively handles other matrix or data frame based classes (like 'ts', 'xts' / 'zoo', 'tsibble', ...) --- Key Features: --- (1) Advanced statistical programming: A full set of fast statistical functions supporting grouped and weighted computations on vectors, matrices and data frames. Fast and programmable grouping, ordering, unique values/rows, factor generation and interactions. Fast and flexible functions for data manipulation, data object conversions, and memory efficient R programming. (2) Advanced aggregation: Fast and easy multi-data-type, multi-function, weighted and parallelized data aggregation. (3) Advanced transformations: Fast row/column arithmetic, (grouped) replacing and sweeping out of statistics (by reference), (grouped, weighted) scaling/standardizing, (higher-dimensional) between (averaging) and (quasi-)within (demeaning) transformations, linear prediction, model fitting and testing exclusion restrictions. (4) Advanced time-computations: Fast and flexible indexed time series and panel data classes. Fast (sequences of) lags/leads, and (lagged/leaded, iterated, quasi-, log-) differences and (compounded) growth rates on (irregular) time series and panels. Multivariate auto-, partial- and cross-correlation functions for panel data. Panel data to (ts-)array conversions. (5) List processing: Recursive list search, splitting, extraction/subsetting, apply, and generalized row-binding / unlisting to data frame. (6) Advanced data exploration: Fast (grouped, weighted, panel-decomposed) summary statistics and descriptive tools.

Copy Link

Version

Install

install.packages('collapse')

Monthly Downloads

38,990

Version

1.8.9

License

GPL (>= 2) | file LICENSE

Issues

Pull Requests

Stars

Forks

Maintainer

Sebastian Krantz

Last Published

October 7th, 2022

Functions in collapse (1.8.9)

GRP

Fast Grouping / collapse Grouping Objects
across

Apply Functions Across Multiple Columns
TRA

Transform Data by (Grouped) Replacing or Sweeping out Statistics
arithmetic

Fast Row/Column Arithmetic for Matrix-Like Objects
BY

Split-Apply-Combine Computing
collapse-options

collapse Package Options
GGDC10S

Groningen Growth and Development Centre 10-Sector Database
collapse-package

Advanced and Fast Data Transformation
collap

Advanced Data Aggregation
collapse-documentation

Collapse Documentation & Overview
collapse-renamed

Renamed Functions
colorder

Fast Reordering of Data Frame Columns
dapply

Data Apply
get_elem

Find and Extract / Subset List Elements
fFtest

Fast (Weighted) F-test for Linear Models (with Factors)
fast-data-manipulation

Fast Data Manipulation
data-transformations

Data Transformations
descr

Detailed Statistical Description of Data Frame
efficient-programming

Small Functions to Make R Programming More Efficient
fast-grouping-ordering

Fast Grouping and Ordering
fdroplevels

Fast Removal of Unused Factor Levels
ffirst-flast

Fast (Grouped) First and Last Value for Matrix-Like Objects
flag

Fast Lags and Leads for Time Series and Panel Data
fcumsum

Fast (Grouped, Ordered) Cumulative Sum for Matrix-Like Objects
fdiff

Fast (Quasi-, Log-) Differences for Time Series and Panel Data
flm

Fast (Weighted) Linear Model Fitting
fgrowth

Fast Growth Rates for Time Series and Panel Data
fhdbetween-fhdwithin

Higher-Dimensional Centering and Linear Prediction
fast-statistical-functions

Fast (Grouped, Weighted) Statistical Functions for Matrix-Like Objects
fbetween-fwithin

Fast Between (Averaging) and (Quasi-)Within (Centering) Transformations
fndistinct

Fast (Grouped) Distinct Value Count for Matrix-Like Objects
fnobs

Fast (Grouped) Observation Count for Matrix-Like Objects
fnth

Fast (Grouped, Weighted) N'th Element/Quantile for Matrix-Like Objects
fmin-fmax

Fast (Grouped) Maxima and Minima for Matrix-Like Objects
frename

Fast Renaming and Relabelling Objects
fmean

Fast (Grouped, Weighted) Mean for Matrix-Like Objects
fmedian

Fast (Grouped, Weighted) Median Value for Matrix-Like Objects
fprod

Fast (Grouped, Weighted) Product for Matrix-Like Objects
fscale

Fast (Grouped, Weighted) Scaling and Centering of Matrix-like Objects
fmode

Fast (Grouped, Weighted) Statistical Mode for Matrix-Like Objects
fsubset

Fast Subsetting Matrix-Like Objects
fsummarise

Fast Summarise
group

Fast Hash-Based Grouping
groupid

Generate Run-Length Type Group-Id
ftransform

Fast Transform and Compute Columns on a Data Frame
fvar-fsd

Fast (Grouped, Weighted) Variance and Standard Deviation for Matrix-Like Objects
funique

Fast Unique Elements / Rows
psmat

Matrix / Array from Panel Series
qtab

Fast (Weighted) Cross Tabulation
indexing

Fast Indexed Time Series and Panels
quick-conversion

Quick Data Conversion
is_unlistable

Unlistable Lists
fsum

Fast (Grouped, Weighted) Sum for Matrix-Like Objects
pad

Pad Matrix-Like Objects with a Value
psacf

Auto- and Cross- Covariance and Correlation Function Estimation for Panel Series
recode-replace

Recode and Replace Values in Matrix-Like Objects
roworder

Fast Reordering of Data Frame Rows
ldepth

Determine the Depth / Level of Nesting of a List
qF-qG-finteraction

Fast Factor Generation, Interactions and Vector Grouping
rsplit

Fast (Recursive) Splitting
pwcor-pwcov-pwnobs

(Pairwise, Weighted) Correlations, Covariances and Observation Counts
time-series-panel-series

Time Series and Panel Series
qsu

Fast (Grouped, Weighted) Summary Statistics for Cross-Sectional and Panel Data
timeid

Generate Integer-Id From Time/Date Sequences
summary-statistics

Summary Statistics
list-processing

List Processing
radixorder

Fast Radix-Based Ordering
fselect-get_vars-add_vars

Fast Select, Replace or Add Data Frame Columns
t_list

Efficient List Transpose
rapply2d

Recursively Apply a Function to a List of Data Objects
seqid

Generate Group-Id from Integer Sequences
wlddev

World Development Dataset
unlist2d

Recursive Row-Binding / Unlisting in 2D - to Data Frame
varying

Fast Check of Variation in Data
small-helpers

Small (Helper) Functions