Learn R Programming

StatMatch (version 1.4.2)

Statistical Matching or Data Fusion

Description

Integration of two data sources referred to the same target population which share a number of variables. Some functions can also be used to impute missing values in data sets through hot deck imputation methods. Methods to perform statistical matching when dealing with data from complex sample surveys are available too.

Copy Link

Version

Install

install.packages('StatMatch')

Monthly Downloads

1,383

Version

1.4.2

License

GPL (>= 2)

Issues

Pull Requests

Stars

Forks

Last Published

May 13th, 2024

Functions in StatMatch (1.4.2)

RANDwNND.hotdeck

Random Distance hot deck.
StatMatch-package

Statistical Matching or Data Fusion
comb.samples

Statistical Matching of data from complex sample surveys
Frechet.bounds.cat

Frechet bounds of cells in a contingency table
create.fused

Creates a matched (synthetic) dataset
Fbwidths.by.x

Computes the Frechet bounds of cells in a contingency table by considering all the possible subsets of the common variables.
comp.cont

Empirical comparison of two estimated distributions of the same continuous variable
comp.prop

Compares two distributions of the same categorical variable
selMtc.by.unc

Identifies the best combination if matching variables in reducing uncertainty in estimation the contingency table Y vs. Z.
NND.hotdeck

Distance Hot Deck method.
mahalanobis.dist

Computes the Mahalanobis Distance
harmonize.x

Harmonizes the marginal (joint) distribution of a set of variables observed independently in two sample surveys referred to the same target population
gower.dist

Computes the Gower's Distance
plotBounds

Graphical representation of the uncertainty bounds estimated through the Frechet.bounds.cat function
pBayes

Pseudo-Bayes estimates of cell probabilities
fact2dummy

Transforms a categorical variable in a set of dummy variables
plotCont

graphical comparison of the estimated distributions for the same continuous variable.
mixed.mtc

Statistical Matching via Mixed Methods
rankNND.hotdeck

Rank distance hot deck method.
create.imputed

Fills-in missing values in the recipient dataset with values observed on the donors units
maximum.dist

Computes the Maximum Distance
samp.A

Artificial data set resembling EU--SILC survey
samp.C

Artificial data set resembling EU--SILC survey
samp.B

Artificial data set resembling EU--SILC survey
pw.assoc

Pairwise measures between categorical variables
plotTab

Graphical comparison of the estimated distributions for the same categorical variable.