fastLink (version 0.6.1)

Fast Probabilistic Record Linkage with Missing Data

Description

Implements a Fellegi-Sunter probabilistic record linkage model that allows for missing data and the inclusion of auxiliary information. This includes functionality to merge two datasets under the Fellegi-Sunter model using the Expectation-Maximization (EM) algorithm, along with tools for preparing, adjusting, and summarizing data merges. The package implements methods described in Enamorado, Fifield, and Imai (2019), "Using a Probabilistic Model to Assist Merging of Large-scale Administrative Records", and is available at .
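
To make the model in the description concrete, here is a minimal base R sketch of how a Fellegi-Sunter posterior match probability can be computed for a single pair of records. It is not the package's internal code. It assumes binary agree/disagree comparisons, conditional independence across fields, and that missing comparisons simply drop out of the likelihood. The helper `fs_posterior()` and the values of `m`, `u`, and `lambda` are hypothetical and chosen only for illustration; in practice such parameters would be estimated by EM.

```r
# Posterior probability that a record pair is a match, given its
# agreement pattern, under a simplified Fellegi-Sunter model.
#   gamma  : per-field comparison, 1 = agree, 0 = disagree, NA = missing
#   m, u   : P(agree | match) and P(agree | non-match) for each field
#   lambda : prior probability that a randomly chosen pair is a match
fs_posterior <- function(gamma, m, u, lambda) {
  stopifnot(length(gamma) == length(m), length(m) == length(u))
  obs <- !is.na(gamma)  # fields with a missing comparison are skipped
  g <- gamma[obs]
  lik_match    <- prod(m[obs]^g * (1 - m[obs])^(1 - g))
  lik_nonmatch <- prod(u[obs]^g * (1 - u[obs])^(1 - g))
  lambda * lik_match / (lambda * lik_match + (1 - lambda) * lik_nonmatch)
}

# Illustrative (made-up) parameters for three fields
m <- c(firstname = 0.95, lastname = 0.95, birthyear = 0.90)
u <- c(firstname = 0.01, lastname = 0.02, birthyear = 0.05)
lambda <- 0.001

p_all     <- fs_posterior(c(1, 1, 1),  m, u, lambda)  # every field agrees
p_missing <- fs_posterior(c(1, 1, NA), m, u, lambda)  # birthyear missing
p_none    <- fs_posterior(c(0, 0, 0),  m, u, lambda)  # every field disagrees
print(c(all = p_all, missing = p_missing, none = p_none))
```

With these numbers, a pair that agrees on every field receives a higher posterior than a pair that agrees on two fields with the third missing, because the missing field contributes no evidence in either direction.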

Install

install.packages('fastLink')
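
Once the package is installed, a typical merge might look like the sketch below. It uses the bundled sample datasets `dfA` and `dfB` together with the `fastLink()`, `getMatches()`, and `summary()` functions listed on this page. The column names and argument values are assumptions based on the package's published examples, so check `?fastLink` and `?getMatches` before relying on them. The `requireNamespace()` guard is added here only so the snippet does nothing when the package is absent.

```r
# Guarded so the script is a no-op when fastLink is not installed
if (requireNamespace("fastLink", quietly = TRUE)) {
  library(fastLink)
  data(samplematch)  # loads the sample data frames dfA and dfB

  # Fit the Fellegi-Sunter model by EM on the listed linkage fields.
  # Column names are assumed to match those in the sample data.
  matches.out <- fastLink(
    dfA = dfA, dfB = dfB,
    varnames = c("firstname", "middlename", "lastname",
                 "housenum", "streetname", "city", "birthyear"),
    # fields compared by string distance rather than exact equality
    stringdist.match = c("firstname", "middlename", "lastname",
                         "streetname", "city"),
    # fields that may also count as a partial agreement
    partial.match = c("firstname", "lastname", "streetname")
  )

  # Retrieve record pairs whose posterior match probability is >= 0.85
  matched <- getMatches(dfA = dfA, dfB = dfB,
                        fuzzy.out = matches.out, threshold.match = 0.85)

  # Summarize the merge (match counts and estimated error rates)
  print(summary(matches.out))
}
```

The 0.85 threshold is only an example; a higher value trades fewer false matches for more missed ones.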

Monthly Downloads

574

Version

0.6.1

License

GPL (>= 3)

Maintainer

Ted Enamorado

Last Published

November 17th, 2023

Functions in fastLink (0.6.1)

gammaNUMCK2par

plot.fastLink
Plot matching patterns of the EM object by posterior probability of match

stateoutflow
State-level outflow rates by state

gammaNUMCKpar

inspectEM

stringSubset

getPosterior

stateinflow
State-level inflow rates by state

gammaCKpar

matchesLink

nameReweight

gammaKpar

statemove
In-state movers rates by state

getPatterns

summary.fastLink
Get summaries of fastLink() objects

getMatches

print.inspectEM

preprocText

statefips
State-level FIPS Codes

tableCounts

aggregateEM
Aggregate EM objects for use in `summary.fastLink()`

aggconfusion

countyfips
County-level FIPS Codes

clusterMatch

calcMoversPriors

dedupeMatches

confusion
Get confusion table for fastLink objects

blockData

emlinkRS

emlinklog

emlinkMARmov

fastLink

gammaCK2par

countyoutflow
County-level outflow rates by state

dfB
Sample dataset B

dfA
Sample dataset A

fastLink-package
Fast Probabilistic Record Linkage with Missing Data

countyinflow
County-level inflow rates by state