Learn R Programming

datadr: Divide and Recombine in R

datadr is an R package that leverages RHIPE to provide a simple interface to division and recombination (D&R) methods for large complex data.

To get started, see the package documentation and function reference located here.

Visualization tools based on D&R can be found here.

Installation

# from CRAN:
install.packages("datadr")

# from github:
devtools::install_github("delta-rho/datadr")

License

This software is currently under the BSD license. Please read the license document.

Acknowledgement

datadr development is sponsored by:

  • U.S. Department of Defense Advanced Research Projects Agency, XDATA program
  • U.S. Department of Homeland Security, Science and Technology Directorate, Homeland Security Advanced Research Projects Agency (HSARPA)
  • Pacific Northwest National Laboratory, operated by Battelle for the U.S. Department of Energy, LDRD Program, Signature Discovery and Future Power Grid Initiatives

Copy Link

Version

Install

install.packages('datadr')

Monthly Downloads

57

Version

0.8.6.1

License

BSD_3_clause + file LICENSE

Maintainer

Last Published

August 19th, 2018

Functions in datadr (0.8.6.1)

as.data.frame.ddf

Turn 'ddf' Object into Data Frame
ddf

Instantiate a Distributed Data Frame ('ddf')
divide-internals

Functions used in divide()
ddo-ddf-accessors

Accessor Functions
drSubset

Subsetting Distributed Data Frames
divide

Divide a Distributed Data Object
flatten

"Flatten" a ddf Subset
kvApply

Apply Function to Key-Value Pair
drAggregate

Division-Agnostic Aggregation
drRead.table

Data Input
kvPair

Specify a Key-Value Pair
rhipeControl

Specify Control Parameters for RHIPE Job
drSample

Take a Sample of Key-Value Pairs Take a sample of key-value Pairs
recombine

Recombine
rrDiv

Random Replicate Division
to_ddf

Convert dplyr grouped_df to ddf
updateAttributes

Update Attributes of a 'ddo' or 'ddf' Object
removeData

Remove Key-Value Pairs from a Data Connection
combCollect

"Collect" Recombination
combDdf

"DDF" Recombination
bsv

Construct Between Subset Variable (BSV)
condDiv

Conditioning Variable Division
convert

Convert 'ddo' / 'ddf' Objects
charFileHash

Character File Hash Function
drBLB

Bag of Little Bootstraps Transformation Method
combMeanCoef

Mean Coefficient Recombination
drFilter

Filter a 'ddo' or 'ddf' Object
combRbind

"rbind" Recombination
drGLM

GLM Transformation Method
drPersist

Persist a Transformed 'ddo' or 'ddf' Object
drQuantile

Sample Quantiles for 'ddf' Objects
kvPairs

Specify a Collection of Key-Value Pairs
adult

"Census Income" Dataset
print.kvPair

Print a key-value pair
localDiskConn

Connect to Data Source on Local Disk
drGetGlobals

Get Global Variables and Package Dependencies
drLM

LM Transformation Method
drLapply

Apply a function to all key-value pairs of a ddo/ddf object
applyTransform

Apply transformation function(s)
print.kvValue

Print value of a key-value pair
ddo-ddf-attributes

Managing attributes of 'ddo' or 'ddf' objects
ddo

Instantiate a Distributed Data Object ('ddo')
addData

Add Key-Value Pairs to a Data Connection
localDiskControl

Specify Control Parameters for MapReduce on a Local Disk Connection
addTransform

Add a Transformation Function to a Distributed Data Object
ddf-accessors

Accessor methods for 'ddf' objects
datadr-package

datadr
getCondCuts

Get names of the conditioning variable cuts
hdfsConn

Connect to Data Source on HDFS
makeExtractable

Take a ddo/ddf HDFS data object and turn it into a mapfile
drHexbin

HexBin Aggregation for Distributed Data Frames
drJoin

Join Data Sources by Key
%>%

Pipe data
mr-summary-stats

Functions to Compute Summary Statistics in MapReduce
setupTransformEnv

Set up transformation environment
print.ddo

Print a "ddo" or "ddf" Object
mrExec

Execute a MapReduce Job
getSplitVar

Extract "Split" Variable(s)
readHDFStextFile

Experimental HDFS text reader helper function
readTextFileByChunk

Experimental sequential text reader helper function
as.list.ddo

Turn 'ddo' / 'ddf' Object into a list
digestFileHash

Digest File Hash Function
combDdo

"DDO" Recombination
combMean

Mean Recombination