Learn R Programming

datadr: Divide and Recombine in R

datadr is an R package that leverages RHIPE to provide a simple interface to division and recombination (D&R) methods for large complex data.

To get started, see the package documentation and function reference located here.

Visualization tools based on D&R can be found here.

Installation

# from CRAN:
install.packages("datadr")

# from github:
devtools::install_github("delta-rho/datadr")

License

This software is currently under the BSD license. Please read the license document.

Acknowledgement

datadr development is sponsored by:

  • U.S. Department of Defense Advanced Research Projects Agency, XDATA program
  • U.S. Department of Homeland Security, Science and Technology Directorate, Homeland Security Advanced Research Projects Agency (HSARPA)
  • Pacific Northwest National Laboratory, operated by Battelle for the U.S. Department of Energy, LDRD Program, Signature Discovery and Future Power Grid Initiatives

Copy Link

Version

Install

install.packages('datadr')

Monthly Downloads

7

Version

0.8.6.1

License

BSD_3_clause + file LICENSE

Maintainer

Ryan Hafen

Last Published

August 19th, 2018

Functions in datadr (0.8.6.1)

as.data.frame.ddf

Turn 'ddf' Object into Data Frame
ddf

Instantiate a Distributed Data Frame ('ddf')
divide-internals

Functions used in divide()
ddo-ddf-accessors

Accessor Functions
drSubset

Subsetting Distributed Data Frames
divide

Divide a Distributed Data Object
flatten

"Flatten" a ddf Subset
kvApply

Apply Function to Key-Value Pair
drAggregate

Division-Agnostic Aggregation
drRead.table

Data Input
kvPair

Specify a Key-Value Pair
rhipeControl

Specify Control Parameters for RHIPE Job
drSample

Take a Sample of Key-Value Pairs Take a sample of key-value Pairs
recombine

Recombine
rrDiv

Random Replicate Division
to_ddf

Convert dplyr grouped_df to ddf
updateAttributes

Update Attributes of a 'ddo' or 'ddf' Object
removeData

Remove Key-Value Pairs from a Data Connection
combCollect

"Collect" Recombination
combDdf

"DDF" Recombination
bsv

Construct Between Subset Variable (BSV)
condDiv

Conditioning Variable Division
convert

Convert 'ddo' / 'ddf' Objects
charFileHash

Character File Hash Function
drBLB

Bag of Little Bootstraps Transformation Method
combMeanCoef

Mean Coefficient Recombination
drFilter

Filter a 'ddo' or 'ddf' Object
combRbind

"rbind" Recombination
drGLM

GLM Transformation Method
drPersist

Persist a Transformed 'ddo' or 'ddf' Object
drQuantile

Sample Quantiles for 'ddf' Objects
kvPairs

Specify a Collection of Key-Value Pairs
adult

"Census Income" Dataset
print.kvPair

Print a key-value pair
localDiskConn

Connect to Data Source on Local Disk
drGetGlobals

Get Global Variables and Package Dependencies
drLM

LM Transformation Method
drLapply

Apply a function to all key-value pairs of a ddo/ddf object
applyTransform

Apply transformation function(s)
print.kvValue

Print value of a key-value pair
ddo-ddf-attributes

Managing attributes of 'ddo' or 'ddf' objects
ddo

Instantiate a Distributed Data Object ('ddo')
addData

Add Key-Value Pairs to a Data Connection
localDiskControl

Specify Control Parameters for MapReduce on a Local Disk Connection
addTransform

Add a Transformation Function to a Distributed Data Object
ddf-accessors

Accessor methods for 'ddf' objects
datadr-package

datadr
getCondCuts

Get names of the conditioning variable cuts
hdfsConn

Connect to Data Source on HDFS
makeExtractable

Take a ddo/ddf HDFS data object and turn it into a mapfile
drHexbin

HexBin Aggregation for Distributed Data Frames
drJoin

Join Data Sources by Key
%>%

Pipe data
mr-summary-stats

Functions to Compute Summary Statistics in MapReduce
setupTransformEnv

Set up transformation environment
print.ddo

Print a "ddo" or "ddf" Object
mrExec

Execute a MapReduce Job
getSplitVar

Extract "Split" Variable(s)
readHDFStextFile

Experimental HDFS text reader helper function
readTextFileByChunk

Experimental sequential text reader helper function
as.list.ddo

Turn 'ddo' / 'ddf' Object into a list
digestFileHash

Digest File Hash Function
combDdo

"DDO" Recombination
combMean

Mean Recombination