Learn R Programming

ddR (version 0.1.2)

collect: Fetch partition(s) of 'darray', 'dframe' or 'dlist' from remote workers.

Description

Fetch partition(s) of 'darray', 'dframe' or 'dlist' from remote workers.

Usage

collect(dobj, index = NULL)

Arguments

dobj
input distributed array, distributed data frame or distributed list.
index
a vector indicating partitions to fetch. If multiple indices are provided, the result is assembled in the same order as the indices provided, though be aware that for dframes and darrays the result may lose its structure.

Value

An R list, array, or data.frame containing data stored in the partitions of the input.

References

Prasad, S., Fard, A., Gupta, V., Martinez, J., LeFevre, J., Xu, V., Hsu, M., Roy, I. Large scale predictive analytics in Vertica: Fast data transfer, distributed model creation and in-database prediction. _Sigmod 2015_, 1657-1668.

Venkataraman, S., Bodzsar, E., Roy, I., AuYoung, A., and Schreiber, R. (2013) Presto: Distributed Machine Learning and Graph Processing with Sparse Matrices. _EuroSys 2013_, 197-210.

Homepage: https://github.com/vertica/ddR

Examples

Run this code
## Not run: 
# a <- darray(dim=c(9,9),psize=c(3,3),data=5)
# b <- collect(a) # 9x9 matrix filled with 5s
# c <- collect(a,1) # First partition of a, which contains a 3x3 matrix of 5s
# ## End(Not run)

Run the code above in your browser using DataLab