read.csv.ddmatrix(file, sep = ",", nrows, ncols, header = FALSE, bldim = 4, num.rdrs = 1, ICTXT = 0, exact.linecount = TRUE)
nrows
is missing, this
determines whether or not the exact number of rows should be determined
(which requires a file read), or if an estimate should be used. Default is
TRUE
, meaning that the file will be scanned.scan()
and read.csv()
, but for
parallel reads into a distributed matrix.The arguments nrow=
and ncol=
are optional. In the case that
they are left blank, they will be determined. However, note that doing so
is costly, so knowing the dimensions beforehand can greatly improve
performance.
Although frankly, the performance-minded should not be using csv's in the
first place. Consider using the pbdNCDF4
package for managing data.