Learn R Programming

readMLData (version 0.9-7)

prepareDSList: Prepares a data frame dsList, which describes the data contained in a local data description directory.

Description

The data frame dsList is needed to read the data contained in a directory tree below dsList$pathData using dsRead(). The directory pathDescription is expected to contain the file contents.xml and subdirectory scripts with R scripts for reading the data sets.

Usage

prepareDSList(pathData, pathDescription)

Arguments

pathData
Character. A path to the required data directory.
pathDescription
Character. A path to a directory containing description of the required data, in particular the file "contents.xml".

Value

Data frame with columns pathData, pathDescription, and other as listed by getFields(). The output data frame can be used as dsList parametr of functions dsSearch(), dsRead(), checkConsistency(), checkType().

Details

The character "~" expands to your home directory.

The directory pathData need not contain all the data sets included in pathDescription/contents.xml. The function getAvailable() is called and its output is stored in column availability of the output data frame, which is logical and specifies for each data set, whether it is or is not present.

See http://www.cs.cas.cz/~savicky/readMLData/ for description files of some of the data sets from UCI Machine Learning Repository. See the help page readMLData for more information on the structure of the description files.

See Also

readMLData, getAvailable, checkConsistency.

Examples

Run this code
  pathData <- getPath("exampleData")
  pathDescription <- getPath("exampleDescription")
  dsList <- prepareDSList(pathData, pathDescription)

Run the code above in your browser using DataLab