Reads CSV-files and seperates the text and meta data. The result is a
textmeta object.
readTextmeta(
path,
file,
cols,
dateFormat = "%Y-%m-%d",
idCol = "id",
dateCol = "date",
titleCol = "title",
textCol = "text",
encoding = "UTF-8",
xmlAction = TRUE,
duplicateAction = TRUE
)readTextmeta.df(
df,
cols = colnames(df),
dateFormat = "%Y-%m-%d",
idCol = "id",
dateCol = "date",
titleCol = "title",
textCol = "text",
xmlAction = TRUE,
duplicateAction = TRUE
)
textmeta object
character/data.frame string with path where the data files
are OR parameter df for readTextmeta.df
character string with names of the CSV files
character vector with columns which should be kept
character string with the date format in the files
for as.Date
character string with column name of the IDs
character string with column name of the Dates
character string with column name of the Titles
character string with column name of the Texts
character string with encoding specification of the files
logical whether all columns of the CSV should be
handled with removeXML
logical
whether deleteAndRenameDuplicates should be applied to the
created textmeta object
data.frame table which should be transformed to a textmeta object