Reads CSV files and separates the text from the metadata. The result is a
textmeta object.
readTextmeta(
path,
file,
cols,
dateFormat = "%Y-%m-%d",
idCol = "id",
dateCol = "date",
titleCol = "title",
textCol = "text",
encoding = "UTF-8",
xmlAction = TRUE,
duplicateAction = TRUE
)

readTextmeta.df(
df,
cols = colnames(df),
dateFormat = "%Y-%m-%d",
idCol = "id",
dateCol = "date",
titleCol = "title",
textCol = "text",
xmlAction = TRUE,
duplicateAction = TRUE
)
Returns a textmeta object.
path (character/data.frame): string with the path where the data files are, or a data.frame, which is used as the parameter df for readTextmeta.df
file (character): string with the names of the CSV files
cols (character): vector with the columns which should be kept
dateFormat (character): string with the date format in the files, as used by as.Date
idCol (character): string with the column name of the IDs
dateCol (character): string with the column name of the dates
titleCol (character): string with the column name of the titles
textCol (character): string with the column name of the texts
encoding (character): string with the encoding specification of the files
xmlAction (logical): whether all columns of the CSV should be handled with removeXML
duplicateAction (logical): whether deleteAndRenameDuplicates should be applied to the created textmeta object
df (data.frame): table which should be transformed to a textmeta object
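
The following is a minimal sketch of how the two functions might be called, not an official example. It assumes the tosca package (which provides readTextmeta) is installed. The data, the object names (articles, corpus, corpusFromCsv), the file name articles.csv and the temporary directory are all hypothetical and chosen for illustration only. The dates deliberately use a day.month.year layout to show a non-default dateFormat.

```r
# Sketch only: assumes the tosca package is installed.
library(tosca)

# Hypothetical example data; the column names match the default
# idCol, dateCol, titleCol and textCol arguments.
articles <- data.frame(
  id = c("A", "B"),
  date = c("15.01.2020", "03.02.2020"),
  title = c("First title", "Second title"),
  text = c("Text of the first article.", "Text of the second article."),
  stringsAsFactors = FALSE
)

# Variant 1: transform the data.frame directly into a textmeta object.
corpus <- readTextmeta.df(articles, dateFormat = "%d.%m.%Y")

# Variant 2: write the same table to a CSV file and read it from disk.
csvDir <- file.path(tempdir(), "textmeta-example")
dir.create(csvDir, showWarnings = FALSE)
write.csv(articles, file.path(csvDir, "articles.csv"),
          row.names = FALSE, fileEncoding = "UTF-8")
corpusFromCsv <- readTextmeta(path = csvDir, file = "articles.csv",
                              dateFormat = "%d.%m.%Y")

# Inspect the metadata part of the result.
str(corpus$meta)
```

If the dates in the source data do not match dateFormat, as.Date is expected to yield NA values, so it is worth checking the date column of the result after reading.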