Wrapper for bracketX
, replace_number
,
replace_symbol
, replace_abbreviation
and scrubber
to quickly prepare text for analysis. Care
should be taken with this function to ensure data is properly formatted and
complete.
qprep(
text.var,
rm.dash = TRUE,
bracket = "all",
missing = NULL,
names = FALSE,
abbreviation = qdapDictionaries::abbreviations,
replace = NULL,
ignore.case = TRUE,
num.paste = TRUE,
...
)
The text variable.
logical. If TRUE
dashes will be removed.
The type of bracket (and encased text) to remove. This is one
of the strings "curly"
, "square"
, "round"
,
"angle"
and "all"
. These strings correspond to: {, [, (, <
or all four types. Also takes the argument NULL
which turns off this
parsing technique.
Value to assign to empty cells.
logical. If TRUE
the sentences are given as the names of
the counts.
A two column key of abbreviations (column 1) and long
form replacements (column 2) or a vector of abbreviations. Default is to use
qdap's abbreviations data set. Also takes the argument NULL
which
turns off this parsing technique.
A vector of long form replacements if a data frame is not supplied to the abbreviation argument.
logical. If TRUE
replaces without regard to
capitalization.
logical. If TURE
a the elements of larger numbers are
separated with spaces. If FALSE
the elements will be joined without
spaces. Also takes the argument NULL
which turns off this parsing
technique.
Other arguments passed to replace_symbol
.
bracketX
,
replace_abbreviation
,
replace_number
,
replace_symbol
if (FALSE) {
x <- "I like 60 (laughter) #d-bot and $6 @ the store w/o 8p.m."
qprep(x)
}
Run the code above in your browser using DataLab