powered by
Transforms text in koRpus objects token by token.
kRp.text.transform(txt, scheme, p = 0.5, paste = FALSE)
An object of class kRp.txt.trans-class, kRp.tagged-class, kRp.txt.freq-class or kRp.analysis-class.
kRp.txt.trans-class
kRp.tagged-class
kRp.txt.freq-class
kRp.analysis-class
One of the following character strings:
"minor" Start each word with a lowercase letter.
"minor"
"all.minor" Forces all letters into lowercase.
"all.minor"
"major" Start each word with a uppercase letter.
"major"
"all.major" Forces all letters into uppercase.
"all.major"
"random" Randomly start words with uppercase or lowercase letters.
"random"
"de.norm" German norm: All names, nouns and sentence beginnings start with an uppercase letter, anything else with a lowercase letter.
"de.norm"
"de.inv" Inversion of "de.norm".
"de.inv"
"eu.norm" Usual European cases: Only names and sentence beginnings start with an uppercase letter, anything else with a lowercase letter.
"eu.norm"
"eu.inv" Inversion of "eu.norm".
"eu.inv"
Numeric value between 0 and 1. Defines the probability for upper case letters (relevant only if scheme="random").
scheme="random"
Logical, see value section.
By default an object of class kRp.txt.trans-class is returned. If paste=TRUE, returns an atomic character vector (via kRp.text.paste).
paste=TRUE
kRp.text.paste
This function is mainly intended to produce text material for experiments.
# NOT RUN { tagged.text.obj <- freq.analysis("/some/text.txt", corp.freq=my.LCC.data) kRp.text.transform(tagged.text.obj, scheme="random", paste=TRUE) # }
Run the code above in your browser using DataLab