powered by
Read in the content from a .doc file using antiword via the antiword package.
read_doc(file, skip = 0, remove.empty = TRUE, trim = TRUE, format = FALSE, ...)
The path to the .doc file.
The number of lines to skip.
logical. If TRUE empty elements in the vector are removed.
TRUE
logical. If TRUE the leading/training white space is removed.
logical. If TRUE the output will keep doc formatting (e.g., bold, italics, underlined). This corresponds to the -f flag in antiword.
-f
ignored.
Returns a character vector.
# NOT RUN { x <- system.file("docs/Yasmine_Interview_Transcript.doc", package = "textreadr") read_doc(x) # }
Run the code above in your browser using DataLab