A "word" is defined as a string of alphabetical characters separated by either spaces or dashes (but not other special characters).
get_word_count(voter_file, surname_col = "last_name", regex = "[ -]+")
A vector of word counts.
The voter file, with each row consisting of a voter.
A string denoting the surname column.
A string denoting the regular expression to use for querying the the word count.