Learn R Programming

⚠️There's a newer version (2.4.6) of this package.Take me there.

qdap (version 0.2.5)

Bridging the gap between qualitative data and quantitative analysis

Description

This package automates many of the tasks associated with quantitative discourse analysis of transcripts containing discourse including frequency counts of sentence types, words, sentences, turns of talk, syllables and other assorted analysis tasks. The package provides parsing tools for preparing transcript data. Many functions enable the user to aggregate data by any number of grouping variables providing analysis and seamless integration with other R packages that undertake higher level analysis and visualization of text. This affords the user a more efficient and targeted analysis. qdap is designed for transcript analysis, however, many functions are applicable to other areas of Text Mining /Natural Language Processing

Copy Link

Version

Install

install.packages('qdap')

Monthly Downloads

1,406

Version

0.2.5

License

GPL-2

Maintainer

Last Published

September 1st, 2013

Functions in qdap (0.2.5)

cm_code.transform

Transform Codes
adverb

Adverb Word List
colsplit2df

Wrapper for colSplit that Returns Dataframe(s)
clean

Remove Escaped Characters
cm_combine.dummy

Find Co-occurrence Between Codes
gantt_wrap

Gantt Plot
OnixTxtRetToolkitSWL1

Onix Text Retrieval Toolkit Stopword List 1
Trim

Remove Leading/Trailing White Space
lookup

Hash Table/Dictionary Lookup
plot.character.table

Plots a character.table Object
cm_distance

Distance Matrix Between Codes
blank2NA

Replace Blanks in a dataframe
polarity

Polarity Score (Sentiment Analysis)
beg2char

Grab Begin/End of Sting to Character
cm_code.combine

Combine Codes
Top200Words

Fry's 200 Most Commonly Used English Words
action.verbs

Action Word List
plot.formality

Plots a formality Object
NAMES_LIST

First Names and Predictive Gender (U.S.) List
cm_code.exclude

Exclude Codes
rm_row

Remove Rows That Contain Markers
negation.words

Negating Words
cm_code.overlap

Find Co-occurrence Between Codes
cm_df2long

Transform Codes to Start-End Durations
url_dl

Download Instructional Documents
cm_range.temp

Range Code Sheet
replace_abbreviation

Replace Abbreviations
diversity

Diversity Statistics
pos

Parts of Speech Tagging
print.pos.by

Prints a pos.by Object.
env.syn

Syllable Lookup Environment
bag.o.words

Bag of Words
NAMES

First Names and Gender (U.S.)
qheat

Quick Heatmap
gantt_rep

Generate Unit Spans for Repeated Measures
qcv

Quick Character Vector
cm_dummy2long

Convert cm_combine.dummy Back to Long
Search

Search Columns of a Data Frame
Top100Words

Fry's 100 Most Commonly Used English Words
ngrams

Generate ngrams
BuckleySaltonSWL

Buckley & Salton Stopword List
DATA

Fictitious Classroom Dialogue
text2color

Map Words to Colors
adjacency_matrix

Takes a Matrix and Generates an Adjacency Matrix
NAMES_SEX

First Names and Predictive Gender (U.S.)
sentSplit

Sentence Splitting
key_merge

Merge Demographic Information with Person/Text Transcript
gantt_plot

Gantt Plot
stopwords

Remove Stopwords
DICTIONARY

Nettalk Corpus Syllable Data Set
SYNONYM

Synonyms Data Set
raj.act.1

Romeo and Juliet: Act 1
cm_time.temp

Time Span Code Sheet
abbreviations

Small Abbreviations Data Set
kullback.leibler

Kullback Leibler Statistic
automated_readability_index

Readability Measures
word.count

Word Counts
print.adjacency_matrix

Prints an adjacency_matrix Object
htruncdf

Dataframe Viewing
contractions

Contraction Conversions
replace_contraction

Replace Contractions
print.character.table

Prints a character.table object
plot.polarity

Plots a polarity Object
common

Find Common Words Between Groups
env.syl

Syllable Lookup Environment
multiscale

Nested Standardization
termco

Search For and Count Terms
gantt

Generate Unit Spans
rajPOS

Romeo and Juliet Split in Parts of Speech
dir_map

Map Transcript Files from a Directory to a Script
print.colsplit2df

Prints a colsplit2df Object.
tdm

Convert/Generate Term Document Matrix
question_type

Count of Question Type
capitalizer

Capitalize Select Words
interjections

Interjections
exclude

Exclude Elements From a Vector
potential_NA

Search for Potential Missing Values
end_mark

Sentence End marks
stemmer

Stem Text
left.just

Text Justification
print.word_associate

Prints a word_associate object
Top25Words

Fry's 25 Most Commonly Used English Words
raj.act.5

Romeo and Juliet: Act 5
read.transcript

Read Transcripts Into R
rajSPLIT

Romeo and Juliet (Complete & Split)
cm_long2dummy

Stretch and Dummy Code cm_xxx2long
print.cm_distance

Prints a cm_distance Object
replace_symbol

Replace Symbols With Word Equivalents
replacer

Replace Cells in a Matrix or Data Frame
cm_df.temp

Break Transcript Dialogue into Blank Code Matrix
incomplete.replace

Denote Incomplete End Marks With "|"
outlier.labeler

Locate Outliers in Numeric String
common.list

list Method for common
increase.amplification.words

Amplifying Words
formality

Formality Score
hms2sec

Convert h:m:s to Seconds
plot.termco

Plots a termco object
outlier.detect

Detect Outliers in Text
duplicates

Find Duplicated Words in a Text String
colSplit

Separate a Column Pasted by paste2
print.formality

Prints a formality Object
print.question_type

Prints a question_type object
imperative

Intuitively Remark Sentences as Imperative
qdap

qdap: Quantitative Discourse Analysis Package
raj.act.2

Romeo and Juliet: Act 2
new_project

Project Template
mraja1spl

Romeo and Juliet: Act 1 Dialogue Merged with Demographics and Split
emoticon

Emoticons Data Set
labMT

Language Assessment by Mechanical Turk (labMT) Sentiment Words
name2sex

Names to Gender Prediction
multigsub

Multiple gsub
plot.diversity

Plots a diversity object
mcsv_r

Read/Write Multiple csv Files at a Time
scrubber

Clean Imported Text
strWrap

Wrap Character Strings to Format Paragraphs
negative.words

Negative Words
raj

Romeo and Juliet (Unchanged & Complete)
synonyms

Search For Synonyms
prop

Convert Raw Numeric Matrix or Data Frame to Proportions
plot.question_type

Plots a question_type Object
positive.words

Positive Words
replace_number

Replace Numbers With Text Representation
plot.word_stats

Plots a word_stats object
word_diff_list

Differences In Word Use Between Groups
syllable.sum

Syllabication
trans.cloud

Word Clouds by Grouping Variable
sec2hms

Convert Seconds to h:m:s
raj.demographics

Romeo and Juliet Demographics
cm_df.fill

Range Coding
word_stats

Descriptive Word Statistics
print.kullback.leibler

Prints a kullback.leibler Object.
cm_time2long

Transform Codes to Start-End Times
print.qdapProj

Prints a qdapProj Object
print.termco

Prints a termco object.
print.word_list

Prints a word_list Object
qprep

Quick Preparation of Text
print.ngrams

Prints an ngrams object
print.pos

Prints a pos Object.
dissimilarity

Dissimilarity Statistics
spaste

Add Leading/Trailing Spaces
word_associate

Find Associated Words.
paste2

Paste an Unspecified Number Of Text Columns
speakerSplit

Break and Stretch if Multiple Persons per Cell
rank_freq_mplot

Rank Frequency Plot
print.polarity

Prints a polarity Object
raj.act.3

Romeo and Juliet: Act 3
print.v.outer

Prints a v.outer Object.
termco.c

Combine Columns from a termco Object
raj.act.4

Romeo and Juliet: Act 4
gradient_cloud

Gradient Word Cloud
strip

Strip Text
space_fill

Replace Spaces
DATA2

Fictitious Repeated Measures Classroom Dialogue
word.network.plot

Word Network Plot
cm_df.transcript

Transcript With Word Number
all_words

Searches Text Column for Words
distTab

SPSS Style Frequency Tables
NAer

Replace Missing Values (NA)
preposition

Preposition Words
print.diversity

Prints a diversity object
v.outer

Vectorized Version of outer
bracketX

Bracket Parsing
cm_code.blank

Blank Code Transformation
hash

Hash/Dictionary Lookup
print.word_stats

Prints a word_stats object
mraja1

Romeo and Juliet: Act 1 Dialogue Merged with Demographics
qcombine

Combine Columns
word_list

Raw Word Lists/Frequency Counts
wfm

Word Frequency Matrix
cm_range2long

Transform Codes to Start-End Durations
plot.pos.by

Plots a pos.by Object
end_inc

Test for Incomplete Sentences
print.dissimilarity

Prints a dissimilarity object
tot_plot

Visualize Word Length by Turn of Talk
trans.venn

Venn Diagram by Grouping Variable