⚠️ There's a newer version (2.4.6) of this package.

qdap (version 1.3.5)

Bridging the gap between qualitative data and quantitative analysis

Description

This package automates many of the tasks associated with quantitative discourse analysis of transcripts, including frequency counts of sentence types, words, sentences, turns of talk, and syllables, along with other assorted analysis tasks. The package provides parsing tools for preparing transcript data. Many functions let the user aggregate data by any number of grouping variables, and the results integrate with other R packages that undertake higher-level analysis and visualization of text. This affords the user a more efficient and targeted analysis. qdap is designed for transcript analysis; however, many functions are applicable to other areas of text mining and natural language processing.
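
As a rough illustration of aggregating by grouping variables, the sketch below counts words per row and then counts selected terms by two grouping variables. It assumes qdap is installed and that the bundled DATA set has the columns `state` (the text), `sex`, and `adult`; the term list `pronouns` is made up for this example.

```r
# Sketch only: the qdap calls run only when the package can be loaded.
if (requireNamespace("qdap", quietly = TRUE)) {
  library(qdap)

  # Number of words in each row of the text column
  words_per_turn <- word_count(DATA$state)

  # Hypothetical term list; spaces around each term match whole words only
  pronouns <- list(pronouns = c(" we ", " you "))

  # Term counts aggregated by two grouping variables at once
  term_counts <- with(DATA, termco(state, list(sex, adult), pronouns))
  print(term_counts)
}
```

Passing a single vector instead of `list(sex, adult)` should group by one variable only.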

Install

install.packages('qdap')
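
Note that `install.packages` fetches whatever release the configured repository currently offers, which may not be 1.3.5. A minimal sketch of installing only when the package is missing follows; `ensure_installed` is a hypothetical helper written for this example, not part of qdap.

```r
# Hypothetical helper (not part of qdap): install a package only if missing.
ensure_installed <- function(pkg) {
  if (!requireNamespace(pkg, quietly = TRUE)) {
    install.packages(pkg)
  }
  # TRUE when the package can be loaded after the attempt, FALSE otherwise
  requireNamespace(pkg, quietly = TRUE)
}

# Usage (commented out because it may download from CRAN):
# ensure_installed("qdap")
```

To pin this exact release, one option is `install_version("qdap", version = "1.3.5")` from the remotes package, assuming remotes is installed and the archived source still builds on your system.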

Monthly Downloads

1,406

Version

1.3.5

License

GPL-2

Maintainer

Last Published

April 8th, 2014

Functions in qdap (1.3.5)

all_words

Searches Text Column for Words
DATA2

Fictitious Repeated Measures Classroom Dialogue
Filter.all_words

Filter
cm_code.exclude

Exclude Codes
is.global

Test If Environment is Global
colSplit

Separate a Column Pasted by paste2
cm_code.combine

Combine Codes
Animate.formality

Animate Formality
clean

Remove Escaped Characters
cm_range.temp

Range Code Sheet
cm_combine.dummy

Find Co-occurrence Between Dummy Codes
incomplete_replace

Denote Incomplete End Marks With "|"
Animate.polarity

Animate Polarity
gantt_rep

Generate Unit Spans for Repeated Measures
cm_distance

Distance Matrix Between Codes
Trim

Remove Leading/Trailing White Space
plot.pos

Plots a pos Object
imperative

Intuitively Remark Sentences as Imperative
adjacency_matrix

Takes a Matrix and Generates an Adjacency Matrix
Animate.gantt

Gantt Durations
counts.character_table

Term Counts
new_project

Project Template
Dissimilarity

Dissimilarity Statistics
counts.termco

Term Counts
bag_o_words

Bag of Words
cm_df.transcript

Transcript With Word Number
counts.flesch_kincaid

Readability Measures
beg2char

Grab Begin/End of String to Character
cm_code.transform

Transform Codes
Title

Add Title to Select qdap Plots
cm_time2long

Transform Codes to Start-End Times
freq_terms

Find Frequent Terms
cm_time.temp

Time Span Code Sheet
counts.end_mark_by

Question Counts
cm_code.blank

Blank Code Transformation
cm_dummy2long

Convert cm_combine.dummy Back to Long
DATA

Fictitious Classroom Dialogue
dir_map

Map Transcript Files from a Directory to a Script
print.boolean_qdap

Prints a boolean_qdap object
multigsub

Multiple gsub
NAer

Replace Missing Values (NA)
common

Find Common Words Between Groups
colcomb2class

Combine Columns to Class
counts.question_type

Question Counts
DATA.SPLIT

Fictitious Split Sentence Classroom Dialogue
end_inc

Test for Incomplete Sentences
print.SMOG

Prints a SMOG Object
automated_readability_index

Readability Measures
hash

Hash/Dictionary Lookup
counts

Generic Counts Method
cm_2long

A Generic to Long Function
plot.end_mark_by_preprocessed

Plots an end_mark_by_preprocessed Object
htruncdf

Dataframe Viewing
capitalizer

Capitalize Select Words
bracketX

Bracket Parsing
formality

Formality Score
plot.pos_by

Plots a pos_by Object
counts.pos_by

Parts of Speech
cm_df2long

Transform Codes to Start-End Durations
preprocessed

Generic Preprocessed Method
plot.SMOG

Plots a SMOG Object
common.list

list Method for common
dist_tab

SPSS Style Frequency Tables
cm_code.overlap

Find Co-occurrence Between Codes
cm_df.fill

Range Coding
left_just

Text Justification
exclude

Exclude Elements From a Vector
phrase_net

Phrase Nets
outlier_detect

Detect Outliers in Text
hms2sec

Convert h:m:s to Seconds
plot.table_count

Plots a table_count Object
gradient_cloud

Gradient Word Cloud
counts.SMOG

Readability Measures
gantt_wrap

Gantt Plot
preprocessed.pos_by

Parts of Speech
mraja1spl

Romeo and Juliet: Act 1 Dialogue Merged with Demographics and Split
plot.animated_formality

Plots an animated_formality Object
plot.end_mark_by_proportion

Plots an end_mark_by_proportion Object
print.question_type

Prints a question_type object
print.linsear_write_scores

Prints a linsear_write_scores Object
cm_range2long

Transform Codes to Start-End Durations
name2sex

Names to Gender Prediction
colsplit2df

Wrapper for colSplit that Returns Dataframe(s)
id

ID By Row Number or Sequence Along
plot.animated_polarity

Plots an animated_polarity Object
wfm

Word Frequency Matrix
key_merge

Merge Demographic Information with Person/Text Transcript
plot.formality

Plots a formality Object
blank2NA

Replace Blanks in a dataframe
Search

Search Columns of a Data Frame
print.question_type_preprocessed

Prints a question_type_preprocessed object
vertex_apply

Apply Parameter to List of Igraph Vertices/Edges
print.readability_score

Prints a readability_score Object
plot.cmspans

Plots a cmspans object
plot.automated_readability_index

Plots an automated_readability_index Object
plot.rmgantt

Plots a rmgantt object
raj.act.3

Romeo and Juliet: Act 3
prop

Convert Raw Numeric Matrix or Data Frame to Proportions
counts.coleman_liau

Readability Measures
counts.linsear_write

Readability Measures
plot.freq_terms

Plots a freq_terms Object
diversity

Diversity Statistics
raj

Romeo and Juliet (Unchanged & Complete)
t.TermDocumentMatrix

Transposes a TermDocumentMatrix object
counts.polarity

Polarity
speakerSplit

Break and Stretch if Multiple Persons per Cell
rm_row

Remove Rows That Contain Markers
plot.linsear_write_scores

Plots a linsear_write_scores Object
rajSPLIT

Romeo and Juliet (Complete & Split)
print.discourse_map

Prints a discourse_map Object
plot.end_mark_by_count

Plots an end_mark_by_count Object
mcsv_r

Read/Write Multiple csv Files at a Time
kullback_leibler

Kullback Leibler Statistic
word_count

Word Counts
print.pos

Prints a pos Object.
condense

Condense Dataframe Columns
plot.sums_gantt

Plots a sums_gantt object
polarity

Polarity Score (Sentiment Analysis)
plot.end_mark_by_score

Plots an end_mark_by_score Object
scores.word_stats

Word Stats
plot.linsear_write_count

Plots a linsear_write_count Object
plot.termco

Plots a termco object
counts.pos

Parts of Speech
plot.gantt

Plots a gantt object
plot.diversity

Plots a diversity object
mraja1

Romeo and Juliet: Act 1 Dialogue Merged with Demographics
print.character_table

Prints a character_table object
print.linsear_write_count

Prints a linsear_write_count Object
plot.table_proportion

Plots a table_proportion Object
plot.question_type

Plots a question_type Object
print.fry

Prints a fry Object
counts.formality

Formality
plot.polarity_count

Plots a polarity_count Object
replace_number

Replace Numbers With Text Representation
scores.end_mark_by

Question Counts
scores.formality

Formality
plot.end_mark_by

Plots an end_mark_by Object
preprocessed.question_type

Question Counts
print.animated_discourse_map

Prints an animated_discourse_map Object
plot.wfm

Plots a wfm object
plot.readability_score

Plots a readability_score Object
plot.cm_distance

Plots a cm_distance object
print.coleman_liau

Prints a coleman_liau Object
plot.pos_preprocessed

Plots a pos_preprocessed Object
plot.formality_scores

Plots a formality_scores Object
summary.wfm

Summarize a wfm object
plot.polarity_score

Plots a polarity_score Object
sec2hms

Convert Seconds to h:m:s
print.pos_preprocessed

Prints a pos_preprocessed object
print.animated_polarity

Prints an animated_polarity Object
plot.table_score

Plots a table_score Object
print.qdapProj

Prints a qdapProj Object
plot.linsear_write

Plots a linsear_write Object
gantt_plot

Gantt Plot
plot.word_cor

Plots a word_cor object
v_outer

Vectorized Version of outer
print.polarity_count

Prints a polarity_count Object
plot.wfdf

Plots a wfdf object
discourse_map

Discourse Mapping
termco

Search For and Count Terms
print.colsplit2df

Prints a colsplit2df Object.
plot.readability_count

Plots a readability_count Object
plot.word_stats

Plots a word_stats object
trans_venn

Venn Diagram by Grouping Variable
pos

Parts of Speech Tagging
plot.animated_discourse_map

Plots an animated_discourse_map Object
plot.flesch_kincaid

Plots a flesch_kincaid Object
print.wfm

Prints a wfm Object
termco_c

Combine Columns from a termco Object
print.sent_split

Prints a sent_split object
preprocessed.formality

Formality
print.cm_distance

Prints a cm_distance Object
plot.word_proximity

Plots a word_proximity object
print.flesch_kincaid

Prints a flesch_kincaid Object
print.sum_cmspans

Prints a sum_cmspans object
proportions.question_type

Question Counts
preprocessed.pos

Parts of Speech
print.wfm_summary

Prints a wfm_summary Object
proportions.termco

Term Counts
print.end_mark_by

Prints an end_mark_by object
print.polarity_score

Prints a polarity_score Object
duplicates

Find Duplicated Words in a Text String
raj.act.2

Romeo and Juliet: Act 2
lookup

Hash Table/Dictionary Lookup
print.all_words

Prints an all_words Object
plot.polarity

Plots a polarity Object
gantt

Gantt Durations
Animate.gantt_plot

Gantt Plot
print.table_proportion

Prints a table_proportion object
end_mark

Sentence End Marks
ngrams

Generate ngrams
cm_df.temp

Break Transcript Dialogue into Blank Code Matrix
raj.demographics

Romeo and Juliet Demographics
print.animated_formality

Prints an animated_formality Object
plot.weighted_wfm

Plots a weighted_wfm object
print.word_cor

Prints a word_cor object
preprocessed.end_mark_by

Question Counts
proportions.character_table

Term Counts
proportions.formality

Formality
print.v_outer

Prints a v_outer Object.
qcv

Quick Character Vector
proportions

Generic Proportions Method
word_diff_list

Differences In Word Use Between Groups
print.word_stats_counts

Prints a word_stats_counts object
pres_debates2012

2012 U.S. Presidential Debates
plot.kullback_leibler

Plots a kullback_leibler object
print.word_stats

Prints a word_stats object
scores.SMOG

Readability Measures
qprep

Quick Preparation of Text
replace_contraction

Replace Contractions
scores

Generic Scores Method
paste2

Paste an Unspecified Number Of Text Columns
scores.fry

Readability Measures
scores.flesch_kincaid

Readability Measures
read.transcript

Read Transcripts Into R
raj.act.1POS

Romeo and Juliet: Act 1 Parts of Speech by Person
replace_symbol

Replace Symbols With Word Equivalents
rank_freq_mplot

Rank Frequency Plot
tdm

tm Package Compatibility Tools: Apply to or Convert to/from Term Document Matrix or Document Term Matrix
print.polarity

Prints a polarity Object
repo2github

Upload a Local Repo to GitHub
sentSplit

Sentence Splitting
text2color

Map Words to Colors
print.automated_readability_index

Prints an automated_readability_index Object
counts.fry

Readability Measures
scores.pos_by

Parts of Speech
summary.wfdf

Summarize a wfdf object
synonyms

Search For Synonyms
syllable_sum

Syllabication
rm_url

Remove/Replace URLs
qcombine

Combine Columns
tot_plot

Visualize Word Length by Turn of Talk
sample.time.span

Minimal Time Span Data Set
scores.question_type

Question Counts
print.table_count

Prints a table_count object
scores.polarity

Polarity
proportions.pos_by

Parts of Speech
trans_context

Print Context Around Indices
print.adjacency_matrix

Prints an adjacency_matrix Object
print.kullback_leibler

Prints a kullback_leibler Object.
print.qdap_context

Prints a qdap_context object
proportions.pos

Parts of Speech
url_dl

Download Instructional Documents
print.word_associate

Prints a word_associate object
multiscale

Nested Standardization
visual.discourse_map

Discourse Map
print.termco

Prints a termco object.
qdap

qdap: Quantitative Discourse Analysis Package
strWrap

Wrap Character Strings to Format Paragraphs
question_type

Count of Question Type
scores.character_table

Term Counts
scores.automated_readability_index

Readability Measures
t.DocumentTermMatrix

Transposes a DocumentTermMatrix object
outlier_labeler

Locate Outliers in Numeric String
plot.character_table

Plots a character_table Object
plot.coleman_liau

Plots a coleman_liau Object
plot.discourse_map

Plots a discourse_map Object
spaste

Add Leading/Trailing Spaces
print.readability_count

Prints a readability_count Object
rajPOS

Romeo and Juliet Split in Parts of Speech
word_list

Raw Word Lists/Frequency Counts
plot.word_stats_counts

Plots a word_stats_counts Object
print.ngrams

Prints an ngrams object
scrubber

Clean Imported Text
word_stats

Descriptive Word Statistics
potential_NA

Search for Potential Missing Values
print.formality_scores

Prints a formality_scores object
raj.act.4

Romeo and Juliet: Act 4
print.trunc

Prints a trunc object
pres_debate_raw2012

First 2012 U.S. Presidential Debate
print.word_list

Prints a word_list Object
scores.linsear_write

Readability Measures
visual

Generic visual Method
print.table_score

Prints a table_score object
strip

Strip Text
space_fill

Replace Spaces
summary.cmspans

Summarize a cmspans object
rm_stopwords

Remove Stop Words
trans_cloud

Word Clouds by Grouping Variable
word_network_plot

Word Network Plot
word_proximity

Proximity Matrix Between Words
raj.act.5

Romeo and Juliet: Act 5
print.pos_by

Prints a pos_by Object.
word_cor

Find Correlated Words
proportions.end_mark_by

Question Counts
print.word_proximity

Prints a word_proximity object
raw.time.span

Minimal Raw Time Span Data Set
plot.sent_split

Plots a sent_split Object
replace_abbreviation

Replace Abbreviations
scores.termco

Term Counts
Animate

Generic Animate Method
cm_long2dummy

Stretch and Dummy Code cm_xxx2long
counts.automated_readability_index

Readability Measures
list2df

List/Matrix/Vector to Dataframe
mtabulate

Tabulate Frequency Counts for Multiple Vectors
plot.question_type_preprocessed

Plots a question_type_preprocessed Object
print.diversity

Prints a diversity object
print.formality

Prints a formality Object
print.linsear_write

Prints a linsear_write Object
replacer

Replace Cells in a Matrix or Data Frame
raj.act.1

Romeo and Juliet: Act 1
scores.coleman_liau

Readability Measures
stemmer

Stem Text
Animate.discourse_map

Discourse Map
counts.word_stats

Word Stats
dispersion_plot

Lexical Dispersion Plot
plot.sum_cmspans

Plot Summary Stats for a Summary of a cmspans Object
print.Dissimilarity

Prints a Dissimilarity object
print.end_mark_by_preprocessed

Prints an end_mark_by_preprocessed object
print.phrase_net

Prints a phrase_net Object
print.sums_gantt

Prints a sums_gantt object
qheat

Quick Heatmap
word_associate

Find Associated Words
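
A few of the small helpers listed above can be tried in isolation. The sketch below assumes qdap is installed and that `hms2sec`, `Trim`, and `bag_o_words` each accept a single character string, as their titles suggest; the input strings are invented for illustration.

```r
# Sketch only: the qdap calls run only when the package can be loaded.
if (requireNamespace("qdap", quietly = TRUE)) {
  library(qdap)

  # Convert an h:m:s time stamp to seconds
  secs <- hms2sec("02:00:03")

  # Remove leading and trailing white space
  trimmed <- Trim("  I like chicken.  ")

  # Break a string into a bag of words
  words <- bag_o_words("I like chicken. Do you?")

  print(list(seconds = secs, trimmed = trimmed, words = words))
}
```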