Learn R Programming

Concreteness

Concreteness has long been central to psychological theories of learning and thinking, and increasingly has practical applications to domains with prevalent natural language data, like advice and plan-making. However, the literature provides diffuse and competing definitions of concreteness in natural language. In this package, we codify simple guidelines for automated concreteness detection within and across domains, developed from a review of existing methods in the literature.

Installation

You can install the doc2concrete package directly, like so:

devtools::install_github("myeomans/doc2concrete")

Usage

This package is built as an accompaniment to Yeomans (2020). Here, we operationalize models of document-level concreteness based on a survey of datasets in several domains, including advice. We offer two applications. First, we provide pre-trained models specifically tuned to measure concreteness in two open-ended goal pursuit domains - advice and plan-making. These were developed using supervised machine learning tools, and robustly outperform other domain-specific models. We trained the advice model across a range of datasets from lab and field settings (9 studies, 4,608 students), and we trained the plan-making model from plans students wrote at the beginning of online classes (7 classes, 5,172 students). Second, we provide an open-domain model based on a word-level concreteness dictionary in Byrsbaert, Warriner & Kuperman (2014). While the open domain model did seem relatively robust in our research, we also found substantial variation in concreteness within and across domains. We provide this open-domain model as a scaleable starting point for researchers interested in concreteness in other domains. However, we highly recommend that researchers conduct deeper work to better understand their own domain-specific model of concreteness.


library(doc2concrete)

cor.test(doc2concrete(feedback_dat$feedback,domain="open"),
    feedback_dat$concrete)

cor.test(doc2concrete(feedback_dat$feedback,domain="advice"),
         feedback_dat$concrete)

References

Yeomans, M. (2021). A concrete example of construct construction in natural language. Organizational Behavior and Human Decision Processes, 162, 81-94.

Brysbaert, M., Warriner, A. B., & Kuperman, V. (2014). Concreteness ratings for 40 thousand generally known English word lemmas. Behavior Research Methods, 46(3), 904-911.

Copy Link

Version

Install

install.packages('doc2concrete')

Monthly Downloads

285

Version

0.6.0

License

MIT + file LICENSE

Maintainer

Mike Yeomans

Last Published

January 23rd, 2024

Functions in doc2concrete (0.6.0)

concDict

Open-Domain Concreteness Dictionaries
vocabmatcher

Feature Count Matcher
cleantext

Text Cleaner
overlaps

Overlap cleaner
planModel

Pre-trained Concreteness Detection Model for Plan-Making
doublestacker

Doublestacker
adviceNgrams

Pre-trained advice concreteness features
doc2concrete

Concreteness Scores
ngramTokens

Ngram Tokenizer
adviceModel

Pre-trained Concreteness Detection Model for Advice
bootstrap_list

Concreteness mTurk Word List
stemmer

Stemmer
uk2us

UK to US Conversion dictionary
usWords

UK to US conversion
feedback_dat

Personal Feedback Dataset
stemexcept

Conditional Stemmer
ctxpand

Contraction Expander
mturk_list

Concreteness mTurk Word List
planNgrams

Pre-trained plan concreteness features
cleanpunct

Cleaning weird encodings
textformat

Text Formatter