D3MIRT
Modeling
The D3mirt
analysis is based on descriptive multidimensional item
response theory (DMIRT; Reckase, 2009, 1985; Reckase & McKinley, 1991)
and can be used to analyze dichotomous and polytomous items to measure
latent traits (denoted $\theta$) in a three-dimensional ability space.
The method is foremost visual and illustrates item characteristics with
the help of vector geometry.
In DMIRT analysis, it is assumed that items in a multidimensional ability space can measure single or multiple latent traits (Reckase, 2009, 1985). This assumption is realized in the compensatory model, i.e., a type of measurement model that uses linear combinations of $\theta$-values for ability assessment (Reckase, 2009). DMIRT builds on the results from the former model and seeks to maximize item discrimination in the latent space by relaxing the assumption of unidimensionality. The method is descriptive because the results describe both the strength of discrimination of the items and the extent to which the items are unidimensional, i.e., that the items discriminate on one dimension only, or are within-multidimensional, i.e., that the maximum item discrimination is reached when the item measures more than one dimension.
Regarding vector orientation, the angle of the vector arrows indicates what traits, located along the axes in the model, an item measures (Reckase, 2009, 1985; Reckase & McKinley, 1991). For instance, in a two-dimensional space, an item is unidimensional if its item vector arrow is at $0°$ with respect to one of the axes in the model and $90°$ with respect to the other. Such an item describes a single trait only. In contrast, an item is perfectly within-multidimensional if its item vector arrow is oriented at $45°$ in relation to the axes in the model. Such an item describes both traits in the model equally well. The same criteria are extended to the $n$-dimensional case.
The DMIRT approach uses two types of item models depending on item type. If dichotomous items are used, the analysis is based on the multidimensional two-parameter logistic model (M2PL) (McKinley & Reckase, 1983; Reckase, 1985). If polytomous items are used, the analysis is extended to the multidimensional two-parameter graded response model (MGRM; Muraki & Carlson, 1995). The method is, therefore, limited to items that fit these item models.
The estimation begins by first fitting and extracting the loadings $a$ and difficulty parameters $d$ from a compensatory model. Next, the DMIRT estimation takes the former estimates and computes the multidimensional discrimination ($MDISC$) parameter and the multidimensional difficulty ($MDIFF$) parameter used to locate the items in a multidimensional vector space.
The $MDIFF$ is interpreted similarly to the difficulty parameter in the unidimensional model. It shows the level of ability required for a higher or correct response, defined as the distance from the origin. Note that if polytomous items are used, such as Likert items, the items will be represented by multiple vector arrows, one for each response function. The $MDIFF$ will then show an item’s multidimensional range of difficulty as located in the latent trait space.
The $MDISC$ shows the highest level of discrimination an item can achieve in the multidimensional latent model. It is, therefore, a global item characteristic referring only to the discrimination achieved in the specific direction of the item in the latent space. The $MDISC$ score is visualized in the graphical output by scaling the length of the vector arrows (longer arrows indicate higher discrimination and vice versa), which in turn describe the domain of item response functions.
In D3mirt
it also possible to use constructs in the analysis.
Constructs, in this context, refer to the assumption that a subset of
items or a particular orientation in the model space can measure a
higher-order latent variable. This is implemented in D3mirt
as
optional vectors whose orientation is either calculated as the average
direction of a subset of items (from all items down to a single item) or
indicated by spherical coordinates. A construct vector will, therefore,
point in the direction of the maximum slope of what we might think of as
an imaginary item response function indicated by the items or
coordinates chosen by the user based on exploratory or theoretical
reasons.
If constructs are used, the output will include reporting the directional discrimination ($DDISC$) parameter that shows how well the items discriminate, assuming that they measure one of the constructs used in the analysis. That is, while the $MDISC$ represents the maximum level of discrimination in the model, the $DDISC$ represents the local discrimination that makes it possible to compare item discrimination in a specific direction set by the constructs. The constructs are, therefore, like unidimensional models nested in the multidimensional latent space and are visually represented with construct vector arrows scaled to an arbitrary length.
Overview
The package includes the following main functions.
modid()
: D3mirt Model IdentificationD3mirt()
: 3D DMIRT Model Estimationplot()
: Graphical Output forD3mirt()
Installation
You can install the D3mirt
package from CRAN, or try the development
version of the package, by using the following codes for R
.
# Install from CRAN depository
install.packages('D3mirt')
# Install the development version from GitHub
# install.packages("devtools")
# To include package vignette in the installation, add: build_vignettes = TRUE
devtools::install_github("ForsbergPyschometrics/D3mirt")
In what follows, the D3mirt
procedure, including the main functions
and some of the more essential arguments, will be briefly described
using the built-in data set “anes0809offwaves”. The data set
($N = 1046, M_{age} = 51.33, SD = 14.56, 57%$ Female) is a subset from
the American National Election Survey (ANES) from the 2008-2009 Panel
Study Off Wave Questionnaires, December 2009 (DeBell, et al, 2010;
https://electionstudies.org/data-center/2008-2009-panel-study/). All
items measure moral preferences and are positively scored of Likert
type, ranging from 1 = Strongly Disagree to 6 = Strongly Agree.
Demographic variables include age and gender (male/female).
The D3mirt approach largely consists of the following three steps:
- Model Identification
- D3mirt model estimation
- Plotting
Please see the vignette included in the package for more details on the
D3mirt
package, including extended examples of analysis and functions.
1. Model Identification
As a first step in the analysis, the three-dimensional compensatory model must be identified (Reckase, 2009). In the three-dimensional case, this implies identifying the $x$ and $y$-axis in the DMIRT model by selecting two items from the item set. The first item should not load on the second and third axes ($y$ and $z$-axes), while the second item should not load on the third axis ($z$). Consequently, if the model is not known beforehand, it is necessary to explore the data with exploratory factor analysis (EFA), preferably with the help of the EFA methods suitable for DMIRT.
The modid()
function was designed to help with this step in the
process. In brief, the function first performs an EFA based on
multidimensional item response theory. It then selects items so that the
strongest loading item, from the strongest factor, is always parallel
with the x-axis, and the remaining items follow thereafter. This helps
make the result maximally interpretable while also avoiding imposing an
unempirical structure on the data.
Note that the EFA is only used to find model identification items that meet the necessary DMIRT model specification requirements. The EFA model itself is discarded after this step in the procedure. This implies that the rotation method is less crucial, and the user is encouraged to try different rotation methods and compare the results.
Begin by loading the item data. Note that all outputs are available as ready-made package files that can be loaded directly into the R session.
# Load Package
library(D3mirt)
# Load data
data("anes0809offwaves")
x <- anes0809offwaves
x <- x[, 3:22] # Remove columns for age and gender
The modid()
can take in raw item data or a data frame with item factor
loadings. In the default mode (efa = TRUE
), using raw data, the
function performs an EFA using with the help of the mirt()
function,
from the mirt
package (Chalmers, 2012), with three factors as the
default (factors = 3
). After sorting the results following the model
identification procedure, the function finishes by presenting the model
identification items. The function throws an error if no items can be
located. Note that it is possible to use available factor loadings
assigned to a data frame in the function call. This can be done by
setting efa
to FALSE
, which allows the function to skip the EFA and
jump directly to the model identification procedure.
The output consists of an $S3$ object of class modid
containing data
frames with model identification items, order of factor strength (based
on sum of squares), and item factor loadings. The function has two
arguments: the lower and upper bound. In brief, the lower bound
increases the item pool used in the procedure, while the upper bound
acts as a filter that removes items that do not meet the necessary
statistical requirements. This implies that the upper bound should not
generally be manipulated.
# Optional: Load the EFA data for this example directly from the package file
load(system.file("extdata/id.Rdata", package = "D3mirt"))
# Call to modid() with x, containing factors scores from the EFA
# Observe that the efa argument is set to false
id <- modid(x)
summary(id)
#>
#> modid: 20 items and 3 factors
#>
#> Model identification items:
#> Item 1 W7Q3
#> Item 2 W7Q20
#>
#> Item.1 ABS
#> W7Q3 0.8547 0.0174
#> W7Q5 0.8199 0.0648
#> W7Q1 0.7589 0.0772
#> W7Q10 0.7239 0.0854
#>
#> Item.2 ABS
#> W7Q20 0.7723 0.0465
#> W7Q19 0.6436 0.0526
#> W7Q18 0.6777 0.0782
#>
#> SS Loadings
#> F2 5.3505
#> F1 2.1127
#> F3 1.6744
#>
#> F2 F1 F3
#> W7Q1 0.7589 0.0407 -0.0365
#> W7Q2 0.8901 -0.0263 -0.0838
#> W7Q3 0.8547 -0.0096 -0.0078
#> W7Q4 0.6628 0.0272 0.1053
#> W7Q5 0.8199 -0.0390 -0.0258
#> W7Q6 0.6654 0.0525 0.1054
#> W7Q7 0.5604 -0.0148 0.2087
#> W7Q8 0.5731 0.0390 0.1966
#> W7Q9 0.6151 0.0697 0.0918
#> W7Q10 0.7239 0.0371 -0.0483
#> W7Q11 0.2085 0.0959 0.5488
#> W7Q12 0.0755 -0.0853 0.5559
#> W7Q13 -0.0176 -0.0153 0.7654
#> W7Q14 -0.0407 0.1439 0.5629
#> W7Q15 0.1087 0.4556 -0.1111
#> W7Q16 0.1759 0.2100 0.1152
#> W7Q17 0.2160 0.5816 0.0261
#> W7Q18 -0.0560 0.6777 -0.0782
#> W7Q19 0.0589 0.6436 0.0526
#> W7Q20 -0.0735 0.7723 0.0465
The summary()
function prints the number of items and the number of
factors used in the analysis together with the suggested model
identification items. As can be seen, the items suggested by modid()
are the items “W7Q3” and “W7Q20”. The output also includes data frames
that hold all the model identification items (Item.1...Item.n
)
selected by modid()
together with the items’ absolute sum score
(ABS
), one frame for the sum of squares for factors sorted in
descending order, and one frame for item factor loadings.
The order of the factors follows the model identification items so that item 1 comes from the strongest factor (sorted highest up), item 2 from the second strongest (sorted second), and so on. The absolute sum scores indicate statistical fit to the structural assumptions of the DMIRT model, and the items are sorted with the lowest absolute sum score highest up. Thus, the top items are the items that best meet the necessary statistical requirements for the model identification. For a three-dimensional model, this implies that the item highest up in the first data frame should be used to identify the $x$-axis and the item highest up in the second data frame should be used to identify the $y$-axis, and so on. See the package vignette for more on the model identification procedure (e.g., troubleshooting, criteria, or limitations).
2. D3mirt Model Estimation
The D3mirt()
function holds two built-in models. The first model is
the default model, i.e., the three-dimensional compensatory model, and
the latter is the orthogonal model, i.e., a restricted model in which
the assumption of within-multidimensionality is removed. In this
context, orthogonal refers to the strict perpendicular orientation of
the items in the latent space. The choice of model depends on the vector
used in the modid
argument when calling D3mirt()
(see examples
below).
Calling D3mirt()
returns an $S3$ object of class D3mirt
with lists
containing $a$ and $d$ from the compensatory model, and the $MDISC$, and
$MDIFF$ parameters, direction cosines, and spherical coordinates for the
item vectors from the DMIRT model. Regarding the latter, spherical
coordinates are represented by $\theta$ and $\phi$. The $\theta$
coordinate is the positive or negative angle in degrees, starting from
the $x$-axis, of the vector projections from the vector arrows in the
$xz$-plane up to $\pm 180°$. Note that the $\theta$ angle is oriented
following the positive pole of the $x$ and $z$ axis so that the angle
increases clockwise in the graphical output. The $\phi$ coordinate is
the positive angle in degrees from the $y$-axis and the vectors. Note,
the $\rho$ coordinate from the spherical coordinate system is in DMIRT
represented by the $MDIFF$, and so is reported separately.
Constructs can be included in the analysis by creating one or more
nested lists that indicate what items belong to what construct (see
example below) and using it in the con.items
argument. From this, the
D3mirt()
function finds the average direction of the subset of items
contained in each nested list by adding and normalizing the direction
cosines for the items and scaling the construct direction vector to an
arbitrary length (the length can be adjusted by the user) so that the
arrows can be seen when plotting. To use spherical coordinates, the user
must create a nested list containing spherical coordinate pairs in the
same format as the item indicators. For example:
con <- list(c(0, 45))
, and so on. The con
vector, in the latter
case, gives one construct vector located $45^{\circ}$ strictly above the
$x$-axis in the latent space.
If constructs are used, the function also returns construct direction cosines, spherical coordinates for the construct vector arrows, and $DDISC$ parameters (one index per construct).
The summary()
function presents the DMIRT estimates. The constructs
included below were grouped based on exploratory reasons, i.e., because
these items cluster in the model (observable in the graphical output
below).
# Optional: Load the mod3 data as a data frame directly from the package file
load(system.file("extdata/mod3.Rdata", package = "D3mirt"))
# Call to D3mirt(), including optional nested lists for three constructs
# Item W7Q16 is not included in any construct because of model violations
# The model violations for the item can be seen when plotting the model
con <- list(c(1:10),
c(11:14),
c(15:20))
mod3 <- D3mirt(x, modid = c("W7Q3", "W7Q20"), con.items = con)
summary(mod3)
#>
#> D3mirt: 20 items and 5 levels of difficulty
#>
#> Compensatory model
#> Model identification items: W7Q3, W7Q20
#>
#> Constructs
#> Item vector 1: W7Q1, W7Q2, W7Q3, W7Q4, W7Q5, W7Q6, W7Q7, W7Q8, W7Q9, W7Q10
#> Item vector 2: W7Q11, W7Q12, W7Q13, W7Q14
#> Item vector 3: W7Q15, W7Q16, W7Q17, W7Q18, W7Q19, W7Q20
#>
#>
#> a1 a2 a3 d1 d2 d3 d4 d5
#> W7Q1 2.0297 0.1645 -0.1227 8.0868 7.0641 5.9876 3.2016 -0.4834
#> W7Q2 2.6215 -0.0025 -0.2576 9.2884 6.6186 4.5103 1.6650 -2.4437
#> W7Q3 2.7932 0.0000 0.0000 10.4884 7.5922 5.6797 2.7181 -1.1790
#> W7Q4 1.9043 0.1877 0.1502 7.3749 6.0464 4.9812 2.4830 -1.1144
#> W7Q5 2.2427 -0.0285 -0.0836 8.4293 6.6722 4.9055 1.8256 -1.8316
#> W7Q6 2.0020 0.2392 0.1578 8.0687 6.3578 4.9521 2.3301 -1.0187
#> W7Q7 1.6284 0.1036 0.3600 6.0180 4.8976 3.6909 1.6326 -1.3483
#> W7Q8 1.7773 0.2254 0.3536 6.9174 5.1824 3.7663 1.4845 -1.8331
#> W7Q9 1.7197 0.2495 0.1286 7.5588 4.9756 3.3649 0.9344 -2.2093
#> W7Q10 1.7697 0.1274 -0.1404 8.3641 5.7399 4.2865 1.9648 -0.6641
#> W7Q11 1.4237 0.4680 1.0449 6.2204 4.6938 3.5443 1.1923 -1.8579
#> W7Q12 0.7601 0.0413 0.9370 4.1361 2.8772 2.3420 1.1791 -0.4240
#> W7Q13 1.1265 0.2914 1.6908 5.8838 4.3950 3.4385 1.8931 -0.6004
#> W7Q14 0.7444 0.4832 0.9787 5.3893 3.9334 3.0259 0.8144 -1.5869
#> W7Q15 0.4551 0.7871 -0.1607 4.3208 3.0545 2.3970 0.9187 -0.9705
#> W7Q16 0.6236 0.4141 0.1798 3.7249 2.0305 1.1658 -0.0612 -1.8084
#> W7Q17 1.1891 1.3414 0.0561 6.9016 5.8026 4.9348 2.7917 -0.0040
#> W7Q18 0.4106 1.3543 -0.1372 3.7838 2.0986 1.4183 0.1829 -1.9855
#> W7Q19 0.8578 1.4100 0.2278 4.4979 2.6484 1.6731 0.3741 -1.9966
#> W7Q20 0.7355 1.9066 0.0000 4.6376 2.3632 1.2791 -0.3430 -2.9188
#>
#> MDISC MDIFF1 MDIFF2 MDIFF3 MDIFF4 MDIFF5
#> W7Q1 2.0401 -3.9640 -3.4627 -2.9350 -1.5693 0.2370
#> W7Q2 2.6341 -3.5262 -2.5127 -1.7123 -0.6321 0.9277
#> W7Q3 2.7932 -3.7550 -2.7181 -2.0334 -0.9731 0.4221
#> W7Q4 1.9194 -3.8423 -3.1502 -2.5952 -1.2936 0.5806
#> W7Q5 2.2444 -3.7557 -2.9728 -2.1857 -0.8134 0.8160
#> W7Q6 2.0224 -3.9897 -3.1437 -2.4486 -1.1521 0.5037
#> W7Q7 1.6710 -3.6015 -2.9310 -2.2088 -0.9771 0.8069
#> W7Q8 1.8261 -3.7881 -2.8380 -2.0625 -0.8130 1.0038
#> W7Q9 1.7425 -4.3380 -2.8555 -1.9311 -0.5362 1.2679
#> W7Q10 1.7798 -4.6995 -3.2251 -2.4084 -1.1040 0.3731
#> W7Q11 1.8269 -3.4048 -2.5692 -1.9400 -0.6526 1.0170
#> W7Q12 1.2073 -3.4260 -2.3832 -1.9399 -0.9766 0.3512
#> W7Q13 2.0525 -2.8667 -2.1413 -1.6753 -0.9223 0.2925
#> W7Q14 1.3211 -4.0792 -2.9772 -2.2903 -0.6164 1.2011
#> W7Q15 0.9232 -4.6800 -3.3085 -2.5963 -0.9951 1.0512
#> W7Q16 0.7699 -4.8384 -2.6375 -1.5143 0.0795 2.3490
#> W7Q17 1.7935 -3.8482 -3.2354 -2.7515 -1.5566 0.0022
#> W7Q18 1.4218 -2.6613 -1.4761 -0.9976 -0.1286 1.3965
#> W7Q19 1.6661 -2.6997 -1.5896 -1.0042 -0.2245 1.1984
#> W7Q20 2.0436 -2.2694 -1.1564 -0.6259 0.1678 1.4283
#>
#> D.Cos X D.Cos Y D.Cos Z Theta Phi
#> W7Q1 0.9949 0.0806 -0.0601 -3.4590 85.3756
#> W7Q2 0.9952 -0.0010 -0.0978 -5.6133 90.0554
#> W7Q3 1.0000 0.0000 0.0000 0.0000 90.0000
#> W7Q4 0.9921 0.0978 0.0783 4.5098 84.3885
#> W7Q5 0.9992 -0.0127 -0.0372 -2.1341 90.7284
#> W7Q6 0.9899 0.1183 0.0780 4.5068 83.2064
#> W7Q7 0.9745 0.0620 0.2154 12.4657 86.4441
#> W7Q8 0.9733 0.1235 0.1936 11.2521 82.9082
#> W7Q9 0.9869 0.1432 0.0738 4.2769 81.7671
#> W7Q10 0.9943 0.0716 -0.0789 -4.5357 85.8965
#> W7Q11 0.7793 0.2561 0.5719 36.2765 75.1588
#> W7Q12 0.6296 0.0342 0.7761 50.9493 88.0399
#> W7Q13 0.5488 0.1420 0.8238 56.3262 81.8384
#> W7Q14 0.5634 0.3657 0.7408 52.7448 68.5472
#> W7Q15 0.4929 0.8525 -0.1740 -19.4451 31.5154
#> W7Q16 0.8101 0.5378 0.2335 16.0800 57.4627
#> W7Q17 0.6630 0.7479 0.0313 2.7031 41.5881
#> W7Q18 0.2888 0.9525 -0.0965 -18.4742 17.7264
#> W7Q19 0.5148 0.8463 0.1367 14.8712 32.1870
#> W7Q20 0.3599 0.9330 0.0000 0.0000 21.0939
#>
#> C.Cos X C.Cos Y C.Cos Z Theta Phi
#> C1 0.9970 0.0688 0.0368 2.1119 86.0548
#> C2 0.6409 0.2029 0.7404 49.1207 78.2961
#> C3 0.5405 0.8411 0.0226 2.3974 32.7476
#>
#> DDISC1 DDISC2 DDISC3
#> W7Q1 2.0304 1.2433 1.2326
#> W7Q2 2.6038 1.4887 1.4088
#> W7Q3 2.7847 1.7901 1.5096
#> W7Q4 1.9169 1.3697 1.1905
#> W7Q5 2.2308 1.3696 1.1862
#> W7Q6 2.0181 1.4484 1.2868
#> W7Q7 1.6438 1.3312 0.9754
#> W7Q8 1.8004 1.4465 1.1582
#> W7Q9 1.7364 1.2479 1.1422
#> W7Q10 1.7679 1.0560 1.0604
#> W7Q11 1.4900 1.7809 1.1867
#> W7Q12 0.7951 1.1892 0.4668
#> W7Q13 1.2053 2.0328 0.8922
#> W7Q14 0.8113 1.2997 0.8308
#> W7Q15 0.5019 0.3324 0.9043
#> W7Q16 0.6568 0.6168 0.6894
#> W7Q17 1.2799 1.0758 1.7722
#> W7Q18 0.4975 0.4363 1.3578
#> W7Q19 0.9605 1.0044 1.6547
#> W7Q20 0.8644 0.8581 2.0011
The D3mirt()
function prints a short report containing the number of
items used and the number of difficulty levels when the estimation is
done. As can be seen, when construct vectors are used, the function also
prints the number of construct vectors and the names of the items
included in each construct. Next, the factor loadings and the difficulty
parameters from the compensatory model are reported in data frames,
followed by all necessary DMIRT estimates.
Regarding the orthogonal model, the function call must include nested lists indicating which items should be constrained to either of the axes. Below is an example in which items $1$ to $10$ are constrained to the $x$-axes, items $15$ to $19$ to the $y$-axis, and items $11$ to $14$ to the $z$-axis. Note that fitting the orthogonal model often requires removing misfitting items. Otherwise, the orthogonal model most likely will amplify the misfit tendencies. In this case, item W7Q16 was removed before fitting the model.
# Optional: Load the mod2 data as a data frame directly from the package file
load(system.file("extdata/mod2.Rdata", package = "D3mirt"))
y <- data.frame(x[, -16])# Remove misfitting item W7Q16
# Call D3mirt() and using the orthogonal model
mod2 <- D3mirt(y, modid = list(c(1:10), # x-axis
c(15:19), # y-axis
c(11:14))) # z-axis
summary(mod2)
#>
#> D3mirt: 19 items and 5 levels of difficulty
#>
#> Orthogonal model
#> Item vector 1: W7Q1, W7Q2, W7Q3, W7Q4, W7Q5, W7Q6, W7Q7, W7Q8, W7Q9, W7Q10
#> Item vector 2: W7Q15, W7Q17, W7Q18, W7Q19, W7Q20
#> Item vector 3: W7Q11, W7Q12, W7Q13, W7Q14
#>
#> a1 a2 a3 d1 d2 d3 d4 d5
#> W7Q1 2.0183 0.0000 0.0000 8.0211 7.0101 5.9451 3.1762 -0.4884
#> W7Q2 2.5545 0.0000 0.0000 9.0536 6.4408 4.3879 1.6178 -2.3976
#> W7Q3 2.7483 0.0000 0.0000 10.3241 7.4782 5.6007 2.6793 -1.1828
#> W7Q4 1.9121 0.0000 0.0000 7.3485 6.0230 4.9697 2.4782 -1.1279
#> W7Q5 2.1971 0.0000 0.0000 8.2981 6.5636 4.8289 1.7951 -1.8163
#> W7Q6 2.0070 0.0000 0.0000 8.0685 6.3532 4.9445 2.3214 -1.0367
#> W7Q7 1.5979 0.0000 0.0000 5.9310 4.8301 3.6411 1.5984 -1.3322
#> W7Q8 1.7491 0.0000 0.0000 6.8381 5.1224 3.7094 1.4480 -1.8096
#> W7Q9 1.7410 0.0000 0.0000 7.5706 4.9930 3.3754 0.9282 -2.2171
#> W7Q10 1.7488 0.0000 0.0000 8.3054 5.6856 4.2405 1.9417 -0.6614
#> W7Q11 0.0000 0.0000 1.5285 5.9525 4.4684 3.3585 1.1019 -1.7346
#> W7Q12 0.0000 0.0000 1.1511 4.1002 2.8570 2.3265 1.1659 -0.4295
#> W7Q13 0.0000 0.0000 1.8866 5.7126 4.2787 3.3450 1.8306 -0.5882
#> W7Q14 0.0000 0.0000 1.3482 5.4784 4.0172 3.1011 0.8363 -1.6128
#> W7Q15 0.0000 0.8870 0.0000 4.3049 3.0375 2.3797 0.9084 -0.9651
#> W7Q17 0.0000 1.5464 0.0000 6.6917 5.6269 4.7725 2.6404 -0.0421
#> W7Q18 0.0000 1.3316 0.0000 3.6937 2.0504 1.3881 0.1841 -1.9387
#> W7Q19 0.0000 1.6471 0.0000 4.5704 2.6646 1.6572 0.3352 -2.0165
#> W7Q20 0.0000 1.9471 0.0000 4.5578 2.3173 1.2468 -0.3427 -2.8570
#>
#> MDISC MDIFF1 MDIFF2 MDIFF3 MDIFF4 MDIFF5
#> W7Q1 2.0183 -3.9742 -3.4733 -2.9456 -1.5737 0.2420
#> W7Q2 2.5545 -3.5442 -2.5214 -1.7178 -0.6333 0.9386
#> W7Q3 2.7483 -3.7565 -2.7210 -2.0379 -0.9749 0.4304
#> W7Q4 1.9121 -3.8432 -3.1500 -2.5992 -1.2961 0.5899
#> W7Q5 2.1971 -3.7768 -2.9873 -2.1978 -0.8170 0.8267
#> W7Q6 2.0070 -4.0202 -3.1655 -2.4636 -1.1566 0.5165
#> W7Q7 1.5979 -3.7118 -3.0228 -2.2787 -1.0003 0.8337
#> W7Q8 1.7491 -3.9096 -2.9287 -2.1208 -0.8279 1.0346
#> W7Q9 1.7410 -4.3485 -2.8680 -1.9388 -0.5332 1.2735
#> W7Q10 1.7488 -4.7492 -3.2511 -2.4248 -1.1103 0.3782
#> W7Q11 1.5285 -3.8944 -2.9234 -2.1973 -0.7209 1.1348
#> W7Q12 1.1511 -3.5620 -2.4820 -2.0211 -1.0129 0.3731
#> W7Q13 1.8866 -3.0279 -2.2679 -1.7730 -0.9703 0.3118
#> W7Q14 1.3482 -4.0635 -2.9796 -2.3001 -0.6203 1.1962
#> W7Q15 0.8870 -4.8531 -3.4243 -2.6827 -1.0241 1.0880
#> W7Q17 1.5464 -4.3273 -3.6387 -3.0862 -1.7074 0.0272
#> W7Q18 1.3316 -2.7738 -1.5398 -1.0424 -0.1383 1.4559
#> W7Q19 1.6471 -2.7749 -1.6178 -1.0062 -0.2035 1.2243
#> W7Q20 1.9471 -2.3408 -1.1902 -0.6403 0.1760 1.4673
#>
#> D.Cos X D.Cos Y D.Cos Z Theta Phi
#> W7Q1 1 0 0 0 90
#> W7Q2 1 0 0 0 90
#> W7Q3 1 0 0 0 90
#> W7Q4 1 0 0 0 90
#> W7Q5 1 0 0 0 90
#> W7Q6 1 0 0 0 90
#> W7Q7 1 0 0 0 90
#> W7Q8 1 0 0 0 90
#> W7Q9 1 0 0 0 90
#> W7Q10 1 0 0 0 90
#> W7Q11 0 0 1 90 90
#> W7Q12 0 0 1 90 90
#> W7Q13 0 0 1 90 90
#> W7Q14 0 0 1 90 90
#> W7Q15 0 1 0 NaN 0
#> W7Q17 0 1 0 NaN 0
#> W7Q18 0 1 0 NaN 0
#> W7Q19 0 1 0 NaN 0
#> W7Q20 0 1 0 NaN 0
In the output, the simple loading structure can be seen. Aldo note that
$\theta$ reports NaN
when both $\theta$ and $\phi$ is zero. This is
because when converting to spherical coordinates, items oriented
parallel to the $y$-axis will have $\cos (0)$ in the denominator of the
$\arctan$ function, resulting in undefined values.
3. Plotting
The plot()
method for objects of class D3mirt
is built on the rgl
package (Adler & Murdoch, 2023) for visualization with OpenGL. Graphing
in default mode by calling plot()
will return an RGL device appearing
in an external window as a three-dimensional interactive object
containing vector arrows with the latent dimensions running along the
orthogonal axes that can be rotated. In this illustration, however, all
RGL devices are plotted inline as still shots displayed from two angles,
$15^{\circ}$ (clockwise; default plot angle) and $90^{\circ}$. To change
the plot output to $90^{\circ}$, use the view
argument in the plot()
function and change the first indicator from $15$ to $90$.
# Plot RGL device with constructs visible and named
plot(mod3, constructs = TRUE,
construct.lab = c("Compassion", "Fairness", "Conformity"))
Figure 1: Three-dimensional vector plot for all items and the three constructs Compassion, Fairness, and Conformity (solid black arrows) plotted with the model rotated $15^{\circ}$ clockwise.
Figure 2: Three-dimensional vector plot for all items and the three constructs Compassion, Fairness, and Conformity (solid black arrows) plotted with the model rotated $90^{\circ}$ clockwise.
An example of how the output can be described is as follows.
As can be seen in Figures 1 and 2, the pattern in the data indicates the presence of foremost two main nested latent constructs indicated by the items, one aligned with the $x$-axis and one approaching the $y$-axis. We might also suspect the presence of a third construct located close to the $xy$-plane, between the $x$ and $z$ axes. Studying the content of the items, the labels Compassion, Fairness, and Conformity were introduced. The angles of the constructs inform us that Compassion ($\theta = 2.092^{\circ}$, $\phi = 86.061^{\circ}$) and Conformity ($\theta = -2.514 ^{\circ}$, $\phi = 28.193^{\circ}$) have some within-multidimensional tendencies. However, they are both more or less orthogonal to the $z$-axis. Next, we find Fairness ($\theta = 49.101^{\circ}$, $\phi = 78.313^{\circ}$) with clear within-multidimensional tendencies with respect to the $x$-axis. Thus, the output indicates that Compassion and Conformity could be independent constructs but Fairness seems not to be.
The orthogonal model is displayed below rotated rotated $15^{\circ}$ clockwise. In the graphical output the perpendicular orientation of the items forms a cross pattern in which each axis holds a unidimensional model indicated by the items parallel to it. The orthogonal model is foremost useful for studying the items under the assumption that the items are unidimensional and that the unidimensions are uncorrelated. Note that the orthogonal model will change the location of respondents in the latent space compared to the compensatory model. This can have an effect when studying profiles (see examples below), such that profile tendencies can emerge or disappear depending on the choice of model.
# Plot the orthogonal model
plot(mod2)
Figure 3: The orthogonal model plotted with the model rotated $15^{\circ}$ clockwise.
3.1. items
A subset of items can be plotted for a more thorough investigation using
the items
argument. In the example below, all constructs are plotted
together with the items used for the conformity construct. In the
function call, the numerical indicators in the items
argument follow
the item order in the original data frame (see ?anes0809offwaves
).
# The Conformity items from the model plotted with construct vector arrows
plot(mod3, constructs = TRUE,
items = c(15,17,18,19,20),
construct.lab = c("Compassion", "Fairness", "Conformity"))
Figure 4: The items from the Conformity construct plotted with the model rotated $15^{\circ}$ clockwise.
Figure 5: The items from the Conformity construct plotted the model rotated $90^{\circ}$ clockwise.
The plot()
function also allows plotting a single item by entering a
single number indicating what item that should be displayed. As was
mentioned above, the W7Q16 was not included in any of the constructs
because the item showed signs of measurement problems. For example, the
short vector arrows indicate high amounts of model violations and the
location of the item in the model also indicates that the item is
within-multidimensional and does not seem to belong to any construct
explicitly. Typing $16$ in the items
argument allows for a closer
look.
# Item W7Q16 has location 16 in the data set (gender and age excluded)
# The item is plotted together with construct to aid the visual interpretation
plot(mod3, constructs = TRUE,
items = 16,
construct.lab = c("Compassion", "Fairness", "Conformity"))
Figure 6: The item W7Q16 plotted with the three constructs and with the model rotated $15^{\circ}$ clockwise.
Figure 7: The item W7Q16 plotted with the three constructs and with the model rotated $90^{\circ}$ clockwise.
An example of how the output for analysis of the single item above is as follows.
The Figures 3 and 4 shows that item W7Q16 is located at $\theta = 16.085^{\circ}$, $\phi = 57.476^{\circ}$, indicating that the item is within-multidimensional with respect to the $x$ and $y$-axis; but much less so with respect to the $z$-axis. In addition, the directional discrimination further underscores that the item does not seem to measure any particular construct ($DDISC_1 = .657$, $DDISC_2 = .617$, $DDISC_3 = .656$). The global discrimination ($MDISC = .770$, $MDIFF_{range} = [-4.838, 2.349]$) is also the lowest of all discrimination scores in the model. This, combined, implies that the item in question does not seem to fit the three-dimensional DMIRT model used in this analysis and should, therefore, be removed or adapted. We can also note that item W7Q15, $MDISC = .923$, $MDIFF_{range} = [-4.680, 1.051]$) has the second lowest global discrimination score. Compared to W7Q16, however, item W7Q15 does seem to belong to the Conformity construct, observable when comparing angle orientation ($\theta = -19.432^{\circ}, \phi = 31.515^{\circ}$) and direction discrimination ($DDISC_1 = .502$, $DDISC_2 = .332$, $DDISC_3 = .912$). In other words, even if item W7Q15 shows signs of statistical violations in the model, the item still informs us of the content of the Conformity construct.
3.2. diff.level
The user has the option of plotting on one level of difficulty at a time
with the diff.level
argument studying the entire scale, a subset of
items, or on one item at a time. Note that difficulty refers to the
number of item response functions in the items, i.e., the total number
of response options minus one. In this case, $6$ response options were
used which means that the model has $5$ levels of difficulty.
# Plot RGL device on item difficulty level 5
plot(mod3, diff.level = 5)
Figure 8: All items plotted on difficulty level 5 and with the model rotated $15^{\circ}$ clockwise.
Figure 9: All items plotted on difficulty level 5 and with the model rotated $90^{\circ}$ clockwise.
3.3 scale
The D3mirt()
function returns item vector coordinates estimated with
and without the $MDISC$ as a scalar for the arrow length. When the
$MDISC$ is not used for the arrow length, all item vector arrows are
scaled to one unit length. This allows the user to graph the item vector
arrows with plot()
set to a uniform length. This can help reduce
visual clutter in the graphical output. To view the item vector arrows
without the $MDISC$, set scale = TRUE
.
# Plot RGL device with items in uniform length and constructs visible and named
plot(mod3, scale = TRUE,
constructs = TRUE,
construct.lab = c("Compassion", "Fairness", "Conformity"))
Figure 10: All items scaled to uniform length and plotted with the model rotated $15^{\circ}$ clockwise.
Figure 11: All items scaled to uniform length and plotted with the model rotated $90^{\circ}$ clockwise.
3.4. D3mirt
Profile Analysis
The plot()
function can also display respondents in the
three-dimensional model represented as spheres whose coordinates are
derived from the respondent’s trait scores. This allows for a profile
analysis in which respondents can be separated and plotted as subsets
conditioned on single or multiple criteria. The resulting output shows
where the respondents are located in the model and, accordingly, what
model profile best describes them. To do this, the user can either plot
all respondents by setting ind.scores
to TRUE
, or plot a subsection
of respondents by creating a separate data frame and use it in the
profiles
argument when calling plot()
.
In the example below, the first option is illustrated also using the
levels
argument. Regarding the latter, the plot()
function uses
as.factor()
to count the number of factor levels in the data imputed
in the levels
argument. This information is then used to assign colors
for the spheres representing respondents when plotting. This means that
raw data can be used as is, but the number of colors in the color
vectors argument (sphere.col
) may need to be adapted. In the example
below, the criteria variable for gender only hold two factor levels and,
therefore, only two colors in the color vector are needed. By separating
respondents using color coding, it is sometimes possible to display
group-level effects.
Generally, it can be helpful to hide vector arrows with hide = TRUE
when plotting respondent profiles to avoid visual cluttering. The
example below displays all respondents using the gender variable
included in the built-in data set. The gender variable is also
color-coded so that males are displayed as blue spheres and females as
red spheres.
# Load the data set to be used in levels
# Use as.matrix to remove any attributes
data("anes0809offwaves")
x <- as.matrix(anes0809offwaves)
Call plot()
and use the gender data, contained in column two in data
frame $x$, in the levels
argument. In the function call below, the
axes in the model are named using the x.lab
, y.lab
, and z.lab
arguments following the direction of the constructs. Note that the model
axes represent unidimensional singular structures or traits.
# Plot profiles with item vector arrows hidden with hide = TRUE
# Score levels: 1 = Blue ("male") and 2 = Red ("female")
# Plot profiles with item vector arrows hidden
# Score levels: 1 = Blue ("male") and 2 = Red ("female")
plot(mod3, hide = TRUE, ind.scores = TRUE,
levels = x[, 2],
sphere.col = c("blue", "red"),
x.lab = "Compassion",
y.lab="Conformity",
z.lab="Fairness")
Figure 12: Gender profile for the anes0809offwaves
data set plotted
with the model rotated $15^{\circ}$ clockwise.
Figure 13: Gender profile for the anes0809offwaves
data set plotted
with the model rotated $90^{\circ}$ clockwise.
An example of how the output can be described is as follows.
In the figure, a simple gender profile can be observed, showing that more women tend to have higher levels of compassion. When rotating the model $90^{\circ}$ clockwise, there seems to be no obvious gender difference related to Conformity or Fairness.
3.5. Plotting Confidence Intervals
It is also possible to plot a confidence interval in the shape of an
ellipse surrounding the spheres in the latent space. In the example
below, the younger individuals ($\leq30$) are selected and plotted with
a $95%$ CI. To subset respondents, create a new data frame by
combining respondents’ factor scores from mod3$fscores
with age data
from column one in data frame $x$. Then assign respondent data
conditioned on age to a new data frame using the subset()
function.
# Column bind trait scores in mod3 with the age variable W3Xage from data frame x
z <- data.frame(cbind(mod3$fscores, x[, 1]))
# Subset data frame z conditioned on age <= 30
z1 <- subset(z, z[, 4] <= 30)
When a criterion variable has a wide data range, such as an age
variable, rep()
can be used to set the appropriate size of the color
vector for sphere.col
by repeating color names with rep()
. When
plotting, the plot()
function will pick colors from the sphere.col
argument following the factor order in the levels argument. To do this,
the first step is to count the number of factors in the criterion
variable. This can be done with nlevels()
, as illustrated below.
# Check number of factor levels with nlevels() and as.factor()
nlevels(as.factor(z1[, 4]))
# Use rep() to create a color vector to color groups based on the nlevels() output
# z1 has 14 factor levels
colvec <- c(rep("red", 14))
To plot the CI, the ci
argument is set to TRUE
. The color of the
sphere was also changed from default grey80
to orange
in the example
below. Note that the CI limit can be adjusted with the ci.level
argument.
# Call plot() with profile data on age with item vector arrows hidden
plot(mod3, hide = TRUE,
profiles = z1,
levels = z1[, 4],
sphere.col = colvec,
x.lab = "Compassion",
y.lab="Conformity",
z.lab="Fairness",
ci = TRUE,
ci.level = 0.95,
ellipse.col = "orange")
Figure 14: Adults less than or equal to age 30 from the
anes0809offwaves
data set plotted surrounded by a $95%,CI$ and with
the model rotated $15^{\circ}$ clockwise.
Figure 15: Adults less than or equal to age 30 from the
anes0809offwaves
data set plotted surrounded by a $95%,CI$ and with
the model rotated $90^{\circ}$ clockwise.
An example of how the output can be described is as follows.
In Figures 7 and 8 we can see a tendency for an age profile in which younger individuals could be described as less oriented towards Conformity. We can also observe a tendency for what could be an interaction effect in which higher levels of Conformity seem to be associated with lower levels of Fairness.
4. Exporting the RGL Device
Some options for exporting the RGL device are shown below. In addition,
it is also possible to export graphical devices in R Markdown documents
with rgl::hook_webgl()
together with graphical options for knitr, as
was done when creating the package vignette.
# Export an open RGL device to the console that can be saved as an html or image file
plot(mod3, constructs = TRUE)
s <- rgl::scene3d()
rgl::rglwidget(s,
width = 1040,
height = 1040)
# Export a snap shoot of an open RGL device directly to file
plot(mod3, constructs = TRUE)
rgl::rgl.snapshot('RGLdevice.png',
fmt = 'png')
Getting Help, Feedback, and Questions
If you encounter a bug, please file an issue with a minimal reproducible example on GitHub (https://github.com/ForsbergPyschometrics/D3mirt). For questions and suggestions on how to improve the code, please contact me on GitHub or via email (forsbergpsychometrics@gmail.com).
References
Adler, D., & Murdoch, D. (2023). Rgl: 3d Visualization Using OpenGL [Computer software]. https://dmurdoch.github.io/rgl/index.html
Chalmers, R., P. (2012). mirt: A Multidimensional Item Response Theory Package for the R Environment. Journal of Statistical Software, 48(6), 1-29. https://doi.org/10.18637/jss.v048.i06
DeBell, M., Krosnick, J. A., & Lupia, A. (2010). Methodology Report and User’s Guide for the 2008-2009 ANES Panel Study. Palo Alto, CA, and Ann Arbor, MI: Stanford University and the University of Michigan.
McKinley, R. L., & Reckase, M. D. (1983). An Extension of the Two-parameter Logistic Model to the multidimensional latent space, Report ONR83-2. Iowa City, IA, American College Testing Program.
Muraki, E., & Carlson, J. E. (1995). Full-Information Factor Analysis for Polytomous Item Responses. Applied Psychological Measurement, 19(1), 73-90. https://doi.org/10.1177/014662169501900109
Reckase, M. D. (2009). Multidimensional Item Response Theory. Springer. https://doi.org/10.1007/978-0-387-89976-3
Reckase, M. D. (1985). The Difficulty of Test Items That Measure More Than One Ability. Applied Psychological Measurement, 9(4),401-412. https://doi.org/10.1177/014662168500900409
Reckase, M. D., & McKinley, R. L. (1991). The Discriminating Power of Items That Measure More Than One Dimension. Applied Psychological Measurement, 15(4), 361-373. https://doi.org/10.1177/014662169101500407