Simulate a number of individual realisations from a continuous-time Markov process. Observations of the process are made at specified arbitrary times for each individual, giving panel-observed data.
simmulti.msm(
data,
qmatrix,
covariates = NULL,
death = FALSE,
start,
ematrix = NULL,
misccovariates = NULL,
hmodel = NULL,
hcovariates = NULL,
censor.states = NULL,
drop.absorb = TRUE
)
A data frame with columns,
Subject identification indicators
Observation times
Simulated (true) state at the corresponding time
Observed outcome at the
corresponding time, if ematrix
or hmodel
was supplied
Row numbers of the original data. Useful when
drop.absorb=TRUE
, to show which rows were not dropped
plus any supplied covariates.
A data frame with a mandatory column named time
,
representing observation times. The optional column named subject
,
corresponds to subject identification numbers. If not given, all
observations are assumed to be on the same individual. Observation times
should be sorted within individuals. The optional column named cens
indicates the times at which simulated states should be censored. If
cens==0
then the state is not censored, and if cens==k
, say,
then all simulated states at that time which are in the set
censor.states
are replaced by k
. Other named columns of the
data frame represent any covariates, which may be time-constant or
time-dependent. Time-dependent covariates are assumed to be constant
between the observation times.
The transition intensity matrix of the Markov process, with
any covariates set to zero. The diagonal of qmatrix
is ignored, and
computed as appropriate so that the rows sum to zero. For example, a
possible qmatrix
for a three state illness-death model with recovery
is:
rbind( c( 0, 0.1, 0.02 ), c( 0.1, 0, 0.01 ), c( 0, 0, 0 ) )
List of linear covariate effects on log transition intensities. Each element is a vector of the effects of one covariate on all the transition intensities. The intensities are ordered by reading across rows of the intensity matrix, starting with the first, counting the positive off-diagonal elements of the matrix.
For example, for a multi-state model with three transition intensities, and
two covariates x
and y
on each intensity,
covariates=list(x = c(-0.3,-0.3,-0.3), y=c(0.1, 0.1, 0.1))
Vector of indices of the death states. A death state is an
absorbing state whose time of entry is known exactly, but the individual is
assumed to be in an unknown transient state ("alive") at the previous
instant. This is the usual situation for times of death in chronic disease
monitoring data. For example, if you specify death = c(4, 5)
then
states 4 and 5 are assumed to be death states.
death = TRUE
indicates that the final state is a death state, and
death = FALSE
(the default) indicates that there is no death state.
A vector with the same number of elements as there are distinct subjects in the data, giving the states in which each corresponding individual begins. Or a single number, if all of these are the same. Defaults to state 1 for each subject.
An optional misclassification matrix for generating observed
states conditionally on the simulated true states. As defined in
msm
.
Covariate effects on misclassification probabilities
via multinomial logistic regression. Linear effects operate on the log of
each probability relative to the probability of classification in the
correct state. In same format as covariates
.
An optional hidden Markov model for generating observed
outcomes conditionally on the simulated true states. As defined in
msm
. Multivariate outcomes (hmmMV
) are not supported.
List of the same length as hmodel
, defining any
covariates governing the hidden Markov outcome models. Unlike in the
msm
function, this should also define the values of the covariate
effects. Each element of the list is a named vector of the initial values
for each set of covariates for that state. For example, for a three-state
hidden Markov model with two, one and no covariates on the state 1, 2 and 3
outcome models respectively,
hcovariates = list (c(acute=-8, age=0), c(acute=-8), NULL)
Set of simulated states which should be replaced by a censoring indicator at censoring times. By default this is all transient states (representing alive, with unknown state).
Drop repeated observations in the absorbing state, retaining only one.
C. H. Jackson chris.jackson@mrc-bsu.cam.ac.uk
sim.msm
is called repeatedly to produce a simulated trajectory
for each individual. The state at each specified observation time is then
taken to produce a new column state
. The effect of time-dependent
covariates on the transition intensity matrix for an individual is
determined by assuming that the covariate is a step function which remains
constant in between the individual's observation times. If the subject
enters an absorbing state, then only the first observation in that state is
kept in the data frame. Rows corresponding to future observations are
deleted. The entry times into states given in death
are assumed to
be known exactly.
sim.msm
### Simulate 100 individuals with common observation times
sim.df <- data.frame(subject = rep(1:100, rep(13,100)), time = rep(seq(0, 24, 2), 100))
qmatrix <- rbind(c(-0.11, 0.1, 0.01 ),
c(0.05, -0.15, 0.1 ),
c(0.02, 0.07, -0.09))
simmulti.msm(sim.df, qmatrix)
Run the code above in your browser using DataLab