simulate is used to draw from exponential
family random network models. See ergm() for more
information on these models.
The method for ergm objects inherits the model,
the coefficients, the response attribute, the reference, the
constraints, and most simulation parameters from the model fit,
unless overridden by passing them explicitly. Unless overridden,
the simulation is initialized with either a random draw from near
the fitted model saved by ergm() or, if unavailable, the
network to which the ERGM was fit.
# S3 method for formula_lhs_network
simulate(object, nsim = 1, seed = NULL, ...)simulate_formula(object, ..., basis = eval_lhs.formula(object))
# S3 method for network
simulate_formula(
object,
nsim = 1,
seed = NULL,
coef,
response = NULL,
reference = ~Bernoulli,
constraints = ~.,
observational = FALSE,
monitor = NULL,
statsonly = FALSE,
esteq = FALSE,
output = c("network", "stats", "edgelist", "ergm_state"),
simplify = TRUE,
sequential = TRUE,
control = control.simulate.formula(),
verbose = FALSE,
...,
basis = ergm.getnetwork(object),
do.sim = NULL,
return.args = NULL
)
# S3 method for ergm_state
simulate_formula(
object,
nsim = 1,
seed = NULL,
coef,
response = NULL,
reference = ~Bernoulli,
constraints = ~.,
observational = FALSE,
monitor = NULL,
statsonly = FALSE,
esteq = FALSE,
output = c("network", "stats", "edgelist", "ergm_state"),
simplify = TRUE,
sequential = TRUE,
control = control.simulate.formula(),
verbose = FALSE,
...,
basis = ergm.getnetwork(object),
do.sim = NULL,
return.args = NULL
)
# S3 method for ergm_model
simulate(
object,
nsim = 1,
seed = NULL,
coef,
reference = if (is(constraints, "ergm_proposal")) NULL else trim_env(~Bernoulli),
constraints = trim_env(~.),
observational = FALSE,
monitor = NULL,
basis = NULL,
esteq = FALSE,
output = c("network", "stats", "edgelist", "ergm_state"),
simplify = TRUE,
sequential = TRUE,
control = control.simulate.formula(),
verbose = FALSE,
...,
do.sim = NULL,
return.args = NULL
)
# S3 method for ergm_state_full
simulate(
object,
nsim = 1,
seed = NULL,
coef,
esteq = FALSE,
output = c("network", "stats", "edgelist", "ergm_state"),
simplify = TRUE,
sequential = TRUE,
control = control.simulate.formula(),
verbose = FALSE,
...,
return.args = NULL
)
# S3 method for ergm
simulate(
object,
nsim = 1,
seed = NULL,
coef = coefficients(object),
response = object$network %ergmlhs% "response",
reference = object$reference,
constraints = list(object$constraints, object$obs.constraints),
observational = FALSE,
monitor = NULL,
basis = if (observational) object$network else NVL(object$newnetwork, object$network),
statsonly = FALSE,
esteq = FALSE,
output = c("network", "stats", "edgelist", "ergm_state"),
simplify = TRUE,
sequential = TRUE,
control = control.simulate.ergm(),
verbose = FALSE,
...,
return.args = NULL
)
If output=="stats" an mcmc object containing the
simulated network statistics. If control$parallel>0, an
mcmc.list object. If simplify=TRUE (the default), these
would then be "stacked" and converted to a standard matrix. A
logical vector indicating whether or not the term had come from
the monitor= formula is stored in attr()-style attribute
"monitored".
Otherwise, a representation of the simulated network is returned,
in the form specified by output. In addition to a network
representation or a list thereof, they have the following
attr()-style attributes:
formulaThe formula used to generate the
sample.
statsAn mcmc or mcmc.list object as above.
controlControl parameters used to generate the sample.
constraintsConstraints used to generate the sample.
referenceThe reference measure for the sample.
monitorThe monitoring formula.
responseThe edge attribute used as a response.
The following are the permitted network formats:
"network"If nsim==1, an object of class
network. If nsim>1, it returns an object of class
network.list (a list of networks) with the
above-listed additional attributes.
"edgelist"An edgelist representation of the network,
or a list thereof, depending on nsim.
"ergm_state"A semi-internal representation of
a network consisting of a network object emptied of edges, with
an attached edgelist matrix, or a list thereof, depending on
nsim.
If simplify==FALSE, the networks are returned as a nested list,
with outer list being the parallel chain (including 1 for no
parallelism) and inner list being the samples within that chains
(including 1, if one network per chain). If TRUE, they are
concatenated, and if a total of one network had been simulated, the
network itself will be returned.
Either a formula or an
ergm object. The formula should be of the form
y ~ <model terms>, where y is a network object or a matrix
that can be coerced to a network object. For the
details on the possible <model terms>, see ergmTerm.
To create a network object in , use the
network() function, then add nodal attributes to it using the
%v% operator if necessary.
Number of networks to be randomly drawn from the given distribution on the set of all networks, returned by the Metropolis-Hastings algorithm.
Seed value (integer) for the random number generator. See
set.seed().
Further arguments passed to or used by methods.
a value (usually a network) to override the LHS of the formula.
Vector of parameter values for the model from which the
sample is to be drawn. If object is of class ergm,
the default value is the vector of estimated coefficients. Can be
set to NULL to bypass, but only if return.args below is used.
Either a character string, a formula, or NULL (the default), to specify the response attributes and whether the ERGM is binary or valued. Interpreted as follows:
NULLModel simple presence or absence, via a binary ERGM.
The name of the edge attribute whose value is to be modeled. Type of ERGM will be determined by whether the attribute is logical (TRUE/FALSE) for binary or numeric for valued.
must be of the form NAME~EXPR|TYPE (with | being literal). EXPR is evaluated in the formula's environment with the network's edge attributes accessible as variables. The optional NAME specifies the name of the edge attribute into which the results should be stored, with the default being a concise version of EXPR. Normally, the type of ERGM is determined by whether the result of evaluating EXPR is logical or numeric, but the optional TYPE can be used to override by specifying a scalar of the type involved (e.g., TRUE for binary and 1 for valued).
A one-sided formula specifying the reference measure (\(h(y)\)) to be used. See help for ERGM reference measures implemented in the ergm package.
A formula specifying one or more constraints
on the support of the distribution of the networks being modeled. Multiple constraints
may be given, separated by “+” and “-” operators. See
ergmConstraint for the detailed explanation of
their semantics and also for an indexed list of the constraints visible to the ergm package.
The default is to have no constraints except those provided through
the ergmlhs API.
Together with the model terms in the formula and the reference measure, the constraints define the distribution of networks being modeled.
It is also possible to specify a proposal function directly either
by passing a string with the function's name (in which case,
arguments to the proposal should be specified through the
MCMC.prop.args argument to the relevant control function, or
by giving it on the LHS of the hints formula to MCMC.prop
argument to the control function. This will override
the one chosen automatically.
Note that not all possible combinations of constraints and reference measures are supported. However, for relatively simple constraints (i.e., those that simply permit or forbid specific dyads or sets of dyads from changing), arbitrary combinations should be possible.
Inherit observational constraints rather than model constraints.
A one-sided formula specifying one or more terms
whose value is to be monitored. These terms are appended to the
model, along with a coefficient of 0, so their statistics are
returned. An ergm_model objectcan be passed as well.
Logical: If TRUE, return only the network statistics, not
the network(s) themselves. Deprecated in favor of output=.
Logical: If TRUE, compute the sample estimating equations of an ERGM: if the model is non-curved, all non-offset statistics are returned either way, but if the model is curved, the score estimating function values (3.1) by Hunter and Handcock (2006) are returned instead.
Normally character, one of "network" (default),
"stats", "edgelist", or "ergm_state": determines the output
format. Partial matching is performed.
Alternatively, a function with prototype
function(ergm_state, chain, iter, ...) that is
called for each returned network, and its return value, rather
than the network itself, is stored. This can be used to, for
example, store the simulated networks to disk without storing
them in memory or compute network statistics not implemented
using the ERGM API, without having to store the networks
themselves.
Logical: If TRUE the output is "simplified":
sampled networks are returned in a single list, statistics from
multiple parallel chains are stacked, etc.. This makes it
consistent with behavior prior to ergm 3.10.
Logical: If FALSE, each of the nsim simulated
Markov chains begins at the initial network. If TRUE, the end of one
simulation is used as the start of the next. Irrelevant when nsim=1.
A list of control parameters for algorithm tuning,
typically constructed with control.simulate.ergm() or control.simulate.formula(), which have different
defaults. Their documentation gives the the list of recognized
control parameters and their meaning. The more generic utility
snctrl() (StatNet ConTRoL) also provides argument completion
for the available control functions and limited argument name
checking.
A logical or an integer to control the amount of
progress and diagnostic information to be printed. FALSE/0
produces minimal output, with higher values producing more
detail. Note that very high values (5+) may significantly slow
down processing.
Logical; a deprecated interface superseded by return.args,
that saves the inputs to the next level of the function.
Character; if not NULL, the simulate method
for that particular class will, instead of proceeding for
simulation, instead return its arguments as a list that can be
passed as a second argument to do.call() or a lower-level
function such as ergm_MCMC_sample(). This can be useful if, for
example, one wants to run several simulations with varying
coefficients and does not want to reinitialize the model and the
proposal every time. Valid inputs at this time are "formula",
"ergm_model", and one of the "ergm_state" classes, for the three
respective stopping points.
simulate(ergm_state_full): a low-level function to simulate from an ergm_state object.
A sample of networks is randomly drawn from the specified model. The model
is specified by the first argument of the function. If the first argument
is a formula then this defines the model. If the first
argument is the output of a call to ergm() then the model used
for that call is the one fit -- and unless coef is specified, the
sample is from the MLE of the parameters. If neither of those are given as
the first argument then a Bernoulli network is generated with the
probability of ties defined by prob or coef.
Note that the first network is sampled after burnin steps,
and any subsequent networks are sampled each interval steps
after the first.
More information can be found by looking at the documentation of
ergm().
ergm(), network,
ergm_MCMC_sample() for a demonstration of return.args=.
# \dontshow{
options(ergm.eval.loglik=FALSE)
# }
#
# Let's draw from a Bernoulli model with 16 nodes
# and density 0.5 (i.e., coef = c(0,0))
#
g.sim <- simulate(network(16) ~ edges + mutual, coef=c(0, 0))
#
# What are the statistics like?
#
summary(g.sim ~ edges + mutual)
#
# Now simulate a network with higher mutuality
#
g.sim <- simulate(network(16) ~ edges + mutual, coef=c(0,2))
#
# How do the statistics look?
#
summary(g.sim ~ edges + mutual)
#
# Let's draw from a Bernoulli model with 16 nodes
# and tie probability 0.1
#
g.use <- network(16,density=0.1,directed=FALSE)
#
# Starting from this network let's draw 3 realizations
# of a edges and 2-star network
#
g.sim <- simulate(~edges+kstar(2), nsim=3, coef=c(-1.8,0.03),
basis=g.use, control=control.simulate(
MCMC.burnin=1000,
MCMC.interval=100))
g.sim
summary(g.sim)
#
# attach the Florentine Marriage data
#
data(florentine)
#
# fit an edges and 2-star model using the ergm function
#
gest <- ergm(flomarriage ~ edges + kstar(2))
summary(gest)
#
# Draw from the fitted model (statistics only), and observe the number
# of triangles as well.
#
g.sim <- simulate(gest, nsim=10,
monitor=~triangles, output="stats",
control=control.simulate.ergm(MCMC.burnin=1000, MCMC.interval=100))
g.sim
# Custom output: store the edgecount (computed in R), iteration index, and chain index.
output.f <- function(x, iter, chain, ...){
list(nedges = network.edgecount(as.network(x)),
chain = chain, iter = iter)
}
g.sim <- simulate(gest, nsim=3,
output=output.f, simplify=FALSE,
control=control.simulate.ergm(MCMC.burnin=1000, MCMC.interval=100))
unclass(g.sim)
Run the code above in your browser using DataLab