Learn R Programming

RDS (version 0.9-9)

fauxmadrona: A Simulated RDS Data Set with no seed dependency

Description

This is a faux set used to illustrate how the estimators perform under different populations and RDS schemes.

Arguments

Format

An rds.data.frame

Details

The population had N=1000 nodes. In this case, the sample size is 500 so that there is a relatively small sample fraction (50%). There is homophily on disease status (R=5) and there is differential activity by disease status whereby the infected nodes have mean degree twice that of the uninfected (w=1.8).

In the sampling, the seeds are chosen randomly from the full population, so there is no dependency induced by seed selection.

Each sample member is given 2 uniquely identified coupons to distribute to other members of the target population in their acquaintance. Further each respondent distributes their coupons completely at random from among those they are connected to.

Here are the results for this data set and the sister fauxsycamore data set:

NameCityTypeMeanRDS I (SH)RDS II (VH)SSfauxsycamore
Oxfordseed dependency, 70%0.24080.10870.13720.1814fauxmadronaSeattle

Even with only 50% sample, the VH is substantially biased , and the SS does much better.

References

Gile, Krista J., Handcock, Mark S., 2010 Respondent-driven Sampling: An Assessment of Current Methodology, Sociological Methodology, 40, 285-327.

See Also

fauxsycamore, faux