Learn R Programming

spsurvey

spsurvey is an R package that implements a design-based approach to statistical inference, with a focus on spatial data. Spatially balanced samples are selected using the Generalized Random Tessellation Stratified (GRTS) algorithm. The GRTS algorithm can be applied to finite resources (point geometries) and infinite resources (linear / linestring and areal / polygon geometries) and flexibly accommodates a diverse set of sampling design features, including stratification, unequal inclusion probabilities, proportional (to size) inclusion probabilities, legacy (historical) sites, a minimum distance between sites, and two options for replacement sites (reverse hierarchical order and nearest neighbor). Data are analyzed using a wide range of analysis functions that perform categorical variable analysis, continuous variable analysis, attributable risk analysis, risk difference analysis, relative risk analysis, change analysis, and trend analysis. spsurvey can also be used to summarize objects, visualize objects, select samples that are not spatially balanced, select panel samples, measure the amount of spatial balance in a sample, adjust design weights, and more. For additional details, see Dumelle et al. (2023) doi:10.18637/jss.v105.i03.

Installation

You can install and load the most recent approved version from CRAN by running

# install the most recent approved version from CRAN
install.packages("spsurvey")
# load the most recent approved version from CRAN
library(spsurvey)

You can install and load the most recent development version ofspsurvey from GitHub by running:

# Installing from GitHub requires you first install the remotes package
install.packages("remotes")

# install the most recent development version from GitHub
remotes::install_github("USEPA/spsurvey", ref = "main")
# load the most recent development version from GitHub
library(spsurvey)

You can install the most recent development version of spsurvey from GitHub with package vignettes by running:

install the most recent development version from GitHub with package vignettes
devtools::install_github("USEPA/spsurvey", build_vignettes=TRUE)

To view the vignettes in RStudio, run

vignette("start-here", "spsurvey") # start with this vignette for an spsurvey overview
vignette("EDA", "spsurvey") # for summaries and visualizations (exploratory data analysis)
vignette("sampling", "spsurvey") # for spatially balanced sampling
vignette("analysis", "spsurvey") # for analyzing data

To view the vignettes in a web format, visit here.

Further detail regarding spsurvey is contained in the package's documentation manual available for download here.

Citation

If you used spsurvey in your work, please cite it. You can view the most recent citation by running

citation(package = "spsurvey")
#> To cite spsurvey in publications use:
#> 
#>   Michael Dumelle, Tom Kincaid, Anthony R. Olsen, Marc Weber (2023).
#>   spsurvey: Spatial Sampling Design and Analysis in R. Journal of
#>   Statistical Software, 105(3), 1-29. doi:10.18637/jss.v105.i03
#> 
#> A BibTeX entry for LaTeX users is
#> 
#>   @Article{,
#>     title = {{spsurvey}: Spatial Sampling Design and Analysis in {R}},
#>     author = {Michael Dumelle and Tom Kincaid and Anthony R. Olsen and Marc Weber},
#>     journal = {Journal of Statistical Software},
#>     year = {2023},
#>     volume = {105},
#>     number = {3},
#>     pages = {1--29},
#>     doi = {10.18637/jss.v105.i03},
#>   }

Package Contributions

We encourage users to submit issues and enhancement requests so we may continue to improve spsurvey.

EPA Disclaimer

The United States Environmental Protection Agency (EPA) GitHub project code is provided on an "as is" basis and the user assumes responsibility for its use. EPA has relinquished control of the information and no longer has responsibility to protect the integrity , confidentiality, or availability of the information. Any reference to specific commercial products, processes, or services by service mark, trademark, manufacturer, or otherwise, does not constitute or imply their endorsement, recommendation or favoring by EPA. The EPA seal and logo shall not be used in any manner to imply endorsement of any commercial product or activity by EPA or the United States Government.

License

This project is licensed under the GNU General Public License, GPL-3.

Copy Link

Version

Install

install.packages('spsurvey')

Monthly Downloads

758

Version

5.5.1

License

GPL (>= 3)

Issues

Pull Requests

Stars

Forks

Maintainer

Last Published

January 9th, 2024

Functions in spsurvey (5.5.1)

cont_analysis

Continuous variable analysis
ash1_wgt

Compute the average shifted histogram (ASH) for one-dimensional weighted data
change_analysis

Change analysis
cov_panel_dsgn

Create a covariance matrix for a panel design
cdf_plot

Plot a cumulative distribution function (CDF)
cont_cdftest

Cumulative distribution function (CDF) inference for a probability survey
diffrisk_analysis

Risk difference analysis
attrisk_analysis

Attributable risk analysis
cat_analysis

Categorical variable analysis
cont_cdfplot

Create a PDF file containing cumulative distribution functions (CDF) plots
plot

Plot sampling frames, design sites, and analysis data.
localmean_weight

Internal Function: Local Mean Variance Neighbors and Weights
localmean_cov

Internal Function: Variance-Covariance Matrix Based on Local Mean Estimator
grts

Select a generalized random tessellation stratified (GRTS) sample
localmean_var

Internal Function: Local Mean Variance Estimator
pd_summary

Summary characteristics of a panel revisit design
power_dsgn

Power calculation for multiple panel designs
errorprnt

Print errors from analysis functions
plot.sp_CDF

Plot a cumulative distribution function (CDF)
sp_rbind

Combine rows from GRTS or IRS samples.
sp_summary

Summarize sampling frames, design sites, and analysis data.
irs

Select an independent random sample (IRS)
spsurvey-package

spsurvey: Spatial Sampling Design and Analysis
warnprnt

Print grts(), irs()), and analysis function warnings
summary

Summarize sampling frames, design sites, and analysis data.
revisit_bibd

Create a balanced incomplete block panel revisit design
revisit_dsgn

Create a panel revisit design
trend_analysis

Trend analysis
revisit_rand

Create a revisit design with random assignment to panels and time periods
ppd_plot

Plot power curves for panel designs
relrisk_analysis

Relative risk analysis
sp_plot

Plot sampling frames, design sites, and analysis data.
sp_frame

sp_frame objects
sp_balance

Calculate spatial balance metrics
stopprnt

Print grts() and irs() errors.
Illinois_River

Illinois River data
Illinois_River_Legacy

Illinois River legacy data
NE_Lakes_Legacy

New England Lakes legacy data
NLA_PNW

NLA PNW data
NE_Lakes_df

New England Lakes data (as a data frame)
NRSA_EPA7

NRSA EPA7 data
NE_Lakes

New England Lakes data
adjwgtNR

Adjust survey design weights for non-response by categories
adjwgt

Adjust survey design weights by categories
Lake_Ontario

Lake Ontario data