Learn R Programming

⚠️There's a newer version (3.0.9) of this package.Take me there.

EGRET

Exploration and Graphics for RivEr Trends (EGRET): An R-package for the analysis of long-term changes in water quality and streamflow, including the water-quality method Weighted Regressions on Time, Discharge, and Season (WRTDS).

Please visit the wiki for more information: EGRET Wiki

The link for the official USGS publication user guide is here:

https://pubs.usgs.gov/tm/04/a10/

Package Installation

To install the EGRET package, you must be using R 3.0 or greater and run the following command:

install.packages("EGRET")

To get inter-CRAN release updates, use the command:

install.packages("EGRET",repos="https://owi.usgs.gov/R")

To get cutting-edge changes, install from GitHub using the devtools packages:

library(devtools)
install_github("USGS-R/EGRET")

Background:

Evaluating long-term changes in river conditions (water quality and discharge) is an important use of hydrologic data. To carry out such evaluations, the hydrologist needs tools to facilitate several key steps in the process: acquiring the data records from a variety of sources, structuring it in ways that facilitate the analysis, routines that will process the data to extract information about changes that may be happening, and graphical techniques that can display findings about change. The R package EGRET (Exploration and Graphics for RivEr Trends) was developed for carrying out each of these steps in an integrated manner. It is designed to accept easily data from three sources: U.S. Geological Survey hydrologic data, Water Quality Portal Data (currently including U.S. Environmental Protection Agency (EPA) STORET data, and USDA STEWARDS data), and user-supplied flat files. The EGRET package has components oriented towards the description of long-term changes in streamflow statistics (high flow, average flow, and low flow) as well as changes in water quality. For the water-quality analysis, it uses Weighted Regressions on Time, Discharge and Season (WRTDS) to describe long-term trends in both concentration and flux. EGRET also creates a wide range of graphical presentations of the water-quality data and of the WRTDS results. The following report serves as a user guide, providing detailed guidance on installation and use of the software, documentation of the analysis methods used, as well as guidance on some of the kinds of questions and approaches that the software can facilitate.

EGRET includes statistics and graphics for streamflow history, water quality trends, and the statistical modeling algorithm Weighted Regressions on Time, Discharge, and Season (WRTDS). Please see the official EGRET User Guide for more information on the EGRET package:

https://doi.org/10.3133/tm4A10 The best ways to learn about the WRTDS approach is to read the User Guide and two journal articles. These articles are available, for free, from the journals in which they were published. The first relates to nitrate and total phosphorus data for 9 rivers draining to Chesapeake Bay. The URL is:

https://onlinelibrary.wiley.com/doi/full/10.1111/j.1752-1688.2010.00482.x.

The second is an application to nitrate data for 8 monitoring sites on the Mississippi River or its major tributaries. The URL is:

<http://pubs.acs.org/doi/abs/10.1021/es201221s

For a thorough discussion of the generalized flow normalization method implemented in the EGRET enhancements, see the paper: "Tracking changes in nutrient delivery to western Lake Erie: Approaches to compensate for variability and trends in streamflow":

https://www.sciencedirect.com/science/article/pii/S0380133018302235

Sample Workflow

WRTDS on the Choptank River at Greensboro MD, for Nitrate:

library(EGRET)

############################
# Gather discharge data:
siteID <- "01491000" #Choptank River at Greensboro, MD
startDate <- "" #Gets earliest date
endDate <- "2011-09-30"
# Gather sample data:
parameter_cd<-"00631" #5 digit USGS code
Sample <- readNWISSample(siteID,parameter_cd,startDate,endDate)
#Gets earliest date from Sample record:
#This is just one of many ways to assure the Daily record
#spans the Sample record
startDate <- min(as.character(Sample$Date)) 
# Gather discharge data:
Daily <- readNWISDaily(siteID,"00060",startDate,endDate)
# Gather site and parameter information:

# Here user must input some values for
# the default (interactive=TRUE)
INFO<- readNWISInfo(siteID,parameter_cd)
INFO$shortName <- "Choptank River at Greensboro, MD"

# Merge discharge with sample data:
eList <- mergeReport(INFO, Daily, Sample)
############################
library(EGRET)
# Sample data included in package:
eList <- Choptank_eList

############################
# Check sample data:
boxConcMonth(eList)

boxQTwice(eList)

plotConcTime(eList)

plotConcQ(eList)

multiPlotDataOverview(eList)
############################

############################
# Run WRTDS model:
eList <- modelEstimation(eList)

#> 
#>  first step running estCrossVal may take about 1 minute
#>  estCrossVal % complete:
#> 0    1   2   3   4   5   6   7   8   9   10  
#> 11   12  13  14  15  16  17  18  19  20  
#> 21   22  23  24  25  26  27  28  29  30  
#> 31   32  33  34  35  36  37  38  39  40  
#> 41   42  43  44  45  46  47  48  49  50  
#> 51   52  53  54  55  56  57  58  59  60  
#> 61   62  63  64  65  66  67  68  69  70  
#> 71   72  73  74  75  76  77  78  79  80  
#> 81   82  83  84  85  86  87  88  89  90  
#> 91   92  93  94  95  96  97  98  99  
#> Next step running  estSurfaces with survival regression:
#> Survival regression (% complete):
#> 0    1   2   3   4   5   6   7   8   9   10  
#> 11   12  13  14  15  16  17  18  19  20  
#> 21   22  23  24  25  26  27  28  29  30  
#> 31   32  33  34  35  36  37  38  39  40  
#> 41   42  43  44  45  46  47  48  49  50  
#> 51   52  53  54  55  56  57  58  59  60  
#> 61   62  63  64  65  66  67  68  69  70  
#> 71   72  73  74  75  76  77  78  79  80  
#> 81   82  83  84  85  86  87  88  89  90  
#> 91   92  93  94  95  96  97  98  99  
#> Survival regression: Done
############################

############################
#Check model results:

#eList:
plotConcTimeDaily(eList)

plotFluxTimeDaily(eList)

plotConcPred(eList)

plotFluxPred(eList)

plotResidPred(eList)

plotResidQ(eList)

plotResidTime(eList)

boxResidMonth(eList)

boxConcThree(eList)


#Require Daily + INFO:
plotConcHist(eList)

plotFluxHist(eList)

# Multi-line plots:
date1 <- "2000-09-01"
date2 <- "2005-09-01"
date3 <- "2009-09-01"
qBottom<-100
qTop<-5000
plotConcQSmooth(eList, date1, date2, date3, qBottom, qTop, 
                   concMax=2,qUnit=1)

q1 <- 10
q2 <- 25
q3 <- 75
centerDate <- "07-01"
yearEnd <- 2009
yearStart <- 2000
plotConcTimeSmooth(eList, q1, q2, q3, centerDate, yearStart, yearEnd)


# Multi-plots:
fluxBiasMulti(eList)

#Contour plots:
clevel<-seq(0,2,0.5)
maxDiff<-0.8
yearStart <- 2000
yearEnd <- 2010

plotContours(eList, yearStart,yearEnd,qBottom,qTop, 
             contourLevels = clevel,qUnit=1)

plotDiffContours(eList, yearStart,yearEnd,
                 qBottom,qTop,maxDiff,qUnit=1)

Sample workflow for a flowHistory application for the entire record

library(EGRET)

# Flow history analysis
############################
# Gather discharge data:
siteID <- "01491000" #Choptank River at Greensboro, MD
startDate <- "" # Get earliest date
endDate <- "" # Get latest date
Daily <- readNWISDaily(siteID,"00060",startDate,endDate)
#> There are 25856 data points, and 25856 days.
# Gather site and parameter information:
# Here user must input some values for
# the default (interactive=TRUE)
INFO<- readNWISInfo(siteID,"00060")
#> Your site for streamflow data is:
#>  01491000 .
#> Your site name is CHOPTANK RIVER NEAR GREENSBORO, MD 
#> but you can modify this to a short name in a style you prefer. 
#> This name will be used to label graphs and tables. 
#> If you want the program to use the name given above, just do a carriage return,
#> otherwise enter the preferred short name(no quotes):
#> 
#> The latitude and longitude of the site are:  38.99719 ,  -75.78581 (degrees north and west).
#> 
#> The drainage area at this site is  113 square miles
#>  which is being stored as 292.6687 square kilometers.
#> 
#> It is helpful to set up a station abbreviation when doing multi-site studies,
#> enter a unique id (three or four characters should work). It is case sensitive.  
#> Even if you don't feel you need an abbreviation for your site you need to enter something(no quotes):
#> 
#> Your water quality data are for parameter number:
#> 00060 
#> which has the name:' Discharge, cubic feet per second '.
#> Typically you will want a shorter name to be used in graphs and tables.
#> The suggested short name is:' Stream flow, mean. daily '.
#> If you would like to change the short name, enter it here, 
#> otherwise just hit enter (no quotes):
#> The units for the water quality data are:  ft3/s .
#> It is helpful to set up a constiuent abbreviation, enter a unique id 
#> three or four characters should work something like tn or tp or NO3).
#> Even if you don't feel you need an abbreviation you need to enter something (no quotes):
#> 
#> Required concentration units are mg/l. 
#> The INFO dataframe indicates: ft3/s 
#> Flux calculations will be wrong if units are not consistent.
INFO$shortName <- "Choptank River at Greensboro, MD"
eList <- as.egret(INFO, Daily, NA, NA)
############################

############################
# Check flow history data:
plotFlowSingle(eList, istat=7,qUnit="thousandCfs")

plotSDLogQ(eList)

plotQTimeDaily(eList, qLower=1,qUnit=3)

plotFour(eList, qUnit=3)

plotFourStats(eList, qUnit=3)

############################

Model Archive

When using the WRTDS model, it is important to be able to reproduce the results in the future. The following version of R and package dependencies were used most recently to pass the embedded tests within this package. There is no guarantee of reproducible results using future versions of R or updated versions of package dependencies; however, we will make diligent efforts to test and update future modeling environments.

devtools::session_info()
#> Session info -------------------------------------------------------------
#>  setting  value                       
#>  version  R version 3.5.1 (2018-07-02)
#>  system   x86_64, mingw32             
#>  ui       RTerm                       
#>  language (EN)                        
#>  collate  English_United States.1252  
#>  tz       America/Chicago             
#>  date     2018-10-16
#> Packages -----------------------------------------------------------------
#>  package       * version    date       source                          
#>  assertthat      0.2.0      2017-04-11 CRAN (R 3.5.1)                  
#>  backports       1.1.2      2017-12-13 CRAN (R 3.5.0)                  
#>  base          * 3.5.1      2018-07-02 local                           
#>  bindr           0.1.1      2018-03-13 CRAN (R 3.5.1)                  
#>  bindrcpp        0.2.2      2018-03-29 CRAN (R 3.5.1)                  
#>  compiler        3.5.1      2018-07-02 local                           
#>  crayon          1.3.4      2017-09-16 CRAN (R 3.5.1)                  
#>  curl            3.2        2018-03-28 CRAN (R 3.5.1)                  
#>  dataRetrieval   2.7.4      2018-05-09 local                           
#>  datasets      * 3.5.1      2018-07-02 local                           
#>  devtools        1.13.6     2018-06-27 CRAN (R 3.5.1)                  
#>  digest          0.6.18     2018-10-10 CRAN (R 3.5.1)                  
#>  dotCall64       1.0-0      2018-07-30 CRAN (R 3.5.1)                  
#>  dplyr           0.7.6      2018-06-29 CRAN (R 3.5.1)                  
#>  EGRET         * 3.0.1      2018-10-16 local                           
#>  evaluate        0.12       2018-10-09 CRAN (R 3.5.1)                  
#>  fields          9.6        2018-01-29 CRAN (R 3.5.1)                  
#>  glue            1.3.0      2018-07-17 CRAN (R 3.5.1)                  
#>  graphics      * 3.5.1      2018-07-02 local                           
#>  grDevices     * 3.5.1      2018-07-02 local                           
#>  grid            3.5.1      2018-07-02 local                           
#>  hms             0.4.2      2018-03-10 CRAN (R 3.5.1)                  
#>  htmltools       0.3.6      2017-04-28 CRAN (R 3.5.1)                  
#>  httr            1.3.1      2017-08-20 CRAN (R 3.5.1)                  
#>  jsonlite        1.5        2017-06-01 CRAN (R 3.5.1)                  
#>  knitr           1.20       2018-02-20 CRAN (R 3.5.1)                  
#>  lattice         0.20-35    2017-03-25 CRAN (R 3.5.1)                  
#>  lubridate       1.7.4      2018-04-11 CRAN (R 3.5.1)                  
#>  magrittr        1.5        2014-11-22 CRAN (R 3.5.1)                  
#>  maps            3.3.0      2018-04-03 CRAN (R 3.5.1)                  
#>  Matrix          1.2-14     2018-04-13 CRAN (R 3.5.1)                  
#>  memoise         1.1.0      2017-04-21 CRAN (R 3.5.1)                  
#>  methods       * 3.5.1      2018-07-02 local                           
#>  pillar          1.3.0      2018-07-14 CRAN (R 3.5.1)                  
#>  pkgconfig       2.0.2      2018-08-16 CRAN (R 3.5.1)                  
#>  plyr            1.8.4      2016-06-08 CRAN (R 3.5.1)                  
#>  purrr           0.2.5      2018-05-29 CRAN (R 3.5.1)                  
#>  R6              2.3.0      2018-10-04 CRAN (R 3.5.1)                  
#>  Rcpp            0.12.19    2018-10-01 CRAN (R 3.5.1)                  
#>  readr           1.1.1      2017-05-16 CRAN (R 3.5.1)                  
#>  reshape2        1.4.3      2017-12-11 CRAN (R 3.5.1)                  
#>  rlang           0.2.2.9001 2018-09-25 Github (tidyverse/rlang@b7b06af)
#>  rmarkdown       1.10       2018-06-11 CRAN (R 3.5.1)                  
#>  rprojroot       1.3-2      2018-01-03 CRAN (R 3.5.1)                  
#>  spam            2.2-0      2018-06-19 CRAN (R 3.5.1)                  
#>  splines         3.5.1      2018-07-02 local                           
#>  stats         * 3.5.1      2018-07-02 local                           
#>  stringi         1.1.7      2018-03-12 CRAN (R 3.5.0)                  
#>  stringr         1.3.1      2018-05-10 CRAN (R 3.5.1)                  
#>  survival        2.42-3     2018-04-16 CRAN (R 3.5.1)                  
#>  tibble          1.4.2      2018-01-22 CRAN (R 3.5.1)                  
#>  tidyselect      0.2.5      2018-10-11 CRAN (R 3.5.1)                  
#>  tools           3.5.1      2018-07-02 local                           
#>  utils         * 3.5.1      2018-07-02 local                           
#>  withr           2.1.2      2018-03-15 CRAN (R 3.5.1)                  
#>  xml2            1.2.0      2018-01-24 CRAN (R 3.5.1)                  
#>  yaml            2.2.0      2018-07-25 CRAN (R 3.5.1)

Reporting bugs

Please consider reporting bugs and asking questions on the Issues page: https://github.com/USGS-R/EGRET/issues

Follow @USGS_R on Twitter for updates on USGS R packages:

Subscribe

Please email questions, comments, and feedback to: egret_comments@usgs.gov

Additionally, to subscribe to an email list concerning updates to these R packages, please send a request to egret_comments@usgs.gov.

Code of Conduct

We want to encourage a warm, welcoming, and safe environment for contributing to this project. See the code of conduct for more information.

Package Support

The Water Mission Area of the USGS has supported the development and maintenance of the EGRET R-package. Further maintenance is expected to be stable through September 2019. Resources are available primarily for maintenance and responding to user questions. Priorities on the development of new features are determined by the EGRET development team.

Sunset date

Funding for EGRET currently expires summer 2019. Expectations are that maintenance and customer service will continue to be supported past that date.

How to cite EGRET:

citation(package = "EGRET")
#> 
#> To cite EGRET in publications, please use:
#> 
#>   Hirsch, R.M., and De Cicco, L.A., 2015, User guide to
#>   Exploration and Graphics for RivEr Trends (EGRET) and
#>   dataRetrieval: R packages for hydrologic data (version 2.0,
#>   February 2015): U.S. Geological Survey Techniques and Methods
#>   book 4, chap. A10, 93 p., doi:10.3133/tm4A10
#> 
#> A BibTeX entry for LaTeX users is
#> 
#>   @InBook{,
#>     author = {Robert M. Hirsch and Laura A. {De Cicco}},
#>     title = {User guide to Exploration and Graphics for RivEr Trends (EGRET) and dataRetrieval: R packages for hydrologic data},
#>     publisher = {U.S. Geological Survey},
#>     address = {Reston, VA},
#>     booktitle = {Techniques and Methods},
#>     institution = {U.S. Geological Survey},
#>     year = {2015},
#>     chapter = {A10},
#>     url = {https://pubs.usgs.gov/tm/04/a10/},
#>   }

Disclaimer

This software has been approved for release by the U.S. Geological Survey (USGS). Although the software has been subjected to rigorous review, the USGS reserves the right to update the software as needed pursuant to further analysis and review. No warranty, expressed or implied, is made by the USGS or the U.S. Government as to the functionality of the software and related material nor shall the fact of release constitute any such warranty. Furthermore, the software is released on condition that neither the USGS nor the U.S. Government shall be held liable for any damages resulting from its authorized or unauthorized use.

Copy Link

Version

Install

install.packages('EGRET')

Monthly Downloads

815

Version

3.0.2

License

CC0

Issues

Pull Requests

Stars

Forks

Maintainer

Laura DeCicco

Last Published

February 8th, 2019

Functions in EGRET (3.0.2)

decimalHighLow

decimalHighLow
flowDuration

Computes several values of the flow duration curve for streamflow centered on a specific date of the year
estSurfaces

Estimate the three surfaces (for yHat, SE and ConcHat) as a function of DecYear and logQ and store in the three-dimensional object called surfaces
fixSampleFrame

Update Sample dataframe
logPretty3

Sets up tick marks for an axis with a log scale
plotFluxHist

Graph of annual flux and flow normalized flux versus year
plotFlowSingle

Creates a plot of a time series of a particular flow statistic and a lowess smooth of that flow statistic
multiPlotDataOverview

Produces a 4 panel plot that gives an overview of the data set prior to any processing
plot15

Makes 15 graphs of streamflow statistics on a single page
logPretty1

Sets up tick marks for an axis with a log scale, where the graph is small
fluxBiasMulti

Produces 8-panel plot that is useful for determining if there is a flux bias problem
Constants

Constants included with EGRET
plotFluxTimeDaily

Plot of the time series of daily flux estimates and the sample values for the days that were sampled
getDaily

Get Daily dataframe from EGRET object
flexPlotAddOn

Flexible Flow Normalization Plot Add On
flexFN

Flexible Flow Normalization
print.egret

EGRET helper functions
is.egret

Check for EGRET object
plot1of15

plots 1 of the 15 graphs of streamflow statistics on a single page
boxConcThree

Three box plots side-by-side
plotFour

Makes four graphs of streamflow statistics on a single page
populateSiteINFO

Populate Site Information Columns
runSurvReg

Run the weighted survival regression for a set of estimation points (defined by DecYear and Log(Q))
readNWISDaily

Import NWIS Daily Data for EGRET analysis
printFluxUnitCheatSheet

Reminder to user of flux unit properties (such as kg/day, tons/year, etc).
readDataFromFile

Basic Data Import for Water Flow Data
plotConcHist

Graph of annual concentration and flow normalized concentration versus year
Choptank_eList

Example eList
getInfo

Get INFO dataframe from EGRET object
boxQTwice

Two box plots side-by-side, discharge on sample days, and discharge on all days
fluxBiasStat

Compute the flux bias statistic: (mean of estimated flux - mean of observed flux) / mean of observed flux
EGRET-package

EGRET package includes WRTDS and flowHistory
estDailyFromSurfaces

Estimates all daily values of Concentration, Flux, Flow-Normalized Concentration, and Flow Normalized Flux
plotConcTimeDaily

Plot of the time series of daily concentration estimates and the sample values for the days that were sampled
plotConcTimeSmooth

Plot up to three curves representing the concentration versus time relationship, each curve representing a different flow.
plotResidPred

Plot of the residuals from WRTDS versus the estimated values (all in log concentration units)
plotResidQ

Plot of the residuals from WRTDS (in log concentration units) versus the discharge
INFOdataframe

Import metadata to create INFO data frame
fluxUnit-class

fluxUnit class
as.egret

Create named list for EGRET analysis
getSample

Get Sample dataframe from EGRET object
censoredSegments

Generic plotting function to create censored line segments
getSurfaces

Get surfaces matrix from EGRET object
makeAnnualSeries

Produces annual series of 8 streamflow statistics (and a lowess smooth of them) from daily streamflow data
plotConcQSmooth

Plot up to three curves representing the concentration versus discharge relationship. Each curve is a different point in time.
makeAugmentedSample

Create randomized residuals and observations for data sets that have some censored data
estCrossVal

Jack-Knife cross validation of the WRTDS (Weighted Regressions on Time, Discharge, and Season)
modelEstimation

Estimation process for the WRTDS (Weighted Regressions on Time, Discharge, and Season)
checkStartEndDate

checkStartEndDate
formatCheckDate

formatCheckDate
makeDateInfo

makeDateInfo
monthLabel-class

monthLabel class
processQWData

Processing of Water Quality Data
plotConcQ

Plot of Observed Concentration versus Discharge
plotConcPred

Plot of Observed Concentration versus Estimated Concentration
qUnit-class

qUnit class
populateConcentrations

Populate Concentration Columns
setupYears

Creates the AnnualResults data frame from the Daily data frame
populateDaily

Populate Daily data frame
formatCheckParameterCd

formatCheckParameterCd
stitch

stitch surfaces
startEnd

startEnd
surfaceIndex

Compute the 6 parameters needed to lay out the grid for the surfaces computed in estSurfaces
plotConcTime

Plot of Observed Concentration versus Time
plotFourStats

Makes four graphs of annual streamflow statistics on a single page
runPairs

Runs a comparison of any two years in the record.
plotQTimeDaily

Plot of the discharge time series
runSeries

Annual series of flow-normalized concentration and flow-normalzed flux
mergeReport

mergeReport
printSeries

Print annual results for a given streamflow statistic
plotContours

Color contour plot of the estimated surfaces as a function of discharge and time (surfaces include log concentration, standard error, and concentration)
plotDiffContours

Plots the difference between two years from a contour plot created by plotContours
generalAxis

Axis generation for log discharge
genericEGRETDotPlot

Generic EGRET plotting function
plotFluxPred

Graph of observed versus estimated flux
plotFluxQ

Sample data plot: observed log flux vs log discharge
plotResidTime

Plot of the residuals from WRTDS (in log concentration units) versus time
removeDuplicates

Remove duplicates values from Sample data frame.
printqUnitCheatSheet

Reminder to user of flow Unit properties such as cubic meters per second or thousands of cubic feet per second.
runGroups

Runs a comparison of any group of years in the record.
plotSDLogQ

Graph of the standard deviation of the log of daily discharge versus year
populateParameterINFO

Populate Parameter Information Columns
populateSampleColumns

Populate Sample Columns
readNWISSample

Import NWIS Sample Data for EGRET analysis
readUserDaily

Import user daily data for EGRET analysis
setSeasonLabelByUser

Creates a character variable that describes the period of analysis, when the period of analysis is being set by the user and not from AnnualResults
setUpEstimation

setUpEstimation
saveResults

A utility program for saving the contents of the workspace This function saves the workspace. It assigns the file a name using the abbreviations for station and constituent.
tableFlowChange

Prints table of change metrics for a given streamflow statistic
readUserSample

Import user-supplied sample data for EGRET analysis
setPA

Sets up the period of analysis (the portion of the year being evaluated).
readWQPSample

Import Sample Data from the Water Quality Portal for WRTDS
selectDays

Creates a subset Daily data frame that only contains daily estimates for the specified period of analysis
tableResults

Table of annual results for discharge, concentration and flux
setSeasonLabel

Create a character variable that describes the period of analysis, when period of analysis has already been set in AnnualResults
triCube

Tricube weight function
yPretty

Sets up tick marks for an axis for a graph with an arithmetic scale which starts at zero
surfaceStartEnd

Surface date limits
tableChange

Create a table of the changes in flow-normalized values between various points in time in the record
boxResidMonth

A box plot of WRTDS residuals by month
calculateMonthlyResults

Calculates monthly mean values of Q, Conc, Flux, FNConc, and FNFlux for the entire record.
populateDateColumns

Populate Date Columns
dataOverview

Data Overview for WRTDS
blankTime

Deletes the computed values during periods of time when there are no sample data
dateFormatCheck

Check date format
checkSurfaceSpan

checkSurfaceSpan
compressData

Compress sample data frame
boxConcMonth

Box plot of the water quality data by month