mkin

The R package mkin provides calculation routines for the analysis of chemical degradation data, including multicompartment kinetics as needed for modelling the formation and decline of transformation products, or if several degradation compartments are involved. It provides stable functionality for kinetic evaluations according to the FOCUS guidance (see below for details). In addition, it provides functionality for hierarchical kinetics based on nonlinear mixed-effects models.

Installation

You can install the latest released version from CRAN from within R:

install.packages("mkin")

Background

In the regulatory evaluation of chemical substances like plant protection products (pesticides), biocides and other chemicals, degradation data play an important role. For the evaluation of pesticide degradation experiments, detailed guidance and various helpful tools have been developed, as detailed in 'Credits and historical remarks' below. This package aims to provide a one-stop solution for degradation kinetics, addressing modellers who are willing, or even prefer, to work with R.

Basic usage

For a start, have a look at the code examples provided for plot.mkinfit and plot.mmkin, and at the package vignettes FOCUS L and FOCUS D.
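
As a minimal sketch of a first session, the following fits single first-order (SFO) kinetics to FOCUS dataset C, which ships with the package as FOCUS_2006_C, and then extracts the degradation endpoints. The choice of dataset and model is for illustration only.

```r
library(mkin)

# Fit SFO kinetics to the parent-only FOCUS dataset C shipped with mkin
fit <- mkinfit("SFO", FOCUS_2006_C, quiet = TRUE)

# Full report of the fit (parameters, error level, endpoints)
print(summary(fit))

# Disappearance times (DT50, DT90) derived from the fitted parameters
dt <- endpoints(fit)$distimes
print(dt)

# Observed data and fitted curve
plot(fit)
```

Replacing "SFO" with another parent model shorthand such as "FOMC" or "DFOP" should give the corresponding fit for the same data.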

Documentation

The HTML documentation of the latest version released to CRAN is available at jrwb.de and on GitHub.

Documentation of the development version is found in the 'dev' subdirectory. In the articles section of this documentation, you can also find demonstrations of the application of nonlinear hierarchical models, also known as nonlinear mixed-effects models, to more complex data, including transformation products and covariates.

Features

General

  • Highly flexible model specification using mkinmod, including equilibrium reactions and using the single first-order reversible binding (SFORB) model, which will automatically create two state variables for the observed variable.
  • Model solution (forward modelling) in the function mkinpredict is performed either using the analytical solution for the case of parent only degradation or some simple models involving a single transformation product, an eigenvalue based solution if only simple first-order (SFO) or SFORB kinetics are used in the model, or using a numeric solver from the deSolve package (default is lsoda).
  • The usual one-sided t-test for significant difference from zero is shown based on estimators for the untransformed parameters.
  • Summary and plotting functions. The summary of an mkinfit object is in fact a full report that should give enough information to be able to approximately reproduce the fit with other tools.
  • The chi-squared error level as defined in the FOCUS kinetics guidance (see below) is calculated for each observed variable.
  • The 'variance by variable' error model which is often fitted using Iteratively Reweighted Least Squares (IRLS) can be specified as error_model = "obs".
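
The model specification and forward modelling described above can be sketched as follows. The compound name "degradinol", the rate constant and the initial amount are arbitrary illustrative values. The comparison against the closed-form SFO curve is only a sanity check added for this example.

```r
library(mkin)

# Parent-only SFO model; "degradinol" is an arbitrary compound name
sfo <- mkinmod(degradinol = mkinsub("SFO"))

# Parent with one metabolite, both with SFO kinetics (specification only)
sfo_sfo <- mkinmod(parent = mkinsub("SFO", "m1"), m1 = mkinsub("SFO"))

times <- 0:20
k <- 0.3    # illustrative rate constant
c0 <- 100   # illustrative initial amount

# Forward modelling with the analytical solution
pred_analytical <- mkinpredict(sfo, c(k_degradinol = k), c(degradinol = c0),
  times, solution_type = "analytical")

# Forward modelling with the deSolve solver (lsoda), without compiled code
pred_desolve <- mkinpredict(sfo, c(k_degradinol = k), c(degradinol = c0),
  times, solution_type = "deSolve", use_compiled = FALSE)

# Closed-form SFO curve for comparison
closed_form <- c0 * exp(-k * times)

diff_analytical <- max(abs(pred_analytical[, "degradinol"] - closed_form))
diff_desolve <- max(abs(pred_desolve[, "degradinol"] - closed_form))
print(c(analytical = diff_analytical, deSolve = diff_desolve))
```

Both differences should be very small, the numerical one being limited by the solver tolerances.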

Unique in mkin

  • Three different error models can be selected using the argument error_model to the mkinfit function. A two-component error model similar to the one proposed by Rocke and Lorenzato can be selected using the argument error_model = "tc".
  • Model comparisons using the Akaike Information Criterion (AIC) are supported. The AIC can also be used when the variance is not constant, a case in which the FOCUS chi-squared error level is not meaningful.
  • By default, kinetic rate constants and kinetic formation fractions are transformed internally using transform_odeparms so their estimators can more reasonably be expected to follow a normal distribution.
  • When parameter estimates are backtransformed to match the model definition, confidence intervals calculated from standard errors are also backtransformed to the correct scale, and will not include meaningless values like negative rate constants or formation fractions adding up to more than 1, which cannot occur in a single experiment with a single defined radiolabel position.
  • When a metabolite decline phase is not described well by SFO kinetics, SFORB kinetics can be used for the metabolite. Mathematically, the SFORB model is equivalent to the DFOP model. However, the SFORB model has the advantage that there is a mechanistic interpretation of the model parameters.
  • Nonlinear mixed-effects models (hierarchical models) can be created from fits of the same degradation model to different datasets for the same compound by using the nlme.mmkin and saem.mmkin methods. Note that the convergence of the nlme fits depends on the quality of the data. Convergence is better for simple models and data for many groups (e.g. soils). The saem method uses the saemix package as a backend. Analytical solutions suitable for use with this package have been implemented for parent only models and the most important models including one metabolite (SFO-SFO and DFOP-SFO). Fitting other models with saem.mmkin, while it makes use of the compiled ODE models that mkin provides, has longer run times (from a couple of minutes to more than an hour).
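
As a hedged illustration of the error_model argument and the AIC comparison mentioned above, the sketch below fits the same SFO model to FOCUS dataset C with constant variance and with the two-component error model. The dataset and the pair of error models are chosen only for the example. The two-component fit may emit warnings on such a small dataset.

```r
library(mkin)

# Same kinetic model, two different error models
f_const <- mkinfit("SFO", FOCUS_2006_C, error_model = "const", quiet = TRUE)
f_tc <- mkinfit("SFO", FOCUS_2006_C, error_model = "tc", quiet = TRUE)

# Compare the fits using the Akaike Information Criterion (lower is better)
aic_table <- AIC(f_const, f_tc)
print(aic_table)

# Backtransformed parameter estimates of the constant variance fit
print(parms(f_const))
```

The same pattern applies to error_model = "obs" when the data contain more than one observed variable.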

Performance

  • Parallel fitting of several models to several datasets is supported, see for example plot.mmkin.
  • If a C compiler is installed, the kinetic models are compiled from automatically generated C code, see vignette compiled_models. The autogeneration of C code was inspired by the ccSolve package. Thanks to Karline Soetaert for her work on that.
  • Even if no compiler is installed, many degradation models still perform very well: current versions of mkin also have analytical solutions for some models with one metabolite, and eigenvalue-based solutions of the degradation model are available if SFO or SFORB is used for the parent compound.
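
Fitting several models to several datasets in one call can be sketched as below. The selection of models and datasets is illustrative. cores = 1 is used here so that the example runs on any platform; a larger value can be tried where parallel execution is available.

```r
library(mkin)

# Two parent-only datasets shipped with mkin
datasets <- list("FOCUS B" = FOCUS_2006_B, "FOCUS C" = FOCUS_2006_C)

# Fit every model to every dataset; cores = 1 means serial execution
fits <- mmkin(c("SFO", "FOMC"), datasets, cores = 1, quiet = TRUE)

# Rows correspond to models, columns to datasets
print(dim(fits))

# Extract a single mkinfit object
fit_fomc_c <- fits[["FOMC", "FOCUS C"]]

# Compare the models fitted to one dataset
print(AIC(fits[, "FOCUS C"]))

# Plot all fits for one dataset
plot(fits[, "FOCUS C"])
```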

GUI

There is a graphical user interface that may be useful. Please refer to its documentation page for installation instructions and a manual. It only supports evaluations using (generalised) nonlinear regression, but not simultaneous fits using nonlinear mixed-effects models.

News

There is a list of changes for the latest CRAN release and one for each GitHub branch, e.g. the main branch.

Credits and historical remarks

mkin would not be possible without the underlying software stack consisting of, among others, R and the package deSolve. In previous versions, mkin also used the functionality of the FME package. Please refer to the package page on CRAN for the full list of imported and suggested R packages. Also, Debian Linux, the vim editor and the Nvim-R plugin have been invaluable in its development.

mkin could not have been written without me being introduced to regulatory fate modelling of pesticides by Adrian Gurney during my time at Harlan Laboratories Ltd (formerly RCC Ltd). mkin greatly profits from and largely follows the work done by the FOCUS Degradation Kinetics Workgroup, as detailed in their guidance document from 2006, slightly updated in 2011 and in 2014.

Also, it was inspired by the first version of KinGUI developed by Bayer Crop Science, which is based on the MATLAB runtime environment.

The companion package kinfit (now deprecated) was started in 2008 and first published on CRAN on 01 May 2010.

The first mkin code was published on 11 May 2010 and the first CRAN version on 18 May 2010.

In 2011, Bayer Crop Science started to distribute an R-based successor to KinGUI named KinGUII. Its R code is based on mkin, but it added, among other refinements, a closed source graphical user interface (GUI), iteratively reweighted least squares (IRLS) optimisation of the variance for each of the observed variables, and Markov Chain Monte Carlo (MCMC) simulation functionality, similar to what is available e.g. in the FME package.

Somewhat in parallel, Syngenta has sponsored the development of a GUI application called CAKE, based on mkin and KinGUII. It also adds IRLS and MCMC and is more limited in the model formulation, but puts more weight on usability. CAKE is available for download from the CAKE website, where you can also find a zip archive of the R scripts derived from mkin, published under the GPL license.

Finally, there is KineticEval, which contains some further development of the scripts used for KinGUII.

Thanks to René Lehmann, formerly working at the Umweltbundesamt, for the nice cooperation on parameter transformations, especially the isometric log-ratio transformation that is now used for formation fractions in case there are more than two transformation targets.

Many inspirations for improvements of mkin resulted from doing kinetic evaluations of degradation data for my clients while working at Harlan Laboratories and at Eurofins Regulatory AG, and now as an independent consultant.

Funding was received from the Umweltbundesamt in the course of the projects

  • Project Number 27452 (Testing and validation of modelling software as an alternative to ModelMaker 4.0, 2014-2015)

  • Project Number 56703 (Optimization of gmkin for routine use in the Umweltbundesamt, 2015)
  • Project Number 92570 (Update of Project Number 27452, 2017-2018)
  • Project Number 112407 (Testing the feasibility of using an error model according to Rocke and Lorenzato for more realistic parameter estimates in the kinetic evaluation of degradation data, 2018-2019)
  • Project Number 120667 (Development of objective criteria for the evaluation of the visual fit in the kinetic evaluation of degradation data, 2019-2020)
  • Project Number 146839 (Checking the feasibility of using mixed-effects models for the derivation of kinetic modelling parameters from degradation studies, 2020-2021)
  • Project Number 173340 (Application of nonlinear hierarchical models to the kinetic evaluation of chemical degradation data)

Thanks to everyone involved for collaboration and support!

Thanks are due also to Emmanuelle Comets, maintainer of the saemix package, for her interest and support for using the SAEM algorithm and its implementation in saemix for the evaluation of chemical degradation data.

Development

Contributions are welcome!

Monthly Downloads: 1,061

Version: 1.2.6

License: GPL

Last Published: October 14th, 2023

Functions in mkin (1.2.6)

  • f_time_norm_focus: Normalisation factors for aerobic soil degradation according to FOCUS guidance
  • HS.solution: Hockey-Stick kinetics
  • confint.mkinfit: Confidence intervals for parameters of mkinfit objects
  • ilr: Function to perform isometric log-ratio transformation
  • loftest: Lack-of-fit test for models fitted to data with replicates
  • hierarchical_kinetics: Hierarchical kinetics template
  • logLik.mkinfit: Calculate the log-likelihood of a fitted mkinfit object
  • create_deg_func: Create degradation functions for known analytical solutions
  • illparms: Method to get the names of ill-defined parameters
  • get_deg_func: Retrieve a degradation function from the mmkin namespace
  • endpoints: Function to calculate endpoints for further use from kinetic models fitted with mkinfit
  • intervals.saem.mmkin: Confidence intervals for parameters in saem.mmkin objects
  • experimental_data_for_UBA_2019: Experimental datasets used for development and testing of error models
  • lrtest.mkinfit: Likelihood ratio test for mkinfit models
  • llhist: Plot the distribution of log likelihoods from multistart objects
  • max_twa_parent: Function to calculate maximum time weighted average concentrations from kinetic models fitted with mkinfit
  • mkin_long_to_wide: Convert a dataframe from long to wide format
  • mccall81_245T: Datasets on aerobic soil metabolism of 2,4,5-T in six soils
  • mean_degparms: Calculate mean degradation parameters for an mmkin row object
  • mkinds: A dataset class for mkin
  • mkin_wide_to_long: Convert a dataframe with observations over time into long format
  • focus_soil_moisture: FOCUS default values for soil moisture contents at field capacity, MWHC and 1/3 bar
  • ds_mixed: Synthetic data for hierarchical kinetic degradation models
  • logLik.saem.mmkin: logLik method for saem.mmkin objects
  • mixed: Create a mixed effects model from an mmkin row object
  • mhmkin: Fit nonlinear mixed-effects models built from one or more kinetic degradation models and one or more error models
  • dimethenamid_2018: Aerobic soil degradation data on dimethenamid and dimethenamid-P from the EU assessment in 2018
  • mkindsg: A class for dataset groups for mkin
  • mkinfit: Fit a kinetic model to data with one or more state variables
  • logistic.solution: Logistic kinetics
  • mkinmod: Function to set up a kinetic model with one or more state variables
  • mkinerrplot: Function to plot squared residuals and the error model for an mkin object
  • mkinerrmin: Calculate the minimum error to assume in order to pass the variance test
  • mkinresplot: Function to plot residuals stored in an mkin object
  • mkinpredict: Produce predictions from a kinetic model using specific parameters
  • nlme_function: Helper functions to create nlme models from mmkin row objects
  • nafta: Evaluate parent kinetics using the NAFTA guidance
  • mkinplot: Plot the observed data and the fitted model of an mkinfit object
  • mkinparplot: Function to plot the confidence intervals obtained using mkinfit
  • mmkin: Fit one or more kinetic models with one or more state variables to one or more datasets
  • multistart: Perform a hierarchical model fit with multiple starting values
  • nlme.mmkin: Create an nlme model for an mmkin row object
  • nobs.mkinfit: Number of observations on which an mkinfit object was fitted
  • parms: Extract model parameters
  • reexports: Objects exported from other packages
  • read_spreadsheet: Read datasets and relevant meta information from a spreadsheet file
  • set_nd_nq: Set non-detects and unquantified values in residue series without replicates
  • plot.mmkin: Plot model fits (observed and fitted) and the residuals for a row or column of an mmkin object
  • summary.nlme.mmkin: Summary method for class "nlme.mmkin"
  • plot.nafta: Plot the results of the three models used in the NAFTA scheme
  • schaefer07_complex_case: Metabolism data set used for checking the software quality of KinGUI
  • plot.mixed.mmkin: Plot predictions from a fitted nonlinear mixed model obtained via an mmkin row object
  • plot.mkinfit: Plot the observed data and the fitted model of an mkinfit object
  • residuals.mkinfit: Extract residuals from an mkinfit model
  • saem: Fit nonlinear mixed models with SAEM
  • parplot: Plot parameter variability of multistart objects
  • summary.saem.mmkin: Summary method for class "saem.mmkin"
  • summary.mkinfit: Summary method for class "mkinfit"
  • summary.mmkin: Summary method for class "mmkin"
  • update.mkinfit: Update an mkinfit model with different arguments
  • synthetic_data_for_UBA_2014: Synthetic datasets for one parent compound with two metabolites
  • sigma_twocomp: Two-component error model
  • status: Method to get status information for fit array objects
  • summary_listing: Display the output of a summary function according to the output format
  • test_data_from_UBA_2014: Three experimental datasets from two water sediment systems and one soil
  • transform_odeparms: Functions to transform and backtransform kinetic parameters for fitting
  • FOCUS_2006_HS_ref_A_to_F: Results of fitting the HS model to Datasets A to F of FOCUS (2006)
  • [.mmkin: Subsetting method for mmkin objects
  • FOCUS_2006_SFO_ref_A_to_F: Results of fitting the SFO model to Datasets A to F of FOCUS (2006)
  • FOCUS_2006_FOMC_ref_A_to_F: Results of fitting the FOMC model to Datasets A to F of FOCUS (2006)
  • D24_2014: Aerobic soil degradation data on 2,4-D from the EU assessment in 2014
  • FOCUS_2006_datasets: Datasets A to F from the FOCUS Kinetics report from 2006
  • FOCUS_2006_DFOP_ref_A_to_B: Results of fitting the DFOP model to Datasets A to B of FOCUS (2006)
  • anova.saem.mmkin: Anova method for saem.mmkin objects
  • SFORB.solution: Single First-Order Reversible Binding kinetics
  • add_err: Add normally distributed errors to simulated kinetic degradation data
  • NAFTA_SOP_Attachment: Example datasets from Attachment 1 to the NAFTA SOP published 2015
  • FOMC.solution: First-Order Multi-Compartment kinetics
  • DFOP.solution: Double First-Order in Parallel kinetics
  • aw: Calculate Akaike weights for model averaging
  • AIC.mmkin: Calculate the AIC for a column of an mmkin object
  • CAKE_export: Export a list of datasets to a CAKE study file
  • SFO.solution: Single First-Order kinetics
  • IORE.solution: Indeterminate order rate equation kinetics
  • NAFTA_SOP_2015: Example datasets from the NAFTA SOP published 2015