This function is used to calculate the maximum of \(\Delta G\) based on correlation matrix, which depends on locations, testers and replicates, with a grid search algorithm. The changing correlation matrix of three-stage selection are the testcross progenies of DH lines in one marker-assisted selection (MAS) stage and two phenotypic selection (PS) stages.
multistageoptimum.search (maseff=0.4, VGCAandE,
VSCA, CostProd, CostTest, Nf, Budget, N2grid,
N3grid, L2grid, L3grid, T2grid, T3grid, R2, R3, alg,
detail, fig,alpha.nursery,cost.nursery,
t2free,parallel.search)
If \(\texttt{detail}\) = FALSE, the output of this function is a vector of the optimum allocation i.e., which achieves the maximum \(\Delta G\). Otherwise, the result for all the grid points, which have been calculated, will be exported as a table in the Rgui.
is the efficiency of MAS.
is the vector of variance components of genetic effect, genotype \(\times\) location interaction, genotype \(\times\) year interaction, genotype \(\times\) location \(\times\) year interaction and the plot error. When VSCA
is specified, it refers to the general combining ability, otherwise it stands for genetic effect. The default value is 1,1,1,1,1. Variances types listed in Longin et al. (2007) can be used. E.g., VGCAandE="VC2"
will set the value as 1,0.5,0.5,1,2.
is the vector of variance components for specific combining ability.
contains the initial costs of producing or identifying a candidate in each stage.
contains a vector with length n reflecting the cost of evaluating a candidate in the tests performed at stage i, i=1,...,n. The cost might vary in different stages.
is the number of finally selected candidates.
contains the value of total budget.
is the vector of lower and upper limits as well as the grid width of number of candidates in the first field test stage.
is the vector of lower and upper limits as well as the grid width of number of candidates in the second field test stage.
is the vector of lower and upper limits of number of location as well as the width in the first field test stage.
is the vector of lower and upper limits of number of location as well as the width in the second field test stage.
is the vector of lower and upper limits of number of tester as well as the width in the first field test stage.
is the vector of lower and upper limits of number of tester as well as the width in the second field test stage.
is the number of replications in the first field test stage. By default it is 1.
is the number of replications in the second field test stage. By default it is 1.
is used to switch between two algorithms. If alg = GenzBretz()
, which is by default, the quasi-Monte Carlo algorithm from Genz et al. (2009, 2013), will be used. If alg = Miwa()
, the program will use the Miwa algorithm (Mi et al., 2009), which is an analytical solution of the MVN integral. Miwa's algorithm has higher accuracy (7 digits) than quasi-Monte Carlo algorithm (5 digits). However, its computational speed is slower. We recommend to use the Miwa algorithm.
is the control parameter to decide if the result of all the grids will be given (=TRUE
) or only the maximum (=FALSE
).
is the control parameter to decide if a contour plot will be saved in the default folder of R. The default value is FALSE
, which means no figure will be saved.
a value that should be 0<x<1, prelimitery test alpha fraction should be used for the stage 1. it is setted to 1 as default, when no prelimitery test "nursery stage".
a vector of length two c([cost of producing a DH line],[cost of testing a DH in nursery]). The default value is 0,0.
is a logical value. If =FALSE, the cost of using T3 and T2 testers will be accounted seperately. If =TRUE, the cost of using T3 and T2 testers will be accounted according to number of testers, i.e., CostProd=c(CostProd[1],CostProd[2]*T2,CostProd[3]*(T3-T2)
is a logical variable to desided if the multiple cores can be used for computing, by default is FALSE. The users have to notice that assign cores also cost time. So this procedure can only be efficient if the dim >5.
Xuefei Mi, Jose Marulanda
for the new added to parameters "alpha.nursery" and "cost.nursery" since v2.0.47:
After producing new DH lines, breeders do NOT go directly for a selection stage in the field, neither for genomic selection. Most of the times, they prefer to make a small field experiment (called "nursery") in which all DH lines are observed and discarded for other traits as disease resistance. That means, all DH lines with poor resistance will be discarded. At the end of the nursery stage only certain amount of DH lines (alpha) advance to the first selection stage (phenotypic or genomic). Specially in maize that makes sense, because in experience around 90 percent of the new DH lines are very weak in terms of per se performance what make them not suitable as new hybrid parents. Then, budget should not be used to make genotyping on or testcrossing with them. Only the alpha fraction should be used for entering the stage 1 of the multistageoptimum.search function.
More details are available in the Crop Science and Computational Statistics papers.
A. Genz and F. Bretz. Computation of Multivariate Normal and t Probabilities. Lecture Notes in Statistics, Vol. 195, Springer-Verlag, Heidelberg, 2009.
A. Genz, F. Bretz, T. Miwa, X. Mi, F. Leisch, F. Scheipl and T. Hothorn. mvtnorm: Multivariate normal and t distributions. R package version 0.9-9995, 2013.
E.L. Heffner, A.J. Lorenz, J.L. Jannink, and M.E. Sorrells. Plant breeding with genomic selection: gain per unit time and cost. Crop Sci. 50: 1681-1690, 2010.
X. Mi, T. Miwa and T. Hothorn. Implement of Miwa's analytical algorithm of multi-normal distribution. R Journal, 1:37-39, 2009.
selectiongain()
CostProd =c(0.5,1,1)
CostTest = c(0.5,1,1)
Budget=1021
# Budget is very small here to save time in package checking
# for the example in Heffner's paper, please change it to Budget=10021
VCGCAandError=c(0.4,0.2,0.2,0.4,2)
VCSCA=c(0.2,0.1,0.1,0.2)
Nf=10
multistageoptimum.search (maseff=0.4, VGCAandE=VCGCAandError,
VSCA=VCSCA, CostProd = c(0.5,1,1), CostTest = c(0.5,1,1),
Nf = 10, Budget = Budget, N2grid = c(11, 1211, 30),
N3grid = c(11, 211, 5), L2grid=c(1,3,1), L3grid=c(6,6,1),
#important note! by Xuefei Mi 2022-02-09
# in the paper L3grid=c(6,8,1) but please do not change it here, otherwise
# due to Budget =1021, the searching room will out of boudry
T2grid=c(1,2,1), T3grid=c(3,5,1), R2=1, R3=1, alg = Miwa(),
detail=TRUE, fig=TRUE, alpha.nursery=1)
Run the code above in your browser using DataLab