Selecting the appropriate value for alleleMismatch
, cutHeight
, or
matchThreshold
is an important task. Use this function to assist in this
process. Typically the optimal value of any of these parameters is found where the
number of multiple matches is minimized (the majority of samples are similar to only
one unique genotype). Usually there is a minimum when these parameters are set to be
very sensitive to differences among samples (i.e., alleleMismatch
or
cutHeight
are 0, matchThreshold
is 1). Simulations suggest that the next
most sensitive minimum in multiple matches is the optimal value. This minimum will
often be associated with a drop in multiple matches as sensitivity drops. For more
discussion of this important step, see the Data S1 Supplementary documentation and
tutorials (PDF) located at <doi:10.1111/j.1755-0998.2012.03137.x>.
Using guessOptimum = TRUE
will attempt to estimate the location of this minimum
and add it to the profile plot. Manual assessment of this estimate using the plot is
strongly recommended.
If none of alleleMismatch
, cutHeight
, or matchThreshold
is given,
the function runs a sequence of values for alleleMismatch
as follows:
seq(from = 0, to = floor(ncol(amDatasetFocal$multilocus) * 0.4), by = 1)
multilocusMap
is often not required, as amDataset
objects will typically
consist of paired columns of genotypes, where each pair is a separate locus. In cases
where this is not the case (e.g., gender is given in only one column), a map vector
must be specified.
Example: amDataset
consists of gender followed by 4 diploid loci in paired
columns
multilocusMap = c(1, 2, 2, 3, 3, 4, 4, 5, 5)
or equally
multilocusMap=c("GENDER", "LOC1", "LOC1", "LOC2", "LOC2", "LOC3", "LOC4",
"LOC4")
For more information on selecting consensusMethod
see amCluster
.
The default consensusMethod = 1
is typically adequate.