Converts data from a vcf file to onemap initial object, while identify the appropriate marker segregation patterns.
onemap_read_vcfR(
vcf = NULL,
vcfR.object = NULL,
cross = c("outcross", "f2 intercross", "f2 backcross", "ri self", "ri sib"),
parent1 = NULL,
parent2 = NULL,
f1 = NULL,
only_biallelic = TRUE,
output_info_rds = NULL,
verbose = TRUE
)
An object of class onemap
, i.e., a list with the following
components:
a matrix with integers indicating the genotypes read for each marker. Each column contains data for a marker and each row represents an individual.
number of individuals.
number of markers.
a vector with the
segregation type of each marker, as strings
.
a
vector with the segregation type of each marker, represented in a
simplified manner as integers, i.e. 1 corresponds to markers of type
"A"
; 2 corresponds to markers of type "B1.5"
; 3 corresponds
to markers of type "B2.6"
; 4 corresponds to markers of type
"B3.7"
; 5 corresponds to markers of type "C.8"
; 6 corresponds
to markers of type "D1"
and 7 corresponds to markers of type
"D2"
. Markers for F2 intercrosses are coded as 1; all other crosses
are left as NA
.
the name of the input file.
number of phenotypes.
a matrix with phenotypic values. Each column contains data for a trait and each row represents an individual.
matrix containing HMM emission probabilities
string defining the path to VCF file;
object of class vcfR;
type of cross. Must be one of: "outcross"
for full-sibs;
"f2 intercross"
for an F2 intercross progeny; "f2 backcross"
;
"ri self"
for recombinant inbred lines by self-mating; or
"ri sib"
for recombinant inbred lines by sib-mating.
string
specifying sample ID of the first parent. If f2 backcross population, define here the ID of the backcrossed parent.
string
specifying sample ID of the second parent.
string
if you are working with f2 intercross or backcross populations you may have f1 parents in you vcf, specify its ID here
if TRUE (default) only biallelic markers are considered, if FALSE multiallelic markers are included.
define a name for the file with alleles information.
A logical, if TRUE it output progress status information.
Cristiane Taniguti, chtaniguti@tamu.edu
Only biallelic SNPs and indels for diploid variant sites are considered.
Genotype information on the parents is required for all cross types. For full-sib progenies, both outbred parents must be genotyped. For backcrosses, F2 intercrosses and recombinant inbred lines, the original inbred lines must be genotyped. Particularly for backcross progenies, the recurrent line must be provided as the first parent in the function arguments.
Marker type is determined based on parental genotypes. Variants for which parent genotypes cannot be determined are discarded.
Reference sequence ID and position for each variant site are also stored.
read_onemap
for a description of the output object of class onemap.
# \donttest{
data <- onemap_read_vcfR(vcf=system.file("extdata/vcf_example_out.vcf.gz", package = "onemap"),
cross="outcross",
parent1=c("P1"),
parent2=c("P2"))
# }
Run the code above in your browser using DataLab