Learn R Programming

ape (version 5.5)

getAnnotationsGenBank: Read Annotations from GenBank

Description

This function connects to the GenBank database and reads sequence annotations using accession number(s) given as argument.

Usage

getAnnotationsGenBank(access.nb, quiet = TRUE)

Arguments

access.nb

a vector of mode character giving the accession numbers.

quiet

a logical value indicating whether to show the progress of the downloads.

Value

On of the followings: (i) a data frame if access.nb contains a single accession number; (ii) a list of data frames if access.nb contains several accession numbers, the names are set with access.nb (if some accession numbers are not found on GenBank, the corresponding entries are set to NULL); (iii) NULL if all accession numbers are not found on GenBank.

Details

The sequence annotations (a.k.a. feature list) are returned in a data frame with five or six columns: start, end, type, product, others, and gene (the last being optional). This is the same information that can be downloaded from NCBI's Web interface by clicking on `Send to:', `File', and then selecting `Feature Table' under `Format'.

A warning is given if some features are incomplete (this information is then dropped from the returned object).

A warning is given if some accession numbers are not found on GenBank.

References

https://www.ncbi.nlm.nih.gov/Sequin/table.html

See Also

read.GenBank, read.gff, DNAbin

Examples

Run this code
# NOT RUN {
## The 8 sequences of tanagers (Ramphocelus):
ref <- c("U15717", "U15718", "U15719", "U15720",
         "U15721", "U15722", "U15723", "U15724")
## Copy/paste or type the following commands if you
## want to try them.
# }
# NOT RUN {
annot.rampho <- getAnnotationsGenBank(ref)
annot.rampho
## check all annotations are the same:
unique(do.call(rbind, annot.rampho)[, -5])
# }

Run the code above in your browser using DataLab