stepB: Step B: Search and Download Sequences
Description
For any given project (see megapteraProj
), this step searches the Nucleotide database on GenBank, downloads all sequences and stores them in a postgreSQL database table.
Value
None. stepB
is called for its side effects: (1) strings of DNA sequences with attribute data are stored in a pgSQL database; (2) a log file is written to the current working directory.
Details
All accessions are stored under thier species name as appearing in the organism field at GenBank, information about infrageneric ranks is stripped of the taxon names before they are stored in the database.