Usage
makeOrganismDbFromBiomart(biomart="ENSEMBL_MART_ENSEMBL", dataset="hsapiens_gene_ensembl", transcript_ids=NULL, circ_seqs=DEFAULT_CIRC_SEQS, filter="", id_prefix="ensembl_", host="www.ensembl.org", port=80, miRBaseBuild=NA, keytype = "ENSEMBL")
Arguments
biomart
which BioMart database to use.
Get the list of all available BioMart databases with the
listMarts
function from the biomaRt
package. See the details section below for a list of BioMart
databases with compatible transcript annotations. dataset
which dataset from BioMart. For example:
"hsapiens_gene_ensembl"
, "mmusculus_gene_ensembl"
,
"dmelanogaster_gene_ensembl"
, "celegans_gene_ensembl"
,
"scerevisiae_gene_ensembl"
, etc in the ensembl database.
See the examples section below for how to discover which datasets
are available in a given BioMart database.
transcript_ids
optionally, only retrieve transcript
annotation data for the specified set of transcript ids.
If this is used, then the meta information displayed for the
resulting TxDb object will say 'Full dataset: no'.
Otherwise it will say 'Full dataset: yes'. This TxDb object
will be embedded in the resulting OrganismDb object. circ_seqs
a character vector to list out which chromosomes
should be marked as circular.
filter
Additional filters to use in the BioMart query. Must be
a named list. An example is filter=as.list(c(source="entrez"))
host
The host URL of the BioMart. Defaults to www.ensembl.org.
port
The port to use in the HTTP communication with the host.
id_prefix
Specifies the prefix used in BioMart attributes. For
example, some BioMarts may have an attribute specified as
"ensembl_transcript_id"
whereas others have the same attribute
specified as "transcript_id"
. Defaults to "ensembl_"
.
miRBaseBuild
specify the string for the appropriate build
Information from mirbase.db to use for microRNAs. This can be
learned by calling supportedMiRBaseBuildValues
. By default,
this value will be set to NA
, which will inactivate the
microRNAs
accessor.
keytype
This indicates the kind of key that this database will
use as a foreign key between it's TxDb object and it's OrgDb
object. So basically whatever the column name is for the foreign key
from your OrgDb that your TxDb will need to map it's GENEID on to.
By default it is "ENSEMBL" since the GENEID's for most biomaRt based
TxDbs will be ensembl gene ids and therefore they will need to map
to ENSEMBL gene mappings from the associated OrgDb object.