Get amino acid and chemical compositions of proteins.
protcomp(uniprot = NULL, basis = "rQEC",
aa = NULL, aa_file = NULL)
character, UniProt IDs of proteins
character, keyword for basis species to use
data frame, amino acid compositions
character, file name
The function returns a list with elements uniprot
(UniProt IDs as given in the arguments), protein.formula
(elemental compositions of the proteins), ZC
(average oxidation state of carbon), protein.basis
(compositions of the proteins in terms of the basis species), protein.length
(lengths of the amino acid sequences), residue.basis
(per-residue compositions of the proteins in terms of the basis species), residue.formula
(per-residue elemental compositions of the proteins), and aa
(amino acid compositions of the proteins).
This function retrieves the amino acid compositions of one or more proteins specified by uniprot
or ip
, then calculates some chemical compositional properties using functions provided by CHNOSZ.
The basis
argument is used to select the basis species using a keyword (see basis
).
The default is rQEC where the number of is residual-corrected stoichiometric water content of amino acids (see metrics
).
Other options are CHNOS for , , , , and , or QEC for glutamine, glutamic acid, cysteine, , and .
This function depends on the amino acid compositions of human proteins, which are stored in the human
environment when the package is attached.
If aa_file
is specified, additional amino acid compositions to be considered are read from this file, which should be in the same format as e.g. human_extra.csv
(see also thermo$protein
).
Alternatively, the amino acid compositions can be given in aa
, bypassing the search step.
# NOT RUN {
library(CHNOSZ)
protcomp("P24298")
# }
Run the code above in your browser using DataLab