Learn R Programming

bio3d (version 2.1-2)

conserv: Score Residue Conservation At Each Position in an Alignment

Description

Quantifies residue conservation in a given protein sequence alignment by calculating the degree of amino acid variability in each column of the alignment.

Usage

conserv(x, method = c("similarity","identity","entropy22","entropy10"),
        sub.matrix = c("bio3d", "blosum62", "pam30", "other"),
        matrix.file = NULL, normalize.matrix = TRUE)

Arguments

x
an alignment list object with id and ali components, similar to that generated by read.fasta.
method
the conservation assesment method.
sub.matrix
a matrix to score conservation.
matrix.file
a file name of an arbitary user matrix.
normalize.matrix
logical, if TRUE the matrix is normalized pior to assesing conservation.

Value

  • Returns a numeric vector of scores

Details

To assess the level of sequence conservation at each position in an alignment, the similarity, identity, and entropy per position can be calculated.

The similarity is defined as the average of the similarity scores of all pairwise residue comparisons for that position in the alignment, where the similarity score between any two residues is the score value between those residues in the chosen substitution matrix sub.matrix.

The identity i.e. the preference for a specific amino acid to be found at a certain position, is assessed by averaging the identity scores resulting from all possible pairwise comparisons at that position in the alignment, where all identical residue comparisons are given a score of 1 and all other comparisons are given a value of 0.

Entropy is based on Shannons information entropy. See the entropy function for further details.

Note that the returned scores are normalized so that conserved columns score 1 and diverse columns score 0.

References

Grant, B.J. et al. (2006) Bioinformatics 22, 2695--2696. Grant, B.J. et al. (2007) J. Mol. Biol. 368, 1231--1248.

See Also

read.fasta, read.fasta.pdb

Examples

Run this code
## Read an example alignment
aln <- read.fasta(system.file("examples/hivp_xray.fa",package="bio3d"))

## Score conservation
conserv(x=aln$ali, method="similarity", sub.matrix="bio3d")
##conserv(x=aln$ali,method="entropy22", sub.matrix="other")

Run the code above in your browser using DataLab