To assess the level of sequence conservation at each position in an
alignment, the “similarity”, “identity”, and
“entropy” per position can be calculated.
The “similarity” is defined as the average of the similarity
scores of all pairwise residue comparisons for that position in the
alignment, where the similarity score between any two residues is the
score value between those residues in the chosen substitution matrix
“sub.matrix”.
The “identity” i.e. the preference for a specific amino acid to
be found at a certain position, is assessed by averaging the identity
scores resulting from all possible pairwise comparisons at that position
in the alignment, where all identical residue comparisons are given a
score of 1 and all other comparisons are given a value of 0.
“Entropy” is based on Shannons information entropy. See the
entropy
function for further details.
Note that the returned scores are normalized so that conserved columns
score 1 and diverse columns score 0.