plot.hmmer: Plot a Summary of HMMER Hit Statistics.

Description

Produces a number of basic plots that should facilitate hit selection from the match statistics of a HMMER result.

Usage

"plot"(x, cutoff = NULL, cut.seed=NULL, cluster=TRUE, mar=c(2, 5, 1, 1), cex=1.1, ...)

Arguments

HMMER results as obtained from the function hmmer.

cutoff

A numeric cutoff value, in terms of minus the log of the evalue, for returned hits. If null then the function will try to find a suitable cutoff near ‘cut.seed’ which can be used as an initial guide (see below).

cut.seed

A numeric seed cutoff value, used for initial cutoff estimation. If null then a seed position is set to the point of largest drop-off in normalized scores (i.e. the biggest jump in E-values).

cluster

Logical, if TRUE (and ‘cutoff’ is null) a clustering of normalized scores is performed to partition hits in groups by similarity to query. If FALSE the partition point is set to the point of largest drop-off in normalized scores.

mar

A numerical vector of the form c(bottom, left, top, right) which gives the number of lines of margin to be specified on the four sides of the plot.

cex

a numerical single element vector giving the amount by which plot labels should be magnified relative to the default.

...

extra plotting arguments.

Value

hits: an ordered matrix detailing the subset of hits with a normalized score above the chosen cutoff. Database identifiers are listed along with their cluster group number.
acc: a character vector containing the database accession identifier of each hit above the chosen threshold.
inds: a numeric vector containing the indices of the hits relative to the input hmmer object.

Details

Examining plots of HMMER scores, E-values and normalized scores (-log(E-Value), see ‘hmmer’ function) can aid in the identification sensible hit similarity thresholds.

If a ‘cutoff’ value is not supplied then a basic hierarchical clustering of normalized scores is performed with initial group partitioning implemented at a hopefully sensible point in the vicinity of ‘h=cut.seed’. Inspection of the resultant plot can then be use to refine the value of ‘cut.seed’ or indeed ‘cutoff’. As the ‘cutoff’ value can vary depending on the desired application and indeed the properties of the system under study it is envisaged that ‘plot.hmmer’ will be called multiple times to aid selection of a suitable ‘cutoff’ value. See the examples below for further details.

References

Grant, B.J. et al. (2006) Bioinformatics 22, 2695--2696.

Examples

Run this code

## Not run: 
# # HMMER server connection required - testing excluded
# 
# ##- PHMMER
# seq <- get.seq("2abl_A", outfile = tempfile())
# res <- hmmer(seq, db="pdb")
# plot.hmmer(res)
# 
# ## End(Not run)

Run the code above in your browser using DataLab