HMMER

From SNIC Documentation
Revision as of 14:27, 17 February 2011 by Joel Hedlund (NSC) (talk | contribs)
Jump to: navigation, search

HMMER is a software package for working with profile hidden Markov models (HMM) of known regions in proteins.

An HMM is a statistical model that describes the known sequence variations within a specific group of proteins that may be of special interest; for example a protein family with known function, or a domain containing a well studied interaction surface or an active site. HMM is a machine learning technique [1] where the models are built from training examples that are known good members, and the finished models can be used to reliably classify and annotate new or poorly understood protein sequences in an automated fashion. Large libraries of trusted HMMs (such as Pfam) are of course immensely beneficial, as they can be used to automatically classify large portions of newly sequenced genomes, directly as they become available.