|
The GeneMark-P and GeneMark-E programs determine the protein-coding potential
of a DNA sequence (within a sliding window) by using species specific parameters
of the Markov models of coding and non-coding regions. This approach allows
deliniating local variations of coding potential, therefore, the GeneMark graph
shows details of the protein-coding potential distribution along a sequence.
GeneMark is documented as the most accurate prokaryotic gene finder.
GeneMark.hmm-P and GeneMark.hmm-E programs are predicting genes and intergenic
regions in a sequence as a whole. They use the Hidden Markov models reflecting
the "grammar" of gene organization. The GeneMark.hmm (P and E) programs identify
the maximum likely parse of the whole DNA sequence into protein coding genes
(with possible introns) and intergenic regions.
For more information see
Background
and
References.
* "P" stands for "Prokaryotic"
"E" stands for "Eukaryotic"
"S" stands for "Self-training"
|