Minimum message length inference of secondary structure from protein coordinate data
- PMID:22689785
- PMCID: PMC3371855
- DOI: 10.1093/bioinformatics/bts223
Abstract
Motivation: Secondary structure underpins the folding pattern and architecture of most proteins. Accurate assignment of the secondary structure elements is therefore an important problem. Although many approximate solutions of the secondary structure assignment problem exist, the statement of the problem has resisted a consistent and mathematically rigorous definition. A variety of comparative studies have highlighted major disagreements in the way the available methods define and assign secondary structure to coordinate data.
Results: We report a new method to infer secondary structure based on the Bayesian method of minimum message length inference. It treats assignments of secondary structure as hypotheses that explain the given coordinate data. The method seeks to maximize the joint probability of a hypothesis and the data. There is a natural null hypothesis, and any assignment that cannot better it is unacceptable. We developed a program, SST, based on this approach and compared it with popular programs, including DSSP and STRIDE. Our evaluation suggests that SST gives reliable assignments even on low-resolution structures.
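The selection rule described above can be illustrated with a minimal Python sketch. It is not the SST implementation: the function names, the candidate labels and the toy probabilities are all assumptions made for illustration. It relies only on the standard two-part view of minimum message length, in which a hypothesis H and data D cost I(H) + I(D|H) = -log2 P(H) - log2 P(D|H) bits, so the shortest message corresponds to the highest joint probability P(H, D).

```python
import math


def message_length_bits(prior_prob, likelihood):
    """Two-part message length in bits: I(H) + I(D|H)."""
    return -math.log2(prior_prob) - math.log2(likelihood)


def select_hypothesis(candidates, null_length_bits):
    """Pick the candidate with the shortest two-part message.

    candidates: dict mapping a hypothetical label to (P(H), P(D|H)).
    null_length_bits: cost of stating the data under the null hypothesis.
    Returns (label, length); label is None when no candidate beats the null.
    """
    lengths = {
        label: message_length_bits(prior, likelihood)
        for label, (prior, likelihood) in candidates.items()
    }
    best = min(lengths, key=lengths.get)
    if lengths[best] < null_length_bits:
        return best, lengths[best]
    # No assignment betters the null hypothesis, so none is accepted.
    return None, null_length_bits


# Toy numbers (assumed, not taken from the paper).
toy_candidates = {
    "helix": (0.25, 0.5),     # 2 bits + 1 bit
    "strand": (0.5, 0.125),   # 1 bit + 3 bits
}
accepted = select_hypothesis(toy_candidates, null_length_bits=3.5)
rejected = select_hypothesis(toy_candidates, null_length_bits=2.0)
```

With these toy values the "helix" candidate has the shorter message and is accepted when the null costs 3.5 bits, while both candidates are rejected when the null costs only 2 bits.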
Availability: http://www.csse.monash.edu.au/~karun/sst.
