FunClust: a web server for the identification of structural motifs in a set of non-homologous protein structures
- PMID:18387204
- PMCID: PMC2323665
- DOI: 10.1186/1471-2105-9-S2-S2
FunClust: a web server for the identification of structural motifs in a set of non-homologous protein structures
Abstract
Background: The occurrence of very similar structural motifs brought about by different parts of non homologous proteins is often indicative of a common function. Indeed, relatively small local structures can mediate binding to a common partner, be it a protein, a nucleic acid, a cofactor or a substrate. While it is relatively easy to identify short amino acid or nucleotide sequence motifs in a given set of proteins or genes, and many methods do exist for this purpose, much more challenging is the identification of common local substructures, especially if they are formed by non consecutive residues in the sequence.
Results: Here we describe a publicly available tool, able to identify common structural motifs shared by different non homologous proteins in an unsupervised mode. The motifs can be as short as three residues and need not to be contiguous or even present in the same order in the sequence. Users can submit a set of protein structures deemed or not to share a common function (e.g. they bind similar ligands, or share a common epitope). The server finds and lists structural motifs composed of three or more spatially well conserved residues shared by at least three of the submitted structures. The method uses a local structural comparison algorithm to identify subsets of similar amino acids between each pair of input protein chains and a clustering procedure to group similarities shared among different structure pairs.
Conclusions: FunClust is fast, completely sequence independent, and does not need an a priori knowledge of the motif to be found. The output consists of a list of aligned structural matches displayed in both tabular and graphical form. We show here examples of its usefulness by searching for the largest common structural motifs in test sets of non homologous proteins and showing that the identified motifs correspond to a known common functional feature.
Figures

Similar articles
- Improved K-means clustering algorithm for exploring local protein sequence motifs representing common structural property.Zhong W, Altun G, Harrison R, Tai PC, Pan Y.Zhong W, et al.IEEE Trans Nanobioscience. 2005 Sep;4(3):255-65. doi: 10.1109/tnb.2005.853667.IEEE Trans Nanobioscience. 2005.PMID:16220690
- CompariMotif: quick and easy comparisons of sequence motifs.Edwards RJ, Davey NE, Shields DC.Edwards RJ, et al.Bioinformatics. 2008 May 15;24(10):1307-9. doi: 10.1093/bioinformatics/btn105. Epub 2008 Mar 28.Bioinformatics. 2008.PMID:18375965
- Motif extraction and protein classification.Kunik V, Solan Z, Edelman S, Ruppin E, Horn D.Kunik V, et al.Proc IEEE Comput Syst Bioinform Conf. 2005:80-5. doi: 10.1109/csb.2005.39.Proc IEEE Comput Syst Bioinform Conf. 2005.PMID:16447965
- WebAllergen: a web server for predicting allergenic proteins.Riaz T, Hor HL, Krishnan A, Tang F, Li KB.Riaz T, et al.Bioinformatics. 2005 May 15;21(10):2570-1. doi: 10.1093/bioinformatics/bti356. Epub 2005 Mar 3.Bioinformatics. 2005.PMID:15746289
- Identifying property based sequence motifs in protein families and superfamilies: application to DNase-1 related endonucleases.Mathura VS, Schein CH, Braun W.Mathura VS, et al.Bioinformatics. 2003 Jul 22;19(11):1381-90. doi: 10.1093/bioinformatics/btg164.Bioinformatics. 2003.PMID:12874050
Cited by
- Mining protein loops using a structural alphabet and statistical exceptionality.Regad L, Martin J, Nuel G, Camproux AC.Regad L, et al.BMC Bioinformatics. 2010 Feb 4;11:75. doi: 10.1186/1471-2105-11-75.BMC Bioinformatics. 2010.PMID:20132552Free PMC article.
- Structural motifs recurring in different folds recognize the same ligand fragments.Ausiello G, Gherardini PF, Gatti E, Incani O, Helmer-Citterich M.Ausiello G, et al.BMC Bioinformatics. 2009 Jun 15;10:182. doi: 10.1186/1471-2105-10-182.BMC Bioinformatics. 2009.PMID:19527512Free PMC article.
- fPOP: footprinting functional pockets of proteins by comparative spatial patterns.Tseng YY, Chen ZJ, Li WH.Tseng YY, et al.Nucleic Acids Res. 2010 Jan;38(Database issue):D288-95. doi: 10.1093/nar/gkp900. Epub 2009 Oct 30.Nucleic Acids Res. 2010.PMID:19880384Free PMC article.
- A simple extension to the CMASA method for the prediction of catalytic residues in the presence of single point mutations.Flores DI, Sotelo-Mundo RR, Brizuela CA.Flores DI, et al.PLoS One. 2014 Sep 30;9(9):e108513. doi: 10.1371/journal.pone.0108513. eCollection 2014.PLoS One. 2014.PMID:25268770Free PMC article.
- Detecting mutually exclusive interactions in protein-protein interaction maps.Sánchez Claros C, Tramontano A.Sánchez Claros C, et al.PLoS One. 2012;7(6):e38765. doi: 10.1371/journal.pone.0038765. Epub 2012 Jun 8.PLoS One. 2012.PMID:22715412Free PMC article.
References
Publication types
MeSH terms
Substances
Related information
Grants and funding
LinkOut - more resources
Full Text Sources