Functional associations of proteins in entire genomes by means of exhaustive detection of gene fusions
- PMID:11820254
- PMCID: PMC65099
- DOI: 10.1186/gb-2001-2-9-research0034
Functional associations of proteins in entire genomes by means of exhaustive detection of gene fusions
Abstract
Background: It has recently been shown that the detection of gene fusion events across genomes can be used for predicting functional associations of proteins, including physical interaction or complex formation. To obtain such predictions we have made an exhaustive search for gene fusion events within 24 available completely sequenced genomes.
Results: Each genome was used as a query against the remaining 23 complete genomes to detect gene fusion events. Using an improved, fully automatic protocol, a total of 7,224 single-domain proteins that are components of gene fusions in other genomes were detected, many of which were identified for the first time. The total number of predicted pairwise functional associations is 39,730 for all genomes. Component pairs were identified by virtue of their similarity to 2,365 multidomain composite proteins. We also show for the first time that gene fusion is a complex evolutionary process with a number of contributory factors, including paralogy, genome size and phylogenetic distance. On average, 9% of genes in a given genome appear to code for single-domain, component proteins predicted to be functionally associated. These proteins are detected by an additional 4% of genes that code for fused, composite proteins.
Conclusions: These results provide an exhaustive set of functionally associated genes and also delineate the power of fusion analysis for the prediction of protein interactions.
Figures





Comment in
- Rosetta Stone proteins: "chance and necessity"?Veitia RA.Veitia RA.Genome Biol. 2002;3(2):INTERACTIONS1001. doi: 10.1186/gb-2002-3-2-interactions1001. Epub 2002 Jan 8.Genome Biol. 2002.PMID:11864366Free PMC article.Review.
Similar articles
- The comparative genomics of protein interactions.Peregrín-Alvarez JM, Ouzounis CA.Peregrín-Alvarez JM, et al.Genome Inform. 2007;19:131-41.Genome Inform. 2007.PMID:18546511
- Evolution of gene fusions: horizontal transfer versus independent events.Yanai I, Wolf YI, Koonin EV.Yanai I, et al.Genome Biol. 2002;3(5):research0024. doi: 10.1186/gb-2002-3-5-research0024. Epub 2002 Apr 26.Genome Biol. 2002.PMID:12049665Free PMC article.
- Fusion and fission of genes define a metric between fungal genomes.Durrens P, Nikolski M, Sherman D.Durrens P, et al.PLoS Comput Biol. 2008 Oct;4(10):e1000200. doi: 10.1371/journal.pcbi.1000200. Epub 2008 Oct 24.PLoS Comput Biol. 2008.PMID:18949021Free PMC article.
- Genome and protein evolution in eukaryotes.Copley RR, Letunic I, Bork P.Copley RR, et al.Curr Opin Chem Biol. 2002 Feb;6(1):39-45. doi: 10.1016/s1367-5931(01)00278-2.Curr Opin Chem Biol. 2002.PMID:11827821Review.
- Experimental evidence validating the computational inference of functional associations from gene fusion events: a critical survey.Promponas VJ, Ouzounis CA, Iliopoulos I.Promponas VJ, et al.Brief Bioinform. 2014 May;15(3):443-54. doi: 10.1093/bib/bbs072. Epub 2012 Dec 5.Brief Bioinform. 2014.PMID:23220349Free PMC article.Review.
Cited by
- Popular computational methods to assess multiprotein complexes derived from label-free affinity purification and mass spectrometry (AP-MS) experiments.Armean IM, Lilley KS, Trotter MW.Armean IM, et al.Mol Cell Proteomics. 2013 Jan;12(1):1-13. doi: 10.1074/mcp.R112.019554. Epub 2012 Oct 15.Mol Cell Proteomics. 2013.PMID:23071097Free PMC article.Review.
- The STRING database in 2023: protein-protein association networks and functional enrichment analyses for any sequenced genome of interest.Szklarczyk D, Kirsch R, Koutrouli M, Nastou K, Mehryary F, Hachilif R, Gable AL, Fang T, Doncheva NT, Pyysalo S, Bork P, Jensen LJ, von Mering C.Szklarczyk D, et al.Nucleic Acids Res. 2023 Jan 6;51(D1):D638-D646. doi: 10.1093/nar/gkac1000.Nucleic Acids Res. 2023.PMID:36370105Free PMC article.
- Evolutionary history and functional implications of protein domains and their combinations in eukaryotes.Itoh M, Nacher JC, Kuma K, Goto S, Kanehisa M.Itoh M, et al.Genome Biol. 2007;8(6):R121. doi: 10.1186/gb-2007-8-6-r121.Genome Biol. 2007.PMID:17588271Free PMC article.
- Installation of LYRM proteins in early eukaryotes to regulate the metabolic capacity of the emerging mitochondrion.Dohnálek V, Doležal P.Dohnálek V, et al.Open Biol. 2024 May;14(5):240021. doi: 10.1098/rsob.240021. Epub 2024 May 22.Open Biol. 2024.PMID:38772414Free PMC article.
- On the detection of functionally coherent groups of protein domains with an extension to protein annotation.McLaughlin WA, Chen K, Hou T, Wang W.McLaughlin WA, et al.BMC Bioinformatics. 2007 Oct 16;8:390. doi: 10.1186/1471-2105-8-390.BMC Bioinformatics. 2007.PMID:17937820Free PMC article.
References
- Enright AJ, Iliopoulos I, Kyrpides NC, Ouzounis CA. Protein interaction maps for complete genomes based on gene fusion events. Nature. 1999;402:86–90. - PubMed
- Marcotte EM, Pellegrini M, Thompson MJ, Yeates TO, Eisenberg D. A combined algorithm for genome-wide prediction of protein function. Nature. 1999;402:83–86. - PubMed
- Marcotte EM, Pellegrini M, Ng H-L, Rice DW, Yeates TO, Eisenberg D. Detecting protein function and protein-protein interactions from genome sequences. Science. 1999;285:751–753. - PubMed
- Sali A. Functional links between proteins. Nature. 1999;402:23–26. - PubMed
- Doolittle RF. Do you dig my groove? Nat Genet. 1999;23:6–8. - PubMed
Publication types
MeSH terms
Substances
Related information
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases