Tandem Repeats Provide Evidence for Convergent Evolution to Similar Protein Structures
- PMID:39852593
- PMCID: PMC11812678
- DOI: 10.1093/gbe/evaf013
Tandem Repeats Provide Evidence for Convergent Evolution to Similar Protein Structures
Abstract
Homology is a key concept underpinning the comparison of sequences across organisms. Sequence-level homology is based on a statistical framework optimized over decades of work. Recently, computational protein structure prediction has enabled large-scale homology inference beyond the limits of accurate sequence alignment. In this regime, it is possible to observe nearly identical protein structures lacking detectable sequence similarity. In the absence of a robust statistical framework for structure comparison, it is largely assumed similar structures are homologous. However, it is conceivable that matching structures could arise through convergent evolution, resulting in analogous proteins without shared ancestry. Large databases of predicted structures offer a means of determining whether analogs are present among structure matches. Here, I find that a small subset (∼2.6%) of Foldseek clusters lack sequence-level support for homology, including ∼1% of strong structure matches with template modeling score ≥ 0.5. This result by itself does not imply these structure pairs are nonhomologous, since their sequences could have diverged beyond the limits of recognition. Yet, strong matches without sequence-level support for homology are enriched in structures with predicted repeats that could induce spurious matches. Some of these structural repeats are underpinned by sequence-level tandem repeats in both matching structures. I show that many of these tandem repeat units have genealogies inconsistent with their corresponding structures sharing a common ancestor, implying these highly similar structure pairs are analogous rather than homologous. This result suggests caution is warranted when inferring homology from structural resemblance alone in the absence of sequence-level support for homology.
Keywords: TM-score; analogy; homology; protein structure search.
© The Author(s) 2025. Published by Oxford University Press on behalf of Society for Molecular Biology and Evolution.
Figures




Similar articles
- Expert Witness.Ronquillo Y, Robinson KJ, Kopitnik NL, Nouhan PP.Ronquillo Y, et al.2024 Dec 7. In: StatPearls [Internet]. Treasure Island (FL): StatPearls Publishing; 2025 Jan–.2024 Dec 7. In: StatPearls [Internet]. Treasure Island (FL): StatPearls Publishing; 2025 Jan–.PMID:28613772Free Books & Documents.
- Peer Play.Scott HK, Cogburn M.Scott HK, et al.2023 Jul 4. In: StatPearls [Internet]. Treasure Island (FL): StatPearls Publishing; 2025 Jan–.2023 Jul 4. In: StatPearls [Internet]. Treasure Island (FL): StatPearls Publishing; 2025 Jan–.PMID:30020595Free Books & Documents.
- Gadolinium Magnetic Resonance Imaging.Ibrahim MA, Hazhirkarzar B, Dublin AB.Ibrahim MA, et al.2023 Jul 3. In: StatPearls [Internet]. Treasure Island (FL): StatPearls Publishing; 2025 Jan–.2023 Jul 3. In: StatPearls [Internet]. Treasure Island (FL): StatPearls Publishing; 2025 Jan–.PMID:29494094Free Books & Documents.
- Exploring conceptual and theoretical frameworks for nurse practitioner education: a scoping review protocol.Wilson R, Godfrey CM, Sears K, Medves J, Ross-White A, Lambert N.Wilson R, et al.JBI Database System Rev Implement Rep. 2015 Oct;13(10):146-55. doi: 10.11124/jbisrir-2015-2150.JBI Database System Rev Implement Rep. 2015.PMID:26571290
- Depressing time: Waiting, melancholia, and the psychoanalytic practice of care.Salisbury L, Baraitser L.Salisbury L, et al.In: Kirtsoglou E, Simpson B, editors. The Time of Anthropology: Studies of Contemporary Chronopolitics. Abingdon: Routledge; 2020. Chapter 5.In: Kirtsoglou E, Simpson B, editors. The Time of Anthropology: Studies of Contemporary Chronopolitics. Abingdon: Routledge; 2020. Chapter 5.PMID:36137063Free Books & Documents.Review.
References
- Clementel D, Arrias PN, Mozaffari S, Osmanli Z, Castro XA; Repeats DBc curators, Ferrari C, Kajava AV, Tosatto SCE, Monzon AM. 2024. RepeatsDB in 2025: expanding annotations of structured tandem repeats proteins on AlphaFoldDB. Nucleic Acids Res. 53(D1):D575–D581. 10.1093/nar/gkae965. - DOI - PMC - PubMed
MeSH terms
Substances
Related information
Grants and funding
LinkOut - more resources
Full Text Sources