Computer Science > Machine Learning
arXiv:2406.09346 (cs)
[Submitted on 13 Jun 2024 (v1), last revised 25 Jun 2024 (this version, v2)]
Title:Scoreformer: A Surrogate Model For Large-Scale Prediction of Docking Scores
Authors:Álvaro Ciudad,Adrián Morales-Pastor,Laura Malo,Isaac Filella-Mercè,Victor Guallar,Alexis Molina
View a PDF of the paper titled Scoreformer: A Surrogate Model For Large-Scale Prediction of Docking Scores, by \'Alvaro Ciudad and 5 other authors
View PDFHTML (experimental)Abstract:In this study, we present ScoreFormer, a novel graph transformer model designed to accurately predict molecular docking scores, thereby optimizing high-throughput virtual screening (HTVS) in drug discovery. The architecture integrates Principal Neighborhood Aggregation (PNA) and Learnable Random Walk Positional Encodings (LRWPE), enhancing the model's ability to understand complex molecular structures and their relationship with their respective docking scores. This approach significantly surpasses traditional HTVS methods and recent Graph Neural Network (GNN) models in both recovery and efficiency due to a wider coverage of the chemical space and enhanced performance. Our results demonstrate that ScoreFormer achieves competitive performance in docking score prediction and offers a substantial 1.65-fold reduction in inference time compared to existing models. We evaluated ScoreFormer across multiple datasets under various conditions, confirming its robustness and reliability in identifying potential drug candidates rapidly.
Comments: | Accepted at the 1st Machine Learning for Life and Material Sciences Workshop at ICML 2024 |
Subjects: | Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Biomolecules (q-bio.BM) |
Cite as: | arXiv:2406.09346 [cs.LG] |
(orarXiv:2406.09346v2 [cs.LG] for this version) | |
https://doi.org/10.48550/arXiv.2406.09346 arXiv-issued DOI via DataCite |
Submission history
From: Alexis Molina [view email][v1] Thu, 13 Jun 2024 17:31:02 UTC (481 KB)
[v2] Tue, 25 Jun 2024 13:25:08 UTC (498 KB)
Full-text links:
Access Paper:
- View PDF
- HTML (experimental)
- TeX Source
- Other Formats
View a PDF of the paper titled Scoreformer: A Surrogate Model For Large-Scale Prediction of Docking Scores, by \'Alvaro Ciudad and 5 other authors
Current browse context:
cs.LG
References & Citations
Bibliographic and Citation Tools
Bibliographic Explorer(What is the Explorer?)
Connected Papers(What is Connected Papers?)
Litmaps(What is Litmaps?)
scite Smart Citations(What are Smart Citations?)
Code, Data and Media Associated with this Article
alphaXiv(What is alphaXiv?)
CatalyzeX Code Finder for Papers(What is CatalyzeX?)
DagsHub(What is DagsHub?)
Gotit.pub(What is GotitPub?)
Hugging Face(What is Huggingface?)
Papers with Code(What is Papers with Code?)
ScienceCast(What is ScienceCast?)
Demos
Recommenders and Search Tools
Influence Flower(What are Influence Flowers?)
CORE Recommender(What is CORE?)
IArxiv Recommender(What is IArxiv?)
arXivLabs: experimental projects with community collaborators
arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.
Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.
Have an idea for a project that will add value for arXiv's community?Learn more about arXivLabs.