ChemFoundationModels/ChemLLMBenchPublic

NotificationsYou must be signed in to change notification settings
Fork6
Star139

What can Large Language Models do in chemistry? A comprehensive benchmark on eight tasks

139 stars 6 forks Branches Tags Activity

You must be signed in to change notification settings

Folders and files

Name		Name	Last commit message	Last commit date
Latest commit History 62 Commits
data		data
figures		figures
Molecule_Design.ipynb		Molecule_Design.ipynb
Name_Prediction.ipynb		Name_Prediction.ipynb
Property_Prediction.ipynb		Property_Prediction.ipynb
README.md		README.md
Reaction_Prediction.ipynb		Reaction_Prediction.ipynb
draft_frame3.png		draft_frame3.png
icl.png		icl.png
task_overview.png		task_overview.png
zero_shot.png		zero_shot.png

Repository files navigation

[NeurIPS 2023 Datasets and Benchmarks Track] ChemLLMBench ⚛

The official repository of"What can Large Language Models do in chemistry? A comprehensive benchmark on eight tasks".https://arxiv.org/abs/2305.18365

🆕 News

[Dec 2023] We updated all test dataset we used and we also add prompts we used (in log format) in the folder of each task. Please see data/ folder for more details!
[Sep 2023] Our paper has been accepted toNeurIPS 2023 Datasets and Benchmarks Track!
[Sep 2023] We released the second version (v2) of our paper, we added extra LLMs(GPT-4, GPT-3.5, Davinci-003, LLama2, Galactica) experiments; more baselines, and more investigations onSELFIES, and label interpretation!
[May 2023] We released the first version (v1) of our paper! Very glad to share our investigations and insights about LLM in chemistry!

💡 Tasks Overview

📌 Prompt

The followings are our prompt used in the paper. It's extremely easy to try your own designed prompt! Only need to change the prompt in the Jupyter code of each task and then we can see the results and performance.

Zero-shot Prompt

ICL Prompt

📊 Dataset

The datasets of some tasks are already uploaded in this repository.Becuase of the size limit, please download these datasets according to the link. After downloading these datasets, please move these datasets to the corresponding folder and then you can run our Jupyter code of each task.

Dataset	Link	Reference
USPTO_Mixed	download	https://github.com/MolecularAI/Chemformer
USPTO-50k	download	https://github.com/MolecularAI/Chemformer
ChEBI-20	download	https://github.com/blender-nlp/MolT5
Suzuki-miyaura	download	https://github.com/seokhokang/reaction_yield_nn
Butchward-Hariwig	download	https://github.com/seokhokang/reaction_yield_nn
BBBP,BACE,HIV,Tox21,Clintox	download	https://github.com/hwwang55/MolR
PubChem	download	https://github.com/ChemFoundationModels/ChemLLMBench/blob/main/data/name_prediction/llm_test.csv

🤗 Cite us

@misc{guo2023gpt,      title={What indeed can GPT models do in chemistry? A comprehensive benchmark on eight tasks},       author={Taicheng Guo and Kehan Guo and Bozhao Nan and Zhenwen Liang and Zhichun Guo and Nitesh V. Chawla and Olaf Wiest and Xiangliang Zhang},      year={2023},      eprint={2305.18365},      archivePrefix={arXiv},      primaryClass={cs.CL}}

🤗 Contact us

Taicheng Guo:tguo2@nd.edu

Xiangliang Zhang:xzhang33@nd.edu

About

What can Large Language Models do in chemistry? A comprehensive benchmark on eight tasks

arxiv.org/abs/2305.18365

Languages

Jupyter Notebook100.0%

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Folders and files

Latest commit

History

Repository files navigation

[NeurIPS 2023 Datasets and Benchmarks Track] ChemLLMBench ⚛

🆕 News

💡 Tasks Overview

📌 Prompt

Zero-shot Prompt

ICL Prompt

📊 Dataset

🤗 Cite us

🤗 Contact us

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages

Contributors2

Languages

Movatterモバイル変換

ChemFoundationModels/ChemLLMBench

Folders and files

Latest commit

History

Repository files navigation

[NeurIPS 2023 Datasets and Benchmarks Track] ChemLLMBench ⚛

🆕 News

💡 Tasks Overview

📌 Prompt

Zero-shot Prompt

ICL Prompt

📊 Dataset

🤗 Cite us

🤗 Contact us

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages0

Contributors2

Languages

Packages