What can Large Language Models do in chemistry? A comprehensive benchmark on eight tasks

ChemFoundationModels/ChemLLMBench

The official repository of "What can Large Language Models do in chemistry? A comprehensive benchmark on eight tasks". https://arxiv.org/abs/2305.18365

🆕 News

  • [Dec 2023] We updated all the test datasets we used and added the prompts we used (in log format) to the folder of each task. See the data/ folder for more details!
  • [Sep 2023] Our paper has been accepted to the NeurIPS 2023 Datasets and Benchmarks Track!
  • [Sep 2023] We released the second version (v2) of our paper. We added experiments with more LLMs (GPT-4, GPT-3.5, Davinci-003, Llama 2, Galactica), more baselines, and more investigations on SELFIES and label interpretation!
  • [May 2023] We released the first version (v1) of our paper! We are very glad to share our investigations and insights about LLMs in chemistry!

💡 Tasks Overview

[Figure: Task overview]

📌 Prompt

The following are the prompts used in the paper. It is easy to try a prompt of your own design: change the prompt in the Jupyter code of each task and rerun it to see the results and performance.

Zero-shot Prompt

[Figure: Zero-shot prompt]

ICL Prompt

[Figure: ICL prompt]
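To make the difference between the two prompt styles concrete, here is a minimal sketch in Python. It is not the template used in the paper (those are shown in the figures above and logged in the data/ folder). The function name `build_prompt`, the `Input:`/`Output:` layout, and the instruction text are all assumptions made for illustration.

```python
def build_prompt(task_instruction, query, examples=()):
    """Build a prompt string (hypothetical layout, not the paper's template).

    With no examples this is a zero-shot prompt; with (input, output)
    pairs in `examples` it becomes an in-context learning (ICL) prompt.
    """
    lines = [task_instruction.strip()]
    # Each demonstration is shown as a solved input/output pair.
    for source, target in examples:
        lines.append(f"Input: {source}")
        lines.append(f"Output: {target}")
    # The query comes last, with the output left open for the model.
    lines.append(f"Input: {query}")
    lines.append("Output:")
    return "\n".join(lines)


instruction = "Give the name of the molecule for the SMILES string."

# Zero-shot: instruction plus the query only.
zero_shot = build_prompt(instruction, "CO")

# ICL: the same query preceded by two solved demonstrations.
icl = build_prompt(
    instruction,
    "CO",
    examples=[("CCO", "ethanol"), ("CC(=O)O", "acetic acid")],
)
```

Swapping in your own instruction text or demonstrations only changes the arguments, which mirrors the idea of editing the prompt in each task's Jupyter code.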

📊 Dataset

The datasets for some tasks are already uploaded to this repository. Because of the size limit, the remaining datasets must be downloaded from the links below. After downloading them, move them to the corresponding folder; you can then run the Jupyter code of each task.

| Dataset | Link | Reference |
| --- | --- | --- |
| USPTO_Mixed | download | https://github.com/MolecularAI/Chemformer |
| USPTO-50k | download | https://github.com/MolecularAI/Chemformer |
| ChEBI-20 | download | https://github.com/blender-nlp/MolT5 |
| Suzuki-Miyaura | download | https://github.com/seokhokang/reaction_yield_nn |
| Buchwald-Hartwig | download | https://github.com/seokhokang/reaction_yield_nn |
| BBBP, BACE, HIV, Tox21, ClinTox | download | https://github.com/hwwang55/MolR |
| PubChem | download | https://github.com/ChemFoundationModels/ChemLLMBench/blob/main/data/name_prediction/llm_test.csv |
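Before running a task, it can help to check that the downloaded files ended up where the code expects them. The sketch below is a small helper for that. The function name `missing_datasets` is hypothetical, and the only path taken from this README is `data/name_prediction/llm_test.csv`; the paths for any other dataset are for you to fill in from the folder of each task.

```python
from pathlib import Path


def missing_datasets(repo_root, expected_files):
    """Return the expected dataset files that are not present under repo_root.

    `expected_files` holds paths relative to the repository root.
    """
    root = Path(repo_root)
    return [rel for rel in expected_files if not (root / rel).is_file()]


# Only this path appears in the README; extend the list for the tasks you run.
expected = ["data/name_prediction/llm_test.csv"]

missing = missing_datasets(".", expected)
if missing:
    print("Missing dataset files:", ", ".join(missing))
else:
    print("All expected dataset files are in place.")
```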

🤗 Cite us

@misc{guo2023gpt,
      title={What indeed can GPT models do in chemistry? A comprehensive benchmark on eight tasks},
      author={Taicheng Guo and Kehan Guo and Bozhao Nan and Zhenwen Liang and Zhichun Guo and Nitesh V. Chawla and Olaf Wiest and Xiangliang Zhang},
      year={2023},
      eprint={2305.18365},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

🤗 Contact us

Taicheng Guo: tguo2@nd.edu

Xiangliang Zhang: xzhang33@nd.edu

