- Notifications
You must be signed in to change notification settings - Fork78
composable-models/llm_multiagent_debate
Folders and files
| Name | Name | Last commit message | Last commit date | |
|---|---|---|---|---|
Repository files navigation
Yilun Du,Shuang Li,Antonio Torralba,Joshua B. Tenenbaum,Igor Mordatch
This is a preliminary implementation of the paper "Improving Factuality and Reasoning in Language Models through Multiagent Debate". More tasks and settings will be released soon.You may see some additional debate logshere.
Also, check out gauss5930's awesome implementation of multiagent debate on opensource LLMshere!
The code for running arithmetic, GSM, biographies, and MMLU tasks may be found in the following subfolders
- ./math/ contains code for running math
- ./gsm/ contains code for running gsm
- ./biography/ contains code for running biographies
- ./mmlu/ contains code for running mmlu results.
Math:
To generate and evaluated answer for Math problems through multiagent debate, cd into the math directory and run:python gen_math.py
Grade School Math:
To generate answers for Grade School Math problems through multiagent debate, cd into the gsm directory and run:python gen_gsm.py
To evaluate the generated results of Grade School Math problems:python eval_gsm.py
You can download the GSM datasethere
Biography:
To generate answers for Biography problems through multiagent debate, cd into the biography directory and run:python gen_conversation.py
To evaluate the generated results for Biography problems:python eval_conversation.py
MMLU:
To generate answers for MMLU through multiagent debate, cd into the MMLU directory and run:python gen_mmlu.py
To evaluate the generated results of MMLU:python eval_mmlu.py
You can download the MMLU datasethere
If you would like to cite the paper, here is a bibtex file:
@article{du2023improving, title={Improving Factuality and Reasoning in Language Models through Multiagent Debate}, author={Du, Yilun and Li, Shuang and Torralba, Antonio and Tenenbaum, Joshua B and Mordatch, Igor}, journal={arXiv preprint arXiv:2305.14325}, year={2023}}About
ICML 2024: Improving Factuality and Reasoning in Language Models through Multiagent Debate
Resources
Uh oh!
There was an error while loading.Please reload this page.
Stars
Watchers
Forks
Releases
Packages0
Uh oh!
There was an error while loading.Please reload this page.