tensorzero/llmgymPublic

NotificationsYou must be signed in to change notification settings
Fork2
Star27

License

Apache-2.0 license

27 stars 2 forks Branches Tags Activity

Star

Notifications

You must be signed in to change notification settings

Branches Tags

Folders and files

Name		Name	Last commit message	Last commit date
Latest commit History 145 Commits
.github/workflows		.github/workflows
docs		docs
examples		examples
llmgym		llmgym
tests		tests
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml

Repository files navigation

Important

This repository is still under active development. Expect breaking changes.

LLM Gym

LLM Gym is a unified environment interface for developing and benchmarking LLM applications that learn from feedback. Thinkgym for LLM agents.

As the space of benchmarks rapidly grows, fair and comprehensive comparisons are getting trickier, so we aim to make that easier for you. The vision is an intuitive interface for a suite of environments you can seamlessly swap out for research and development purposes.

LLM Gym includes the following environments:

BabyAI - Text-based versions ofBabyAI grid world environments for instruction following
Multi-Hop - Multi-hop question answering with iterative search and note-taking
NER - Named Entity Recognition tasks
Tau Bench -Customer service environments for airline and retail domains
Terminal Bench - Docker-basedterminal environments for solving programming and system administration tasks
Twenty-One Questions - The classic guessing game where agents ask yes/no questions to identify a secret

Quickstart

importllmgymfromllmgym.logsimportget_loggerfromllmgym.agentsimportOpenAIAgentenv=llmgym.make("21_questions_v0")agent=llmgym.agents.OpenAIAgent(model_name="gpt-4o-mini",function_configs=env.functions,tool_configs=env.tools,)# Get default horizonmax_steps=env.horizon# Reset the environmentreset_data=awaitenv.reset()obs=reset_data.observation# Run the episodefor_stepinrange(max_steps):# Get action from agentaction=awaitagent.act(obs)# Step the environmentstep_data=awaitenv.step(action)obs=step_data.observation# Check if the episode is donedone=step_data.terminatedorstep_data.truncatedifdone:breakenv.close()

This can also be run in theQuickstart Notebook.

Installation

Follow these steps to set up the development environment for LLM Gym using uv for virtual environment management and Hatch (with Hatchling) for building and packaging.

Prerequisites

Python 3.12 (or a compatible version, e.g., >=3.12, <4.0)
uv – an extremely fast Python package manager and virtual environment tool

Steps

1. Clone the Repository

Clone the repository to your local machine:

git clone git@github.com:tensorzero/gym-scratchpad.gitcd llmgym

2. Create and Activate a Virtual Environment

Use uv to create a virtual environment. This command will create a new environment (by default in the .venv directory) using Python 3.12:

uv venv --python 3.12

Activate the virtual environment:

source .venv/bin/activate

3. Install Project Dependencies

Install the project in editable mode along with its development dependencies:

uv pip install -e.

4. Verify the Installation

To ensure everything is set up correctly, you can run the tests or simply import the package in Python.

Run tests:

uv run pytest

Import the package in Python:

python>>> import llmgym>>> llmgym.__version__'0.0.0'

Setting Environment Variables

To set theOPENAI_API_KEY environment variable, run the following command:

export OPENAI_API_KEY="your_openai_api_key"

We recommend usingdirenv and creating a local.envrc file to manage environment variables. For example, the.envrc file might look like this:

export OPENAI_API_KEY="your_openai_api_key"

and then rundirenv allow to load the environment variables.

Tutorial

For a full tutorial, see theTutorial Notebook.

To see how to run multiple episodes concurrently, see theTau Bench or21 Questions notebooks.

For a supervised finetuning example, see theSupervised Finetuning Notebook.

About

No description, website, or topics provided.

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

License

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

LLM Gym

LLM Gym includes the following environments:

Quickstart

Installation

Prerequisites

Steps

1. Clone the Repository

2. Create and Activate a Virtual Environment

3. Install Project Dependencies

4. Verify the Installation

Setting Environment Variables

Tutorial

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages

Uh oh!

Contributors5

Uh oh!

Languages

Movatterモバイル変換

License

tensorzero/llmgym

Folders and files

Latest commit

History

Repository files navigation

LLM Gym

LLM Gym includes the following environments:

Quickstart

Installation

Prerequisites

Steps

1. Clone the Repository

2. Create and Activate a Virtual Environment

3. Install Project Dependencies

4. Verify the Installation

Setting Environment Variables

Tutorial

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages0

Uh oh!

Contributors5

Uh oh!

Languages

Packages