- Notifications
You must be signed in to change notification settings - Fork1.3k
Releases: microsoft/agent-lightning
Agent Lightning v0.3.0
3b5d733Agent-lightning v0.3.0 is a major release that introduces several new features and bug fixes. The release is a collaborative effort between Agent-lightning core teams and the community. Thanks to all the contributors who made this release possible.
Highlights
- Tinker integration: Support Tinker as an alternative backend for Reinforcement Learning (#226#245#264#269#327). Seeexample code,blog 1 andblog 2.
- Azure OpenAI integration: Support Azure OpenAI as a backend for LLM inference and supervised fine-tuning (#256#327).Example code.
- MongoDB-based Lightning Store is added as an alternative backend for Lightning Store (#323).Documentation.
- Contrib package: Add contrib package for community projects. Search-R1 is integrated as a contrib recipe. More coming. (#239#396#410#412#417).
- RESTful API: Stabilize and document RESTful API for Lightning Store (#241#275).Documentation.
- OTel Semantic Conventions that are specifically designed for Agent-optimization areas (#340).Documentation.
- [Preview]Agent-lightning Dashboard is now available (#288#289#291#296#371#375). It's the official web application for inspecting and debugging Agent-lightning experiments. See detailshere.
- [Preview]Multi-modality example featuring VERL and a LangGraph agent on ChartQA dataset (#379).Example code.
- [Preview] IntegrateClaude Code as a LitAgent and support training on SWE-Bench (#332#346#348).Example code.
- [Preview]Weave tracer as a substitute for AgentOps tracer (#277#411#420#423).Documentation.
- [Preview]Trajectory Level Aggregation for more efficient training with VERL. Seeblog anddocumentation.

Store Benchmark
In this release, the Lightning Store core was redesigned for significantly greater efficiency and scalability (#315#318#328#342#344#356#380#388#418#421). The benchmark results below demonstrate the impact: with large numbers of concurrent runners, v0.3.0 delivers up to a 15x increase in throughput compared to v0.2.2.
| Throughput (#rollout/sec) | v0.2.2 | v0.3.0 (in-memory) | v0.3.0 (Mongo) |
|---|---|---|---|
| Minimal (batch, #runner=32, #turns=6) | 8.73 | 9.06 | 8.71 |
| Medium (batch, #runners=100, #turns=10) | 12.03 | 23.26 | 32.79 |
| Mid-high (batch, #runners=300, #turns=6) | 10.61 | 24.42 | 40.24 |
| Large (batch, #runners=1000, #turns=3) | 3.36 | 14.60 | 50.05 |
| Long queue (queue, #runners=256, #turns=4) | 7.42 | 30.86 | 57.01 |
| Heavy trace (queue, #runners=512, #turns=20) | 5.93 | 13.28 | 29.41 |
Notes:
- Benchmarks were run on a single Standard_D32as_v4 Azure VM (Large and heavy trace tests used Standard_D64ads_v5), executed via GitHub Actions.
- Two algorithm patterns are evaluated: the batch pattern submits a group of rollouts and waits for all to finish before starting the next group, while the queue pattern maintains a set number of in-flight rollouts, submitting new ones as soon as capacity frees up. Configuration details are availablehere.
- The number of turns is directly proportional to the number of spans each rollout generates.
Maintenance and Bug fixes
Core (Store, Interfaces, etc.)
- Add Trainer port option for client-server strategies (#198)
- Fix store port conflict handling (#227)
- Unified PythonServerLauncher (#286#292#303)
- Make health timeout configurable (#305)
- Refactor logging (#306)
- Support OTLP in LightningStore (#313)
- Centralized metrics helper (#368)
- Fix redundant cancel tracebacks on Ctrl+C (#370)
Proxy, Adapters and Algorithms
- Fix training metrics before and after processing in VERL (#145)
- Forward streaming requests for Anthropic and OpenAI APIs (as non-streaming requests) (#299)
- Check traces with reward for VERL (#317)
- Patch LiteLLM root span (#341)
- Handle ref_in_actor flag for LoRA compatibility (#386)
- Support
with_llm_proxyandwith_storein algorithms (#398) - Support image URL export in TracerTraceToTriplets (#400)
- Fix match_rewards assign_to elements in TraceTree (#403)
- Support customizing trainer and daemon in VERL (#407)
Runners, Tracers and Agents
- Refactor tracer initialization (#321)
- Fix OpenAI Agents 0.6 compatibility (#322)
emit_operation,emit_annotation, tags and links (#359)- Sunset HTTP tracer (#402)
Examples
- Fix typos in train-first-agent.md (#263)
- Fix room_selector example which always runs the first task (#270)
- Fix typo in SQL agent example (#285)
- Add the README and script files for training SQL agent on NPU (#272)
- Examples Catalog and Refine Contribution Guide (#331)
- Upgrade LangChain to 1.x (#364)
- Update RAG example to Agent-lightning v0.2.x (#349)
Miscellaneous
New Contributors
Warm welcome to our first-time contributors:@cptnm3,@TerryChan,@genji970,@zxgx,@xiaochulaoban,@lspinheiro,@Kwanghoon-Choi,@Vasuk12,@totoluo,@jinghuan-Chen 🎉
Full Changelog:v0.2.0...v0.3.0
Assets2
Uh oh!
There was an error while loading.Please reload this page.
Agent Lightning v0.2.2
289c9caAgent-lightning v0.2.2 is a stabilization release for v0.2.1. It introduces several bug fixes.
- Fix compatibility issues with VERL 0.6.0.
- Fix model name for pre-downloaded models in VERL.
- Fix preparing status transition on rollout when creating attempts.
- Fix OpenAI Agents SDK compatibility issues.
Full Changelog:v0.2.1...v0.2.2
Assets2
Uh oh!
There was an error while loading.Please reload this page.
Agent Lightning v0.2.1
01101a6Agent-lightning v0.2.1 is a stabilization release for v0.2.0. It introduces several bug fixes and new features, plus a number of unlisted CI improvements.
Bug fixes
- Fix LiteLLM issues when restarting the proxy multiple times in the same process (#174#206)
- Fix LiteLLM model name selection when multiple servers use the same model (#197)
- Fix store port conflict handling (#227)
New Features
- Add trainer port option for client-server strategies (#198)
Documentation
- Add tutorial for launching workers on separate machines (#213)
- Add link to VERL framework (#210)
- Add link to vLLM blog (#215)
- Fix a couple of typos and avoid emacs backup files (#237)
New Contributors
A warm welcome to our first-time contributors:@scott-vsi,@ddsfda99,@jeis4wpi 🎉
Full Changelog:v0.2.0...v0.2.1
Assets2
Uh oh!
There was an error while loading.Please reload this page.
Agent Lightning v0.2.0
22454adAgent-Lightning v0.2.0 introduces major framework improvements, new execution strategies, expanded documentation, and enhanced reliability across the agent training and deployment workflow. This release includes78 pull requests since v0.1.2.
Core Enhancements
- Lightning Store: Added unified interface and implementation for Agent-lightning's core storage.
- Emitter: Emitting any objects as spans to the store.
- Adapter andTracer: Adapting to OpenAI-like messages, and OpenTelemetry dummy tracer.
- LLM Proxy: Added LLM Proxy as the first-class citizen in Agent-lightning.
- Agent Runner: New version providing a more modular and robust runner design.
- Embedded Algorithms: Algorithms are now embedded directly into trainers for simplicity.
- New Execution Strategies: IntroducedClient-Server andShared Memory execution models.
- Trainer Updates: Integrated v0.2 interfaces and FastAlgorithm validation.
Documentation & Examples
- Revamped documentation with new guides foragent creation,training,debugging, andstore concepts.
- Improved quickstart tutorials, clarified installation and new deep-dive articles.
- Added and updated examples:SQL Agent,Calc-X,Local SFT,Search-R1, andAPO algorithm.
Developer Experience
- Migrated build and CI pipelines to1ES, split workflows and aggregate badges for clarity.
- Adopteduv as the dependency manager.
- Added GPU-based pytest workflows for full test coverage.
- Enhanced debugging UX, pre-commit configs, and linting (Pyright fixes, import sorting).
Ecosystem & Integrations
- Added support for agents built withAgent-framework.
- Added new community listings:DeepWerewolf andAgentFlow.
New Contributors
A warm welcome to our first-time contributors:
@hzy46,@lunaqiu,@syeehyn,@linhx1999,@SiyunZhao, and@acured 🎉
Full changelog:v0.1.2 → v0.2.0
Assets2
Uh oh!
There was an error while loading.Please reload this page.
Agent Lightning v0.1.2
3ec5ebdWhat's Changed
- Add basic documentation in#33
- RAG example by@wizardlancet in#21
New Contributors
- @wizardlancet made their first contribution in#21
Full Changelog:v0.1.1...v0.1.2
Assets2
Uh oh!
There was an error while loading.Please reload this page.
Agent Lightning v0.1.1
a71e754What's Changed
Full Changelog:v0.1...v0.1.1
Assets2
Uh oh!
There was an error while loading.Please reload this page.
Agent Lightning v0.1
160a82bThe first release of Agent-lightning!
- Turn your agent into an optimizable beast withZERO CODE CHANGE (almost)! 💤
- Build withANY agent framework (LangChain, OpenAI Agent SDK, AutoGen, CrewAI, ...); or even WITHOUT agent framework (Python OpenAI). You name it! 🤖
- Selectively optimize one or more agents in a multi-agent system. 🎯
- Embraces Reinforcement Learning, Automatic Prompt Optimization and morealgorithms. 🤗
Install viapip install agentlightning.
Assets2
Uh oh!
There was an error while loading.Please reload this page.