Movatterモバイル変換

microsoft/agent-lightningPublic

NotificationsYou must be signed in to change notification settings
Fork1.3k
Star15.1k

Agent Lightning v0.3.0

24 Dec 01:54

ultmaster

v0.3.0

3b5d733

This commit was created on GitHub.com and signed with GitHub’sverified signature.

GPG key ID:B5690EEEBB952194

Verified

Learn about vigilant mode.

Agent Lightning v0.3.0Latest

Latest

Agent-lightning v0.3.0 is a major release that introduces several new features and bug fixes. The release is a collaborative effort between Agent-lightning core teams and the community. Thanks to all the contributors who made this release possible.

Highlights

Tinker integration: Support Tinker as an alternative backend for Reinforcement Learning (#226 #245 #264 #269 #327). Seeexample code,blog 1 andblog 2.
Azure OpenAI integration: Support Azure OpenAI as a backend for LLM inference and supervised fine-tuning (#256 #327).Example code.
MongoDB-based Lightning Store is added as an alternative backend for Lightning Store (#323).Documentation.
Contrib package: Add contrib package for community projects. Search-R1 is integrated as a contrib recipe. More coming. (#239 #396 #410 #412 #417).
RESTful API: Stabilize and document RESTful API for Lightning Store (#241 #275).Documentation.
OTel Semantic Conventions that are specifically designed for Agent-optimization areas (#340).Documentation.
[Preview]Agent-lightning Dashboard is now available (#288 #289 #291 #296 #371 #375). It's the official web application for inspecting and debugging Agent-lightning experiments. See detailshere.
[Preview]Multi-modality example featuring VERL and a LangGraph agent on ChartQA dataset (#379).Example code.
[Preview] IntegrateClaude Code as a LitAgent and support training on SWE-Bench (#332 #346 #348).Example code.
[Preview]Weave tracer as a substitute for AgentOps tracer (#277 #411 #420 #423).Documentation.
[Preview]Trajectory Level Aggregation for more efficient training with VERL. Seeblog anddocumentation.

Store Benchmark

In this release, the Lightning Store core was redesigned for significantly greater efficiency and scalability (#315 #318 #328 #342 #344 #356 #380 #388 #418 #421). The benchmark results below demonstrate the impact: with large numbers of concurrent runners, v0.3.0 delivers up to a 15x increase in throughput compared to v0.2.2.

Throughput (#rollout/sec)	v0.2.2	v0.3.0 (in-memory)	v0.3.0 (Mongo)
Minimal (batch, #runner=32, #turns=6)	8.73	9.06	8.71
Medium (batch, #runners=100, #turns=10)	12.03	23.26	32.79
Mid-high (batch, #runners=300, #turns=6)	10.61	24.42	40.24
Large (batch, #runners=1000, #turns=3)	3.36	14.60	50.05
Long queue (queue, #runners=256, #turns=4)	7.42	30.86	57.01
Heavy trace (queue, #runners=512, #turns=20)	5.93	13.28	29.41

Notes:

Benchmarks were run on a single Standard_D32as_v4 Azure VM (Large and heavy trace tests used Standard_D64ads_v5), executed via GitHub Actions.
Two algorithm patterns are evaluated: the batch pattern submits a group of rollouts and waits for all to finish before starting the next group, while the queue pattern maintains a set number of in-flight rollouts, submitting new ones as soon as capacity frees up. Configuration details are availablehere.
The number of turns is directly proportional to the number of spans each rollout generates.

Maintenance and Bug fixes

Core (Store, Interfaces, etc.)

Add Trainer port option for client-server strategies (#198)
Fix store port conflict handling (#227)
Unified PythonServerLauncher (#286 #292 #303)
Make health timeout configurable (#305)
Refactor logging (#306)
Support OTLP in LightningStore (#313)
Centralized metrics helper (#368)
Fix redundant cancel tracebacks on Ctrl+C (#370)

Proxy, Adapters and Algorithms

Fix training metrics before and after processing in VERL (#145)
Forward streaming requests for Anthropic and OpenAI APIs (as non-streaming requests) (#299)
Check traces with reward for VERL (#317)
Patch LiteLLM root span (#341)
Handle ref_in_actor flag for LoRA compatibility (#386)
Supportwith_llm_proxy andwith_store in algorithms (#398)
Support image URL export in TracerTraceToTriplets (#400)
Fix match_rewards assign_to elements in TraceTree (#403)
Support customizing trainer and daemon in VERL (#407)

Runners, Tracers and Agents

Refactor tracer initialization (#321)
Fix OpenAI Agents 0.6 compatibility (#322)
emit_operation,emit_annotation, tags and links (#359)
Sunset HTTP tracer (#402)

Examples

Fix typos in train-first-agent.md (#263)
Fix room_selector example which always runs the first task (#270)
Fix typo in SQL agent example (#285)
Add the README and script files for training SQL agent on NPU (#272)
Examples Catalog and Refine Contribution Guide (#331)
Upgrade LangChain to 1.x (#364)
Update RAG example to Agent-lightning v0.2.x (#349)

Miscellaneous

DeepWiki Badge (#263)
Add AGENTS.md (#374)

New Contributors

Warm welcome to our first-time contributors:@cptnm3,@TerryChan,@genji970,@zxgx,@xiaochulaoban,@lspinheiro,@Kwanghoon-Choi,@Vasuk12,@totoluo,@jinghuan-Chen 🎉

Full Changelog:v0.2.0...v0.3.0

Contributors

TerryChan, lspinheiro, and 8 other contributors

Assets2

3 people reacted

Agent Lightning v0.2.2

12 Nov 16:06

ultmaster

v0.2.2

289c9ca

This commit was created on GitHub.com and signed with GitHub’sverified signature.

GPG key ID:B5690EEEBB952194

Verified

Learn about vigilant mode.

Agent Lightning v0.2.2

Agent-lightning v0.2.2 is a stabilization release for v0.2.1. It introduces several bug fixes.

Fix compatibility issues with VERL 0.6.0.
Fix model name for pre-downloaded models in VERL.
Fix preparing status transition on rollout when creating attempts.
Fix OpenAI Agents SDK compatibility issues.

Full Changelog:v0.2.1...v0.2.2

Assets2

Agent Lightning v0.2.1

30 Oct 01:48

ultmaster

v0.2.1

01101a6

This commit was created on GitHub.com and signed with GitHub’sverified signature.

GPG key ID:B5690EEEBB952194

Verified

Learn about vigilant mode.

Agent Lightning v0.2.1

Agent-lightning v0.2.1 is a stabilization release for v0.2.0. It introduces several bug fixes and new features, plus a number of unlisted CI improvements.

Bug fixes

Fix LiteLLM issues when restarting the proxy multiple times in the same process (#174 #206)
Fix LiteLLM model name selection when multiple servers use the same model (#197)
Fix store port conflict handling (#227)

New Features

Add trainer port option for client-server strategies (#198)

Documentation

Add tutorial for launching workers on separate machines (#213)
Add link to VERL framework (#210)
Add link to vLLM blog (#215)
Fix a couple of typos and avoid emacs backup files (#237)

New Contributors

A warm welcome to our first-time contributors:@scott-vsi,@ddsfda99,@jeis4wpi 🎉

Full Changelog:v0.2.0...v0.2.1

Contributors

scott-vsi, jeis4wpi, and ddsfda99

Assets2

Agent Lightning v0.2.0

22 Oct 06:09

ultmaster

v0.2.0

22454ad

This commit was created on GitHub.com and signed with GitHub’sverified signature.

GPG key ID:B5690EEEBB952194

Verified

Learn about vigilant mode.

Agent Lightning v0.2.0

Agent-Lightning v0.2.0 introduces major framework improvements, new execution strategies, expanded documentation, and enhanced reliability across the agent training and deployment workflow. This release includes78 pull requests since v0.1.2.

Core Enhancements

Lightning Store: Added unified interface and implementation for Agent-lightning's core storage.
Emitter: Emitting any objects as spans to the store.
Adapter andTracer: Adapting to OpenAI-like messages, and OpenTelemetry dummy tracer.
LLM Proxy: Added LLM Proxy as the first-class citizen in Agent-lightning.
Agent Runner: New version providing a more modular and robust runner design.
Embedded Algorithms: Algorithms are now embedded directly into trainers for simplicity.
New Execution Strategies: IntroducedClient-Server andShared Memory execution models.
Trainer Updates: Integrated v0.2 interfaces and FastAlgorithm validation.

Documentation & Examples

Revamped documentation with new guides foragent creation,training,debugging, andstore concepts.
Improved quickstart tutorials, clarified installation and new deep-dive articles.
Added and updated examples:SQL Agent,Calc-X,Local SFT,Search-R1, andAPO algorithm.

Developer Experience

Migrated build and CI pipelines to1ES, split workflows and aggregate badges for clarity.
Adopteduv as the dependency manager.
Added GPU-based pytest workflows for full test coverage.
Enhanced debugging UX, pre-commit configs, and linting (Pyright fixes, import sorting).

Ecosystem & Integrations

Added support for agents built withAgent-framework.
Added new community listings:DeepWerewolf andAgentFlow.

New Contributors

A warm welcome to our first-time contributors:
@hzy46,@lunaqiu,@syeehyn,@linhx1999,@SiyunZhao, and@acured 🎉

Full changelog:v0.1.2 → v0.2.0

Contributors

hzy46, acured, and 4 other contributors

Assets2

Agent Lightning v0.1.2

12 Aug 07:48

ultmaster

v0.1.2

3ec5ebd

This commit was created on GitHub.com and signed with GitHub’sverified signature.

GPG key ID:B5690EEEBB952194

Verified

Learn about vigilant mode.

Agent Lightning v0.1.2

What's Changed

Add basic documentation in#33
RAG example by@wizardlancet in#21

New Contributors

@wizardlancet made their first contribution in#21

Full Changelog:v0.1.1...v0.1.2

Contributors

wizardlancet

Assets2

Agent Lightning v0.1.1

06 Aug 00:49

ultmaster

v0.1.1

a71e754

This commit was created on GitHub.com and signed with GitHub’sverified signature.

GPG key ID:B5690EEEBB952194

Verified

Learn about vigilant mode.

Agent Lightning v0.1.1

What's Changed

Disable HTTP tracer tests and bump to 0.1.1 in#26
Fix trainer bugs in v0.1 in#24

Full Changelog:v0.1...v0.1.1

Assets2

Agent Lightning v0.1

04 Aug 10:22

ultmaster

v0.1

160a82b

This commit was created on GitHub.com and signed with GitHub’sverified signature.

GPG key ID:B5690EEEBB952194

Verified

Learn about vigilant mode.

Agent Lightning v0.1

The first release of Agent-lightning!

Turn your agent into an optimizable beast withZERO CODE CHANGE (almost)! 💤
Build withANY agent framework (LangChain, OpenAI Agent SDK, AutoGen, CrewAI, ...); or even WITHOUT agent framework (Python OpenAI). You name it! 🤖
Selectively optimize one or more agents in a multi-agent system. 🎯
Embraces Reinforcement Learning, Automatic Prompt Optimization and morealgorithms. 🤗

Install viapip install agentlightning.

Assets2

Movatterモバイル変換

Releases: microsoft/agent-lightning

Agent Lightning v0.3.0

Highlights

Store Benchmark

Maintenance and Bug fixes

Core (Store, Interfaces, etc.)

Proxy, Adapters and Algorithms

Runners, Tracers and Agents

Examples

Miscellaneous

New Contributors

Contributors

Uh oh!

Agent Lightning v0.2.2

Uh oh!

Agent Lightning v0.2.1

Bug fixes

New Features

Documentation

New Contributors

Contributors

Uh oh!

Agent Lightning v0.2.0

Core Enhancements

Documentation & Examples

Developer Experience

Ecosystem & Integrations

New Contributors

Contributors

Uh oh!

Agent Lightning v0.1.2

What's Changed

New Contributors

Contributors

Uh oh!

Agent Lightning v0.1.1

What's Changed

Uh oh!

Agent Lightning v0.1

Uh oh!