This PR leverages the parent class'create_accelerator_and_postprocess method to initialize theaccelerator correctly, without overhauling thePPOTrainer initialization flow.

Usescreate_accelerator_and_postprocess instead of manual accelerator setup.
Initialization now succeeds under specific configurations: ZeRO Stage 1 supports any--gradient_accumulation_steps, whereas ZeRO Stage 2 and 3 require --gradient_accumulation_steps == 1.

Note: With--gradient_accumulation_steps > 1, running with ZeRO stage 2 or 3 still trigger the well-known error:

[rank0]:   File "/workspace/Projects/trl/examples/scripts/ppo/ppo.py", line 163, in <module>                                                                                                                                12:44:26 [34/1998][rank0]:     trainer.train()                                                                                                                                                                                                                  [rank0]:   File "/workspace/Projects/trl/trl/trainer/ppo_trainer.py", line 668, in train                                                                                                                                                      [rank0]:     with accelerator.accumulate(model):                                                                                                                                                                                              [rank0]:   File "/usr/lib/python3.12/contextlib.py", line 137, in __enter__                                                                                                                                                                   [rank0]:     return next(self.gen)                                                                                                                                                                                                            [rank0]:            ^^^^^^^^^^^^^^                                                                                                                                                                                                            [rank0]:   File "/workspace/Projects/trl/venv/lib/python3.12/site-packages/accelerate/accelerator.py", line 1166, in accumulate                                                                                                               [rank0]:     cm_stack.enter_context(contextlib.nullcontext() if allow_gradient_sync else self.no_sync(m))                                                                                                                                     [rank0]:   File "/usr/lib/python3.12/contextlib.py", line 526, in enter_context                                                                                                                                                               [rank0]:     result = _enter(cm)                                                                                                                                                                                                              [rank0]:              ^^^^^^^^^^                                                                                                                                                                                                              [rank0]:   File "/usr/lib/python3.12/contextlib.py", line 137, in __enter__                                                                                                                                                                   [rank0]:     return next(self.gen)                                                                                                                                                                                                            [rank0]:            ^^^^^^^^^^^^^^                                                                                                                                                                                                            [rank0]:   File "/workspace/Projects/trl/venv/lib/python3.12/site-packages/accelerate/accelerator.py", line 1047, in no_sync                                                                                                                  [rank0]:     with context():                                                                                                                                                                                                                  [rank0]:   File "/usr/lib/python3.12/contextlib.py", line 137, in __enter__                                                                                                                                                                   [rank0]:     return next(self.gen)                                                                                                                                                                                                            [rank0]:            ^^^^^^^^^^^^^^                                                                                                                                                                                                            [rank0]:   File "/workspace/Projects/trl/venv/lib/python3.12/site-packages/deepspeed/runtime/engine.py", line 2243, in no_sync                                                                                                                [rank0]:     assert not self.zero_optimization_partition_gradients(), \                                                                                                                                                                       [rank0]:            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^                                                                                                                                                                          [rank0]: AssertionError: no_sync context manager is incompatible with gradient partitioning logic of ZeRO stage 3                                                                                                                             [rank1]: Traceback (most recent call last):                                                                                                                                                                                                   [rank1]:   File "/workspace/Projects/trl/examples/scripts/ppo/ppo.py", line 163, in <module>                                                                                                                                                  [rank1]:     trainer.train()[rank1]:   File "/workspace/Projects/trl/trl/trainer/ppo_trainer.py", line 668, in train[rank1]:     with accelerator.accumulate(model):[rank1]:   File "/usr/lib/python3.12/contextlib.py", line 137, in __enter__[rank1]:     return next(self.gen)[rank1]:            ^^^^^^^^^^^^^^[rank1]:   File "/workspace/Projects/trl/venv/lib/python3.12/site-packages/accelerate/accelerator.py", line 1166, in accumulate[rank1]:     cm_stack.enter_context(contextlib.nullcontext() if allow_gradient_sync else self.no_sync(m))[rank1]:   File "/usr/lib/python3.12/contextlib.py", line 526, in enter_context[rank1]:     result = _enter(cm)[rank1]:              ^^^^^^^^^^[rank1]:   File "/usr/lib/python3.12/contextlib.py", line 137, in __enter__[rank1]:     return next(self.gen)[rank1]:            ^^^^^^^^^^^^^^[rank1]:   File "/workspace/Projects/trl/venv/lib/python3.12/site-packages/accelerate/accelerator.py", line 1047, in no_sync[rank1]:     with context():[rank1]:   File "/usr/lib/python3.12/contextlib.py", line 137, in __enter__[rank1]:     return next(self.gen)[rank1]:            ^^^^^^^^^^^^^^[rank1]:   File "/workspace/Projects/trl/venv/lib/python3.12/site-packages/deepspeed/runtime/engine.py", line 2243, in no_sync[rank1]:     assert not self.zero_optimization_partition_gradients(), \[rank1]:            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^[rank1]: AssertionError: no_sync context manager is incompatible with gradient partitioning logic of ZeRO stage 3

The related issues as follows:

Request for feedback

Combine fixes?
should this PR also address theno_sync with ZeRO 2/3 compatibility (i.e. implement a workaround or guard), or…
Separate the issue
open a new issue for the ZeRO compatibility problem and keep this PR focused solely on "accelerator initialization"?

Any guidance or opinions are greatly appreciated -- thank you! 🙏

Fixes # (issue)
#2377

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read thecontributor guideline,
Pull Request section?
Was this discussed/approved via a GitHub issue? Please add a link
to it if that's the case.
Did you make sure to update the documentation with your changes?
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

ccs96307 added4 commits

July 14, 2025 12:16

correctly initialize accelerator

adc77f9

Merge branch 'main' into fix-ppo-example-accelerator-error

89198e0

run precommit

b44965a

remove unused lines

8c284e6

Labels

None yet

1 participant

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[WIP] Fix ppo example accelerator initialization error#3732

Are you sure you want to change the base?

[WIP] Fix ppo example accelerator initialization error#3732

Uh oh!

Conversation

ccs96307 commentedJul 14, 2025

What does this PR do?

Request for feedback

Before submitting

Who can review?

Uh oh!

Uh oh!