ProcessGroupGloo: fix CUDA tensor stream handling with futures #170812
pytorch-bot bot commented Dec 18, 2025 • edited
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/170812
Note: Links to docs will display an error until the docs builds have been completed.
⏳ No Failures, 1 Pending
As of commit 41bc8d9 with merge base 1984725.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
jeffdaily left a comment
Confirmed, fixes #155714.
ec983ad to 41bc8d9
d4l3k commented Dec 18, 2025
@pytorchbot merge
pytorchmergebot commented Dec 18, 2025
Merge started
Your change will be merged once all checks pass (ETA 0-4 Hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team
pytorchmergebot commented Dec 19, 2025
Merge failed
Reason: 1 jobs have failed, first few of them are: trunk / win-vs2022-cpu-py3 / build
Details for Dev Infra team: Raised by workflow job
Fixes #155714
There's a very subtle bug in Gloo where CUDA future streams aren't preserved correctly, leading to silent corruption when Gloo is used with a CUDA model and the DDP reducer.
Test plan: