This repository was archived by the owner on Mar 21, 2024. It is now read-only.
- Notifications
You must be signed in to change notification settings - Fork457
Pull requests: NVIDIA/cub
Author
Uh oh!
There was an error while loading.Please reload this page.
Label
Uh oh!
There was an error while loading.Please reload this page.
Projects
Uh oh!
There was an error while loading.Please reload this page.
Milestones
Uh oh!
There was an error while loading.Please reload this page.
Reviews
Assignee
Assigned to nobodyLoading
Uh oh!
There was an error while loading.Please reload this page.
Sort
Pull requests list
Draft of segmented reduce optimization P2: nice to haveDesired, but not necessary.
#578 openedSep 30, 2022 bygevtushenkoLoading…
Wrap launch bounds testing: gpuCI in progressStarted gpuCI testing. type: bug: compilerBug in a compiler, not this library.
add support FutureValue for reduce P2: nice to haveDesired, but not necessary. type: enhancementNew feature or request.
[WIP] Allow cub::DeviceRadixSort and cub::DeviceSegmentedRadixSort to use iterator as input helps: pytorchHelps or needed by PyTorch. P3: backlogUnprioritized
Add assignment operator to the TestBar test util class. P2: nice to haveDesired, but not necessary. triageNeeds investigation and classification.
fix 'invalid arguments' warp sync error on Volta info neededCannot make progress without more information. P1: should haveNecessary, but not critical. repro: missingMissing a complete example that reproduces the issue. type: bug: functionalDoes not work as intended.
ProTip! Updated in the last three days:updated:>2025-07-01.