This repository was archived by the owner on Mar 21, 2024. It is now read-only.
Pull requests: NVIDIA/cub
Draft of segmented reduce optimization
  Labels: P2: nice to have (Desired, but not necessary.)
  #578 opened Sep 30, 2022 by gevtushenko

Wrap launch bounds
  Labels: testing: gpuCI in progress (Started gpuCI testing.); type: bug: compiler (Bug in a compiler, not this library.)

add support FutureValue for reduce
  Labels: P2: nice to have (Desired, but not necessary.); type: enhancement (New feature or request.)

[WIP] Allow cub::DeviceRadixSort and cub::DeviceSegmentedRadixSort to use iterator as input
  Labels: helps: pytorch (Helps or needed by PyTorch.); P3: backlog (Unprioritized)

Add assignment operator to the TestBar test util class.
  Labels: P2: nice to have (Desired, but not necessary.); triage (Needs investigation and classification.)

fix 'invalid arguments' warp sync error on Volta
  Labels: info needed (Cannot make progress without more information.); P1: should have (Necessary, but not critical.); repro: missing (Missing a complete example that reproduces the issue.); type: bug: functional (Does not work as intended.)