This repository was archived by the owner on Mar 21, 2024. It is now read-only.

NVIDIA/cubPublic archive

NotificationsYou must be signed in to change notification settings
Fork457
Star1.8k

Port adjacent difference into CUB#331

Merged

gevtushenko merged 1 commit intoNVIDIA:mainfromgevtushenko:main-feature/github/cub_adjacent_difference

Dec 14, 2021

Merged

Port adjacent difference into CUB#331

gevtushenko merged 1 commit intoNVIDIA:mainfromgevtushenko:main-feature/github/cub_adjacent_difference

Dec 14, 2021

Conversation

Copy link

Collaborator

gevtushenko commentedJun 29, 2021

The PR contains:

a port ofthrust::adjacent_difference algorithm;
deprecation ofFlagHeads andFlagTails methods fromBlockAdjacentDifference structure;
fixed API forBlockAdjacentDifference along with documentation and tests.

Note that the PR is based on some of the features introduced inMergeSort porting.

gevtushenko assignedalliepiper

Jun 29, 2021

gevtushenko force-pushed themain-feature/github/cub_adjacent_difference branch from10db20a to891c10cCompare

June 29, 2021 15:26

alliepiper suggested changes

Jul 22, 2021

View reviewed changes

Copy link

Collaborator

alliepiper left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

This is a great first pass 👍 Let me know when you're ready for a re-review.

cub/agent/agent_merge_sort.cuh OutdatedShow resolvedHide resolved

cub/block/block_adjacent_difference.cuh OutdatedShow resolvedHide resolved

cub/block/block_adjacent_difference.cuhShow resolvedHide resolved

cub/block/block_adjacent_difference.cuh OutdatedShow resolvedHide resolved

test/test_block_adjacent_difference.cu OutdatedShow resolvedHide resolved

cub/block/block_adjacent_difference.cuhShow resolvedHide resolved

test/test_block_adjacent_difference.cu OutdatedShow resolvedHide resolved

alliepiper added this to the1.14.0 milestone

Jul 22, 2021

alliepiper assignedgevtushenko and unassignedalliepiper

Jul 22, 2021

alliepiper modified the milestones:1.14.0,1.15.0

Aug 17, 2021

alliepiper added the P1: should haveNecessary, but not critical. label

Aug 17, 2021

gevtushenko force-pushed themain-feature/github/cub_adjacent_difference branch 2 times, most recently from3c13360 to53ffc37Compare

August 23, 2021 09:27

gevtushenko force-pushed themain-feature/github/cub_adjacent_difference branch 2 times, most recently from370bee7 to88a4ae8Compare

August 27, 2021 15:27

alliepiper assignedalliepiper and unassignedgevtushenko

Sep 21, 2021

alliepiper added type: enhancement

New feature or request.

P2: nice to haveDesired, but not necessary. and removed P1: should haveNecessary, but not critical. labels

Oct 14, 2021

alliepiper modified the milestones:1.15.0,1.16.0

Oct 14, 2021

alliepiper approved these changes

Nov 30, 2021

View reviewed changes

Copy link

Collaborator

alliepiper left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

LGTM -- just some comments on documentation, otherwise this is ready to start testing 👍

cub/block/block_adjacent_difference.cuh

		* `{ ...3], [4,2,1,1], [1,1,1,1], [2,3,3,3], [3,4,1,4] }`.
		* and that `valid_items` is `507`. The corresponding output `result` in
		* those threads will be
		* `{ ..., [-1,2,1,0], [0,0,0,-1], [-1,0,3,3], [3,4,1,4] }`.

Copy link

Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

(continuing from#331 (comment))

Gotcha, so the last 5 values are unchanged random values that are ignored by the algorithm.

There's another convention that uses- to stand in for such values, e.g.

cub/cub/device/device_histogram.cuh

Lines 181 to 182 ind6ba3cf

	* float* d_samples; // e.g., [2.2, 6.1, 7.1, 2.9, 3.5, -, -,
	* // 0.3, 2.9, 2.1, 6.1, 999.5, -, -]

I think it's a little easier to parse with the non-numeric characters, since it's immediately clear that theycan't be involved in the calculation. If you agree, let's change this to

* `{ ...3], [4,2,1,1], [1,1,1,1], [2,3,3,-], [-,-,-,-] }`.* `{ ..., [-1,2,1,0], [0,0,0,-1], [-1,0,3,-], [-,-,-,-] }`.

Copy link

CollaboratorAuthor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Gotcha, so the last 5 values are unchanged random values that are ignored by the algorithm.

These values aren't ignored by the algorithm:

As the documentation states, the input value is copied without modifications if the neighbour value is out of bounds.

Ifinput andoutput parameters point to different memory locations, these values will be copied without modification. I'm afraid that non-numeric characters would mean that the output values are unchanged at all, for example:

input { 1, 2, 3 };output { 4, 5, 6 };valid_items = 0;adjacent_difference(input, output, valid_items);// output should be { 1, 2, 3 } and not { 4, 5, 6 }

The same withstd::adjacent_differencedocumentation. The output for:

std::vector v {2, 4, 6, 8, 10, 12, 14, 16, 18, 20};

2 2 2 2 2 2 2 2 2 2

rather than:

- 2 2 2 2 2 2 2 2 2

Do you think it'll be clear from the- notation that the values are copied?

Copy link

Collaborator

alliepiperDec 1, 2021

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

That's a good point -- I'm fine leaving this as-is.

cub/block/block_adjacent_difference.cuh Outdated

		* __global__ void ExampleKernel(...)
		* {
		* // Specialize BlockAdjacentDifference for a 1D block of
		* // 128 threads on type int

Copy link

Collaborator

alliepiperNov 30, 2021

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

s/on/of/

(this is repeated in a few other docstrings)

Copy link

CollaboratorAuthor

gevtushenkoNov 30, 2021

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

There are quite a few places with this typo:

rg "threads on type" | wc -l33

I'll fix this in theblock_scan,block_discontinuity andblock_reduce documentation as well.

alliepiper assignedgevtushenko and unassignedalliepiper

Nov 30, 2021

gevtushenko force-pushed themain-feature/github/cub_adjacent_difference branch from28c79e4 toa64d24aCompare

November 30, 2021 21:05

Add device adjacent difference

6d052db

gevtushenko force-pushed themain-feature/github/cub_adjacent_difference branch fromf83d279 to6d052dbCompare

December 11, 2021 11:13

gevtushenko added testing: gpuCI in progress

Started gpuCI testing.

testing: internal ci in progressCurrently testing on internal NVIDIA CI (DVS). labels

Dec 11, 2021

Copy link

CollaboratorAuthor

gevtushenko commentedDec 11, 2021

gpuCI:NVIDIA/thrust/pull/1577
DVS: 30766806

gevtushenko removed the testing: gpuCI in progressStarted gpuCI testing. label

Dec 12, 2021

gevtushenko added testing: gpuCI passed

Passed gpuCI testing.

testing: internal ci passed

Passed internal NVIDIA CI (DVS).

release: breaking changeInclude in "Breaking Changes" section of release notes. and removed testing: internal ci in progressCurrently testing on internal NVIDIA CI (DVS). labels

Dec 12, 2021

gevtushenko merged commit722e3ca intoNVIDIA:main

Dec 14, 2021