Following discussion in #2678, this PR introduces an example in which the outputs of three compute tiles are joined in a mem tile before the final 48-element-wide tensor (i32) is sent to external memory.

In this example, two iterations of the join pattern are required to move the 48-element-wide output tensor from the NPU to external memory, since each iteration joins three 8-element pieces into a 24-element buffer. Combined with the toStream data layout transformation on the 48-element-wide data, the following BD programming is required:

    %memtile_dma_0_1 = aie.memtile_dma(%mem_tile_0_1) {
      %0 = aie.dma_start(MM2S, 0, ^bb1, ^bb6)
    ^bb1:  // 2 preds: ^bb0, ^bb3
      aie.use_lock(%out_cons_lock_2, AcquireGreaterEqual, 1)
      aie.dma_bd(%out_buff_0 : memref<24xi32>, 0, 24, [<size = 8, stride = 1>, <size = 3, stride = 8>])
      aie.use_lock(%out_prod_lock_2, Release, 1)
      aie.next_bd ^bb2
    ^bb2:  // pred: ^bb1
      aie.use_lock(%out_cons_lock_1, AcquireGreaterEqual, 1)
      aie.dma_bd(%out_buff_0 : memref<24xi32>, 0, 0, [<size = 8, stride = 1>, <size = 3, stride = 8>])
      aie.use_lock(%out_prod_lock_1, Release, 1)
      aie.next_bd ^bb3
    ^bb3:  // pred: ^bb2
      aie.use_lock(%out_cons_lock_0, AcquireGreaterEqual, 1)
      aie.dma_bd(%out_buff_0 : memref<24xi32>, 0, 0, [<size = 8, stride = 1>, <size = 3, stride = 8>])
      aie.use_lock(%out_prod_lock_0, Release, 1)
      aie.next_bd ^bb1
    ^bb6:  // pred: ^bb0
      %1 = aie.dma_start(S2MM, 0, ^bb7, ^bb8)
    ^bb7:  // 2 preds: ^bb6, ^bb7
      aie.use_lock(%out_prod_lock_0, AcquireGreaterEqual, 1)
      aie.dma_bd(%out_buff_0 : memref<24xi32>, 0, 8)
      aie.use_lock(%out_cons_lock_0, Release, 1)
      aie.next_bd ^bb7
    ^bb8:  // pred: ^bb6
      %2 = aie.dma_start(S2MM, 1, ^bb9, ^bb10)
    ^bb9:  // 2 preds: ^bb8, ^bb9
      aie.use_lock(%out_prod_lock_1, AcquireGreaterEqual, 1)
      aie.dma_bd(%out_buff_0 : memref<24xi32>, 8, 8)
      aie.use_lock(%out_cons_lock_1, Release, 1)
      aie.next_bd ^bb9
    ^bb10:  // pred: ^bb8
      %3 = aie.dma_start(S2MM, 2, ^bb11, ^bb12)
    ^bb11:  // 2 preds: ^bb10, ^bb11
      aie.use_lock(%out_prod_lock_2, AcquireGreaterEqual, 1)
      aie.dma_bd(%out_buff_0 : memref<24xi32>, 16, 8)
      aie.use_lock(%out_cons_lock_2, Release, 1)
      aie.next_bd ^bb11
    ^bb12:  // pred: ^bb10
      aie.end
    }
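To make the size/stride pairs on the MM2S BDs easier to read, here is a minimal Python sketch that enumerates the element offsets such a list would visit in the 24-element buffer. It assumes the pairs are ordered outermost-first (the convention I believe the mlir-aie programming guide uses); `access_order` is a hypothetical helper written for illustration, not part of mlir-aie.

```python
from itertools import product


def access_order(dims, offset=0):
    """Return the element offsets visited by a list of (size, stride) pairs.

    Assumption: the first pair is the outermost loop and the last pair is
    the innermost loop.
    """
    sizes = [size for size, _ in dims]
    strides = [stride for _, stride in dims]
    # itertools.product varies the last index fastest, which matches an
    # innermost-last loop nest.
    return [
        offset + sum(i * s for i, s in zip(idx, strides))
        for idx in product(*(range(n) for n in sizes))
    ]


# [<size = 8, stride = 1>, <size = 3, stride = 8>] from the MM2S BDs
order = access_order([(8, 1), (3, 8)])
print(order)
```

Under that assumption the read order starts 0, 8, 16, 1, 9, 17, so consecutive stream elements come from the three 8-element regions in turn, and all 24 elements are visited exactly once.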

The objectfifo lowering for a join currently only works at the granularity of the smaller tensors, and thus cannot apply the data layout transformation to the final output tensor. This PR enhances the lowering so that the pattern above is produced instead. The same applies to the distribute pattern when the fromStream data layout transformation is used on the input objectfifo.
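The granularity point can be illustrated with a small Python model of the join. This is only a sketch under stated assumptions: three workers each write an 8-element piece at offsets 0, 8 and 16 of a 24-element buffer (as the S2MM BDs do), and the layout transformation is then applied to the whole joined buffer with the outermost-first reading of `[<size = 8, stride = 1>, <size = 3, stride = 8>]`. All names and the sample data are hypothetical.

```python
N_WORKERS = 3
PIECE = 8                      # elements produced per worker per iteration
TOTAL = 48                     # width of the final output tensor
JOINED = N_WORKERS * PIECE     # 24 elements held in the mem tile buffer
ITERATIONS = TOTAL // JOINED   # 2 iterations of the join pattern


def join_iteration(pieces):
    """Write each worker's piece at its offset, as the three S2MM BDs do."""
    buf = [None] * JOINED
    for worker, piece in enumerate(pieces):
        off = PIECE * worker
        buf[off:off + PIECE] = piece
    return buf


def to_stream(buf):
    """Read the joined buffer: outer size 8 / stride 1, inner size 3 / stride 8."""
    return [buf[i * 1 + j * 8] for i in range(8) for j in range(3)]


out = []
for it in range(ITERATIONS):
    # Made-up data: value encodes iteration, worker and element index.
    pieces = [[100 * it + 10 * w + k for k in range(PIECE)]
              for w in range(N_WORKERS)]
    out.extend(to_stream(join_iteration(pieces)))

print(len(out), out[:6])
```

Because the transformation spans all 24 elements, it cannot be expressed by transforming each 8-element piece independently, which is the limitation the paragraph above describes.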

TODO:

- comment and clean up code in objectfifo lowering
- debug distribute with fromStream on input objectfifo
- add checks for AIE2 architecture: multiple acq/rel ops should not be allowed in the same BD
- add documentation
- add MLIR examples

Example showcasing limitation of multiple acq/rel ops in one DMA BD

78bf61b

Contributor

github-actions bot commented Nov 12, 2025 (edited)

Coverage Report

Created: 2025-11-22 08:29

Click here for information about interpreting this report.

Filename	Function Coverage	Line Coverage	Region Coverage	Branch Coverage
home/runner/work/mlir-aie/mlir-aie/lib/Dialect/AIE/Transforms/AIEObjectFifoStatefulTransform.cpp	100.00%	94.47%	92.22%	86.70%
Totals	100.00%	94.47%	92.22%	86.70%

Generated by llvm-cov -- llvm version 18.1.3

Collaborator

fifield commented Nov 12, 2025

Following discussion in #2678 this PR introduces an example which tests whether multiple locks can be acquired and released in a single DMA BD.

Maybe it's obvious, but the hardware does not support this. It should be an error.
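As a rough illustration of what such an error check could look for, the Python sketch below scans MLIR text and reports BD blocks that contain more than one acquire or more than one release. It is a text-level approximation only: the function name, the sample snippet and its lock and buffer names are hypothetical, and a real check would more likely live in the dialect's verifier than in a regex.

```python
import re


def blocks_with_multiple_locks(mlir_text):
    """Return labels of BD blocks that acquire or release more than one lock."""
    offenders = []
    # Split at block labels such as "^bb1:" and keep the labels.
    # Result: [preamble, label1, body1, label2, body2, ...]
    parts = re.split(r"(\^bb\d+):", mlir_text)
    for label, body in zip(parts[1::2], parts[2::2]):
        if "aie.dma_bd" not in body:
            continue  # not a buffer descriptor block
        acquires = len(re.findall(r"aie\.use_lock\([^)]*Acquire", body))
        releases = len(re.findall(r"aie\.use_lock\([^)]*Release", body))
        if acquires > 1 or releases > 1:
            offenders.append(label)
    return offenders


# Hypothetical input: ^bb1 acquires two locks in one BD, ^bb2 is well formed.
sample = """
^bb1:
  aie.use_lock(%lock_a, AcquireGreaterEqual, 1)
  aie.use_lock(%lock_b, AcquireGreaterEqual, 1)
  aie.dma_bd(%buf : memref<24xi32>, 0, 24)
  aie.use_lock(%lock_c, Release, 1)
  aie.next_bd ^bb2
^bb2:
  aie.use_lock(%lock_c, AcquireGreaterEqual, 1)
  aie.dma_bd(%buf : memref<24xi32>, 0, 24)
  aie.use_lock(%lock_a, Release, 1)
  aie.next_bd ^bb1
"""

offenders = blocks_with_multiple_locks(sample)
print(offenders)
```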

abisca and others added 6 commits

November 12, 2025 23:39

Working BD schedule with dummy BDs to set the lock pairs

9fae57f

Example runs for two iterations, with different data each one.

da3d89f

Fix test for multiple iterations

9165b9b

Update programming_guide/section-2/section-2f/06_data_layout_transfor…

2a6a1c6

…mations/test.cpp
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

Update programming_guide/section-2/section-2f/06_data_layout_transfor…

9edfd1a

…mations/test.cpp
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

Merge branch 'main' into of-dims

8b826b8

AndraBisca changed the title ~~Limitation of multiple acq/rel ops in one DMA BD~~ Support data layout transformation on objectfifo join output

Nov 17, 2025

AndraBisca changed the title ~~Support data layout transformation on objectfifo join output~~ Support data layout transformations on objectfifo join output

Nov 17, 2025

AndraBisca changed the title ~~Support data layout transformations on objectfifo join output~~ Data layout transformations on objectfifo join/split output/input

Nov 18, 2025

abisca added 5 commits

November 22, 2025 00:08

Add to_stream design to section-2c

544d4ac

Merge branch 'of-dims' of https://github.com/Xilinx/mlir-aie into of-…

03cb0bf

…dims

Test to_stream with double buffering

81adfbb

Revert to single buffer as it showcases error better

2f69ebb

Test distribute with from_stream on input objectfifo

49ad68e

github-actions bot reviewed

Nov 24, 2025

programming_guide/section-2/section-2c/from_stream_transformations/test.cpp

		if (j >= OUT_HEIGHT / 2)
		ref++;
		if (*(bufOut + i + OUT_WIDTH * j) != ref) {
		std::cout << "Error in output " << i + OUT_WIDTH * j << ": " << *(bufOut + i + OUT_WIDTH * j) << " != " << ref

Contributor

github-actions bot, Nov 24, 2025

[clang-format] reported by reviewdog 🐶

Suggested change

	- std::cout << "Error in output " << i + OUT_WIDTH * j << ": " << *(bufOut + i + OUT_WIDTH * j) << " != " << ref
	+ std::cout << "Error in output " << i + OUT_WIDTH * j << ": "
	+           << *(bufOut + i + OUT_WIDTH * j) << " != " << ref

programming_guide/section-2/section-2c/from_stream_transformations/test.cpp

		<< std::endl;
		errors++;
		} else {
		std::cout << "Correct output " << i + OUT_WIDTH * j << ": " << *(bufOut + i + OUT_WIDTH * j) << " == " << ref

Contributor

github-actions bot, Nov 24, 2025

[clang-format] reported by reviewdog 🐶

Suggested change

	- std::cout << "Correct output " << i + OUT_WIDTH * j << ": " << *(bufOut + i + OUT_WIDTH * j) << " == " << ref
	+ std::cout << "Correct output " << i + OUT_WIDTH * j << ": "
	+           << *(bufOut + i + OUT_WIDTH * j) << " == " << ref

github-actions bot reviewed

Nov 24, 2025

programming_guide/section-2/section-2c/from_stream_transformations/from_stream.py

		# Input
		of_offsets = [8 * worker for worker in range(n_workers)]

		of_in = ObjectFifo(tile24_ty, depth=depth, name="in", dims_from_stream_per_cons=[(8, 3), (3, 1)])

Contributor

github-actions bot, Nov 24, 2025

[black] reported by reviewdog 🐶

Suggested change

	- of_in = ObjectFifo(tile24_ty, depth=depth, name="in", dims_from_stream_per_cons=[(8, 3), (3, 1)])
	+ of_in = ObjectFifo(
	+     tile24_ty, depth=depth, name="in", dims_from_stream_per_cons=[(8, 3), (3, 1)]
	+ )

Reviewers

github-actions[bot] left review comments

The following code owners are awaiting a requested review; each will be requested when the pull request is marked ready for review:

denolf
jgmelber
jackl-xilinx
andrej
hunhoffe
stephenneuendorffer
fifield
erwei-xilinx

Labels

None yet

Data layout transformations on objectfifo join/split output/input #2706

4 participants
