[WIP] IRON host runtime abstraction #2737


Draft

hunhoffe wants to merge 21 commits into main from iron_runtime

Conversation

@hunhoffe (Collaborator) commented Nov 25, 2025 (edited)

This PR begins the process of upstreaming the work others (mostly @pvasireddy-amd and @andrej, I think) did in amd/IRON to create a class that manages XRT kernels with a cache of pre-loaded kernels (https://github.com/amd/IRON/blob/devel/applications/llama_3.2_1b/src/aie_device_manager.py).

The primary goals of this work are to:

  1. Deduplicate logic between the JIT runtime code (e.g., the NPUKernel class in IRON), the XRT helper code (e.g., the AIE_Application class in the aie.utils.xrt module), and the amd/IRON runtime code (e.g., the AIEDeviceManager class in https://github.com/amd/IRON/blob/devel/applications/llama_3.2_1b/src/aie_device_manager.py). This will help maintainability and ensure that efficiency improvements (e.g., in pre-loading, buffer handling, etc.) are consistently applied.
  2. Continue abstracting the specifics of XRT away from the conceptual use of a runtime. This is a follow-on to a previous PR or two on iron.Tensors.

What this PR does not do:

  • I do not expect this PR to introduce a fully complete runtime-management solution; it is an incremental step toward consolidating different code bases with different functionalities. I anticipate further fine-tuning after this PR.

Note: this PR is not yet ready for review.
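To make the deduplication goal concrete, here is a minimal sketch of the kind of kernel-caching runtime helper the PR description alludes to, in the spirit of amd/IRON's AIEDeviceManager. All names here (KernelCache, load_fn) are illustrative assumptions for discussion, not the API this PR introduces.

```python
# Hypothetical sketch of a host-runtime kernel cache. The injected
# load_fn stands in for the real XRT xclbin/kernel loading path.
class KernelCache:
    """Cache loaded kernels so repeated lookups skip reloading."""

    def __init__(self, load_fn):
        self._load_fn = load_fn  # assumption: wraps the actual XRT load
        self._cache = {}
        self.loads = 0  # number of actual (non-cached) loads performed

    def get(self, xclbin_path):
        """Return the kernel for xclbin_path, loading it at most once."""
        if xclbin_path not in self._cache:
            self._cache[xclbin_path] = self._load_fn(xclbin_path)
            self.loads += 1
        return self._cache[xclbin_path]
```

The point of consolidating on one such class is that pre-loading and buffer-handling improvements land in a single place instead of three.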

@hunhoffe changed the title from [WIP] IRON library runtime object to [WIP] IRON host runtime abstraction on Nov 25, 2025
@fifield (Collaborator)

This is great. I'm wondering about the layering. Could non-IRON mlir-aie users like mlir-air use this without importing all of iron?

Comment on lines +40 to +59
if any(
    keyword in device_type_str
    for keyword in [
        "NPU Strix",
        "NPU Strix Halo",
        "NPU Krackan",
        "RyzenAI-npu4",
        "RyzenAI-npu6",
    ]
):
    self._device_type = NPU2
elif any(
    keyword in device_type_str
    for keyword in [
        "NPU",
        "NPU Phoenix",
        "RyzenAI-npu1",
    ]
):
    self._device_type = NPU1
Collaborator

Maybe have this as a stand-alone function?
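For illustration, the string matching from the diff could be lifted into a stand-alone function along these lines. The keyword lists are copied from the PR; NPU1/NPU2 are placeholders here for the actual device types, and the function name is a hypothetical suggestion.

```python
# Sketch of the suggested stand-alone helper. Note the branch order
# matters: the generic "NPU" keyword in the NPU1 list would also match
# the NPU2 device-name strings, so NPU2 must be checked first.
NPU1, NPU2 = "NPU1", "NPU2"  # placeholders for the real device types

_NPU2_KEYWORDS = ("NPU Strix", "NPU Strix Halo", "NPU Krackan",
                  "RyzenAI-npu4", "RyzenAI-npu6")
_NPU1_KEYWORDS = ("NPU", "NPU Phoenix", "RyzenAI-npu1")

def detect_device_type(device_type_str: str):
    """Map an XRT device-name string to a device type, or None."""
    if any(k in device_type_str for k in _NPU2_KEYWORDS):
        return NPU2
    if any(k in device_type_str for k in _NPU1_KEYWORDS):
        return NPU1
    return None
```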

Collaborator (Author)

I believe this function will go away before this PR is finished because this is information we can fetch from the compiler about the device/target model.

I think the enum AIEArch is the key, but I haven't fully finished the implementation of this yet. Thoughts, @fifield?

Collaborator

> I believe this function will go away before this PR is finished because this is information we can fetch from the compiler about the device/target model.
>
> I think the enum AIEArch is the key, but I haven't fully finished the implementation of this yet. Thoughts, @fifield?

You could certainly use AIEArch or AIEDevice enums here. I don't have much else to add other than that this basic logic is already in iron device.py and in aie_lit_utils. Hopefully the string names of things are stable by now, but the fewer places it's parsed the better.

Collaborator

How does the compiler know the device, esp. if there is no device present?

Collaborator (Author)

At the level of an IRON program, you can instantiate a device object in Python based on the aie C++ target model code and use it for generating the MLIR. My current understanding is that the device type actually needs to be present for the MLIR generation, not just the compilation process; aiecc just parses the device name from the mlir-aie device operation. So, for the compiler, the device type is embedded in the input file and not something parsed from the system.

I haven't fully untangled how I want to set defaults in the JIT infrastructure yet, as this is still a work in progress and I am not yet introducing "Compilable" or "Runnable" abstractions (which will eventually fit beneath the JIT frontend).

I'm open to suggestions!

Collaborator

OK, that makes sense.

device_type_str = self._device.get_info(pyxrt.xrt_info_device.name)

# Fetch the device type by matching strings for NPU2 or NPU1
# TODO: how to use only a portion of the device rather than whole array?
Collaborator

You could return a tuple of device and max columns to let users know what the limit is.

@hunhoffe (Collaborator, Author) commented Nov 25, 2025 (edited)

In my mind, you can ask the device object returned by runtime.device() for things like columns and rows, e.g.,

def cols(self) -> int:
So maybe this doesn't need to be a TODO. Thoughts?
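To illustrate the accessor idea above, here is a hypothetical shape for the device object returned by runtime.device(): callers query geometry from it instead of receiving a (device, max_columns) tuple. The class name, fields, and row count are assumptions for discussion; the 4-column figure for npu1 matches the target-model output quoted later in this thread.

```python
# Hypothetical device-info object; not the merged API.
from dataclasses import dataclass

@dataclass(frozen=True)
class DeviceInfo:
    name: str
    _cols: int
    _rows: int  # illustrative; real values come from the target model

    def cols(self) -> int:
        """Number of columns available on this device."""
        return self._cols

    def rows(self) -> int:
        """Number of rows (shim + mem + compute) on this device."""
        return self._rows

# Example instance with an assumed row count:
npu1 = DeviceInfo("npu1", _cols=4, _rows=6)
```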

Collaborator

Do I need to instantiate XRT to ask that?

Collaborator (Author)

The device target model is defined in the aiecc compiler code, I believe. I do not think it is dependent on XRT, although I have never tried it -- if you have a different experience, let me know!

Collaborator

I'm not sure. Right now, I have hardcoded the number of columns based on the device type, but I'd prefer it if IRON had that knowledge.

@fifield (Collaborator) commented Nov 26, 2025 (edited)

The target model bindings only depend on the dialect bindings, not xrt.

>>> from aie.dialects.aie import get_target_model, AIEDevice
>>> get_target_model(AIEDevice.npu1).columns()
4
>>> get_target_model(AIEDevice.npu1_1col).columns()
1

@hunhoffe (Collaborator, Author)

@fifield good question. I think it's tied to IRON unless we moved some of this logic into pyxrt. However, installing the mlir-aie wheels is maybe not too bad anymore?


@hunhoffe (Collaborator, Author)

If I don't want to accidentally destroy the tracing utils with my consolidation, I think I need to do this PR first: #2743


Reviewers

@fifield and @ypapadop-amd left review comments.

Reviews from code owners @stephenneuendorffer, @jgmelber, @jackl-xilinx, @AndraBisca, @andrej, @pvasireddy-amd, and @denolf will be requested when the pull request is marked ready for review.


4 participants: @hunhoffe, @fifield, @ypapadop-amd
