
AttentionOnnxAiLayer #27988


Open

nklskyoy wants to merge 28 commits into opencv:5.x from nklskyoy:attention-2-layer

Conversation

@nklskyoy

Implements https://onnx.ai/onnx/operators/onnx__Attention.html#attention-23

Pull Request Readiness Checklist

See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request

  • I agree to contribute to the project under Apache 2 License.
  • To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV
  • The PR is proposed to the proper branch
  • There is a reference to the original bug report and related work
  • There is accuracy test, performance test and test data in opencv_extra repository, if applicable
    Patch to opencv_extra has the same branch name.
  • The feature is well documented and sample code can be built with the project CMake
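For reference, a minimal NumPy sketch of the scaled dot-product attention that the ai.onnx Attention operator computes over 4D (batch, num_heads, seq_len, head_dim) inputs. This is an illustration only, not the PR's C++ implementation, and it omits the operator's optional inputs (masks, past key/value caches) and attributes:

```python
import numpy as np

def attention(q, k, v):
    """Minimal scaled dot-product attention over 4D
    (batch, num_heads, seq_len, head_dim) inputs.
    Illustrative sketch only; optional masks, scaling
    attributes and KV caches are omitted."""
    d = q.shape[-1]
    # (B, H, Lq, Lk) attention scores, scaled by sqrt(head_dim)
    scores = q @ k.transpose(0, 1, 3, 2) / np.sqrt(d)
    # numerically stable softmax over the key axis
    scores = scores - scores.max(axis=-1, keepdims=True)
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=-1, keepdims=True)
    # (B, H, Lq, head_dim) output
    return weights @ v

q = k = v = np.ones((1, 2, 2, 2), dtype=np.float32)
out = attention(q, k, v)
print(out.shape)  # (1, 2, 2, 2)
```

With all-ones inputs the attention weights are uniform, so the output equals the (all-ones) values, which makes the sketch easy to sanity-check by hand.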

@asmorkalov added the optimization, feature, pr: needs test (new functionality requires a minimal tests set), and category: dnn (onnx) (ONNX support issues in DNN module) labels on Nov 10, 2025
@nklskyoy (Author) commented on Nov 12, 2025 (edited)

@vpisarev @asmorkalov, I noticed some issues with graph simplification (in particular the attention subgraph; see the failing test cases).

Right now we have:

  1. the attention op from com.microsoft
  2. the attention op from ai.onnx (that's what I implemented)

So currently, the graph simplifier takes a subgraph consisting of ai.onnx nodes and simplifies it into a single com.microsoft attention operation. But at runtime, the domain_dispatch_map includes only parsers for ai.onnx layers, so the com.microsoft attention is wrongly interpreted as ai.onnx attention.

Is there some reason why the dispatch map does not include parsers for both ai.onnx and com.microsoft by default? This would fix the problem here.
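As a sketch of the idea being proposed, keying the dispatch map on a (domain, op_type) pair lets parsers for both domains be registered by default. The names below are illustrative only, not OpenCV's actual internals:

```python
# Hypothetical per-domain dispatch map; function and key names
# are assumptions for illustration, not OpenCV's real API.
def parse_ai_onnx_attention(node):
    # would build the ai.onnx-compliant layer
    return "AttentionOnnxAi"

def parse_ms_attention(node):
    # would build the com.microsoft attention layer
    return "Attention"

# ai.onnx ops use the empty domain string in ONNX node protos,
# so both domains can coexist in one map.
dispatch = {
    ("", "Attention"): parse_ai_onnx_attention,
    ("com.microsoft", "Attention"): parse_ms_attention,
}

def dispatch_node(domain, op_type):
    return dispatch[(domain, op_type)]({"domain": domain, "op": op_type})

print(dispatch_node("com.microsoft", "Attention"))  # Attention
print(dispatch_node("", "Attention"))               # AttentionOnnxAi
```

With only the ai.onnx entries registered, a com.microsoft node would fall through to the wrong parser, which matches the misinterpretation described above.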

@asmorkalov (Contributor)

Is there some reason why the dispatch map does not include parsers for both ai.onnx and com.microsoft by default? This would fix the problem here.

I do not know about any intent here. The most probable reason is that we did not have extensions before. You are welcome to fix the issue, but I propose to do it in another PR.


@nklskyoy mentioned this pull request on Nov 17, 2025
asmorkalov pushed a commit that referenced this pull request on Nov 18, 2025: Onnx importer2 dispatch map #28032 ("in the new onnx_importer all domains in the dispatch map should be included per default. See #27988 (comment)").
@nklskyoy marked this pull request as ready for review on November 30, 2025, 14:28
@nklskyoy (Author)

@vpisarev @asmorkalov, this is ready for review.

@asmorkalov removed the pr: needs test (new functionality requires a minimal tests set) label on Dec 5, 2025
@Shlok-Saxena

Hi,

I was really interested in this Attention layer implementation, so I pulled the branch to test it locally on Linux. I noticed the CI was failing and encountered a few blockers during the build and test process.

I managed to fix all of them and get the tests passing 100% locally. Here is a summary of the findings that might help unblock this PR:

  1. Test Name Mismatch: The Attention layer implementation seems to output the layer name AttentionOnnxAi, but test_graph_simplifier.cpp is still expecting Attention.

     Fix: Updated the expected string in TEST_F(Test_Graph_Simplifier, AttentionSubgraph).

  2. Missing Test Data: The tests require attention.onnx, attention_single_head.onnx, etc., which seem to be missing from the linked opencv_extra PR (or at least weren't pulled in).

     Workaround: I generated synthetic 4D ONNX models locally to verify the logic.

  3. Test Harness Input Shape Mismatch (The Crash): In test_onnx_importer.cpp, the tests were using the generic testONNXModels("attention") helper. This helper defaults to feeding one input, but the strict validation in AttentionOnnxAi::getMemoryShapes correctly demands three inputs (Query, Key, Value), causing an assertion failure: (-215:Assertion failed) nsuggestedShapes == ninputs

     Fix: I replaced the generic call with a manual test block that initializes a 4D input (1, 2, 2, 2) and feeds it explicitly to all three ports (query, key, value).
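A hypothetical Python mirror of that strict three-input check (the real AttentionOnnxAi::getMemoryShapes is C++; the function name, error text, and exact output-shape rule here are assumptions for illustration only):

```python
def get_memory_shapes(input_shapes):
    """Hypothetical sketch of a strict Q/K/V input check: the
    layer demands exactly three inputs (query, key, value), so a
    harness that feeds a single input fails before inference."""
    if len(input_shapes) != 3:
        raise ValueError(
            f"expected 3 inputs (query, key, value), "
            f"got {len(input_shapes)}")
    q_shape, k_shape, v_shape = input_shapes
    # Assumed rule: output keeps the query's batch/head/sequence
    # dims and takes the head dimension from the value tensor.
    return [q_shape[:3] + v_shape[3:]]

shape = (1, 2, 2, 2)
print(get_memory_shapes([shape, shape, shape]))  # [(1, 2, 2, 2)]
```

Feeding the same (1, 2, 2, 2) blob to all three ports, as the fix above describes, satisfies the check; a single-input call raises instead of silently misbehaving.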

With these changes, all Attention tests pass successfully on Linux.

I have a patch file ready if you'd like me to push it or share the snippets. Great work on the layer logic itself; it works perfectly once the harness is aligned.

@nklskyoy (Author) commented on Dec 14, 2025 (edited)

Hello @Shlok-Saxena, what do you mean by failing CI? Currently, all tests related to attention and dnn are passing.
Note that we have two layers: the attention layer from the com.microsoft domain (labeled Attention), while the AttentionOnnxAi layer (labeled AttentionOnnxAi) is from the ai.onnx domain.
The graph_simplifier produces the com.microsoft attention layer, so Test_Graph_Simplifier correctly expects the Attention label.
Please note that I am currently still working on this PR.

@Shlok-Saxena

Thanks for the detailed explanation, @nklskyoy!

That really clears up the architectural distinction between the com.microsoft and ai.onnx implementations; I hadn't fully realized that context.

I mainly wanted to share these logs in case there is a platform-specific quirk on Linux (I'm building on Ubuntu 22.04 / GCC 13) that might affect the CI later.

  1. Regarding the Simplifier Test: On my local build, Test_Graph_Simplifier actually failed because it did output the ai.onnx layer name instead of the expected one. It seems my environment is resolving the graph differently than yours:

     [ RUN      ] Test_Graph_Simplifier.AttentionSubgraph
     /modules/dnn/test/test_graph_simplifier.cpp:38: Failure
     Expected equality of these values:
       layers
         Which is: { "AttentionOnnxAi" }
       expected_layers
         Which is: { "Attention" }

  2. Regarding the Crash: You are totally right; the assertion failure (-215:Assertion failed) was indeed because I lacked the official test data (the .onnx files). I generated some dummy synthetic data to verify the logic, but I had to manually update the test harness to explicitly feed three inputs (Q, K, V) to get it to run without asserting.

Just wanted to document these behaviors from a fresh Linux build perspective in case it helps!


Reviewers

@asmorkalov left review comments


Labels

category: dnn (onnx), feature, optimization


3 participants

@nklskyoy, @asmorkalov, @Shlok-Saxena
