DNN/CUDA: Solve the bug of same shape broadcast with CUDA #23560
WanliZhong commented Apr 28, 2023 • edited
@peters Hi, could you help test whether this PR runs yolov7-tiny correctly, as mentioned in e3e1f70#commitcomment-110475963? Thanks!
DNN/CUDA: make 'abcd op 1b11' broadcast eltwise operator support cuda
peters commented Apr 28, 2023
@WanliZhong This PR fixes the yolov7-tiny inference problem. All tests are now green on our side! Thank you :)
WanliZhong commented Apr 28, 2023
@peters That's good! Thanks!
zihaomu left a comment
LGTM! 👍
WanliZhong commented May 11, 2023
Test results after merging PR #23557 and this PR
opencv-alalek commented May 12, 2023
Let's re-enable the disabled tests. They are fine on the CPU target and pass on the CUDA target (but they are slow in the current implementation).
On the CUDA target they could be skipped (if performance is critical) in this way:
or
To manually run "skipped" tests by tags, add this option:
WanliZhong commented May 12, 2023
Thanks @opencv-alalek! I will enable it.
asmorkalov commented May 16, 2023
@opencv-alalek All tests are green. Should I merge with the brute-force tests enabled or disabled?
    int target = get<1>(backend_target);
    if (backend == DNN_BACKEND_CUDA && dim > 4)
        applyTestTag(CV_TEST_TAG_LONG);
opencv-alalek May 16, 2023 • edited
BTW, it looks like no tests are skipped on CI.
CV_TEST_TAG_VERYLONG tests are skipped by default. CV_TEST_TAG_LONG tests should be skipped on embedded platforms or in coverage/valgrind build configurations (through an extra parameter on CI).
OK, let's run all CUDA tests.
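The skipping policy discussed in this thread can be modeled in a few lines. The sketch below is only an illustration of the rule as stated in the comments (very long tests are skipped by default, long tests are skipped on embedded platforms or coverage/valgrind builds, and a skipped tag can be re-enabled manually). `Tag`, `RunConfig`, and `shouldSkip` are hypothetical names invented for this example; they are not part of OpenCV's test framework, whose real mechanism is `applyTestTag` plus command-line options.

```cpp
#include <set>

// Hypothetical stand-ins for the test tags; not OpenCV's real definitions.
enum class Tag { Long, VeryLong };

// Hypothetical description of the environment a test run happens in.
struct RunConfig {
    bool embedded = false;            // running on an embedded platform
    bool coverageOrValgrind = false;  // coverage or valgrind build configuration
    std::set<Tag> forceEnabled;       // tags the user explicitly re-enabled
};

// Returns true if a test carrying `tags` would be skipped under `cfg`,
// following the policy described in the review comments (an assumption,
// not a copy of the framework's logic).
bool shouldSkip(const std::set<Tag>& tags, const RunConfig& cfg) {
    for (Tag t : tags) {
        if (cfg.forceEnabled.count(t)) continue;  // manual override wins
        if (t == Tag::VeryLong) return true;      // skipped by default
        if (t == Tag::Long && (cfg.embedded || cfg.coverageOrValgrind))
            return true;                          // skipped only on slow setups
    }
    return false;
}
```

Under this model a test tagged only as long still runs on an ordinary CI machine, which is consistent with the observation that no tests were skipped on CI.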


Merged after #23557
Fixes the YOLO regression error mentioned in e3e1f70#commitcomment-110475963
After this fix, the brute-force tests mentioned in #23556 all pass with CUDA (broadcasts between different shapes fall back to the CPU).
The test is added in PR #23557
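To make the fallback rule concrete, here is a minimal sketch of how an operand pair might be classified. It assumes, based only on the commit title ('abcd op 1b11') and the description above, that identical shapes and the per-channel pattern stay on CUDA while every other broadcast falls back to the CPU. `Shape`, `Target`, `isPerChannelBroadcast`, and `chooseTarget` are hypothetical names for this example and do not reflect the actual code in the PR.

```cpp
#include <cstddef>
#include <vector>

using Shape = std::vector<int>;

// Hypothetical result of the dispatch decision.
enum class Target { Cuda, CpuFallback };

// True if `b` looks like "1b11" relative to `a` ("abcd"): same rank (>= 2),
// every axis of `b` is 1 except axis 1, which must match axis 1 of `a`.
bool isPerChannelBroadcast(const Shape& a, const Shape& b) {
    if (a.size() != b.size() || a.size() < 2) return false;
    for (std::size_t i = 0; i < a.size(); ++i) {
        if (i == 1) {
            if (b[i] != a[i]) return false;
        } else if (b[i] != 1) {
            return false;
        }
    }
    return true;
}

// Assumed rule: same shape or per-channel broadcast runs on CUDA;
// any other shape combination falls back to the CPU.
Target chooseTarget(const Shape& a, const Shape& b) {
    if (a == b) return Target::Cuda;
    if (isPerChannelBroadcast(a, b) || isPerChannelBroadcast(b, a))
        return Target::Cuda;
    return Target::CpuFallback;
}
```

For example, under these assumptions `chooseTarget({2, 3, 4, 5}, {1, 3, 1, 1})` selects CUDA, while `chooseTarget({2, 3, 4, 5}, {2, 1, 4, 5})` selects the CPU fallback.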
Pull Request Readiness Checklist
See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request
Patch to opencv_extra has the same branch name.