8-bit quantization in dnn module and int8 layers #20228
Conversation
jebastin-nadar commented Jun 9, 2021
@vpisarev MaxPool int8 tests pass now. Tests with maxpooling as the last layer are commented out, as they compute max indices, which is not supported in the int8 version. Although I still don't get why maxpooling as the last layer should compute the max index as well by default.
vpisarev commented Jun 10, 2021 • edited
I actually suggest supporting the computation of indices (in FP32, as before) together with computing the max value in INT8. Yes, it will be slower, but it will let us provide 100% compatibility.
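A minimal NumPy sketch of that hybrid scheme (a hypothetical helper, not the OpenCV implementation): max values stay int8 while the argmax indices are emitted as float32, mirroring the FP32 index output of the original layer.

```python
import numpy as np

def maxpool_int8_with_indices(x, ksize=2, stride=2):
    """2D max pooling on an int8 map. Returns the int8 max values and,
    separately, the flat indices of the maxima as float32."""
    h, w = x.shape
    oh, ow = (h - ksize) // stride + 1, (w - ksize) // stride + 1
    vals = np.empty((oh, ow), dtype=np.int8)
    idxs = np.empty((oh, ow), dtype=np.float32)
    for i in range(oh):
        for j in range(ow):
            y0, x0 = i * stride, j * stride
            win = x[y0:y0 + ksize, x0:x0 + ksize]
            k = int(np.argmax(win))  # window-local flat argmax
            vals[i, j] = win.flat[k]
            # convert the window-local argmax to a flat index in the input map
            idxs[i, j] = (y0 + k // ksize) * w + (x0 + k % ksize)
    return vals, idxs

x = np.array([[1, 3, 2, 0],
              [5, 4, 1, 7],
              [0, 2, 9, 6],
              [8, 1, 3, 4]], dtype=np.int8)
vals, idxs = maxpool_int8_with_indices(x)
```

The point of the sketch is only that the two outputs can legitimately carry different element types, which is exactly where the dtype discussion below comes in.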
jebastin-nadar commented Jun 11, 2021
This is not possible right now, as a single variable determines the datatype of all the outputs of a layer. So outputs[0] and outputs[1] can both be either CV_32F or CV_8S; one as CV_32F and the other as CV_8S is not possible currently.

opencv/modules/dnn/src/dnn.cpp, line 582 in c2c67c2
opencv/modules/dnn/src/dnn.cpp, line 980 in c2c67c2
Of course, changing "dtype" to a std::vector would solve it, but that would introduce a lot of complexity in allocating blobs and a lot of work for a feature which is rarely used. Maybe we can keep it as low priority and look at it later.
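To make the trade-off concrete, here is a toy sketch (hypothetical names, loosely mirroring the layer records in dnn.cpp) of the single-dtype scheme versus the per-output vector being discussed:

```python
from dataclasses import dataclass, field
from typing import List

@dataclass
class LayerDataSingle:
    # current scheme (simplified): one dtype field that every output shares
    dtype: str = "CV_8S"

@dataclass
class LayerDataPerOutput:
    # the std::vector-based alternative: one dtype per output blob,
    # so outputs[0] could be CV_8S while outputs[1] stays CV_32F
    dtypes: List[str] = field(default_factory=list)

# a maxpool layer emitting int8 values plus FP32 indices would need:
pool = LayerDataPerOutput(dtypes=["CV_8S", "CV_32F"])
```

Blob allocation would then have to consult `dtypes[i]` for each output instead of one shared field, which is the complexity the comment refers to.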
jebastin-nadar commented Jun 11, 2021
Some build issues:
vpisarev commented Jun 12, 2021 • edited
OK, sounds good to me. Let's keep it as a low-priority item.
vpisarev commented Jun 12, 2021
I suggest commenting out the AVX-512 branches for now (in your newly added code, not everywhere).

Well, you need to figure that out. If you get stuck on that, we can look at it together.
Force-pushed from f11678f to b36cf00 (Compare)

jebastin-nadar commented Jun 22, 2021
Force-pushed from 28d7c78 to fc8350d (Compare)
Force-pushed from 1afb5b9 to 7b5a392 (Compare)

jebastin-nadar commented Jul 7, 2021
@vpisarev As discussed, the int8 layers which had mostly duplicated code (concat, flatten, padding) have been removed, and the original fp32 layers were modified to support 8-bit inputs as well. In some parallel_for() bodies I have used templates to support multiple datatypes; please check the latest commit to see if any changes have to be made.
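The dtype-generic kernel idea can be sketched outside C++ as well. In the PR this is a `template<typename T>` functor handed to `parallel_for_`; the NumPy analogue below (an illustrative sketch, not the OpenCV code) relies on dtype dispatch so one body serves both the FP32 and the int8 path:

```python
import numpy as np

def relu_kernel(src: np.ndarray, zero) -> np.ndarray:
    """Dtype-generic ReLU body: clamps every element to `zero` from below.
    NumPy dispatches on src.dtype, so the same body handles both paths."""
    return np.maximum(src, zero)

# the FP32 path clamps at 0.0 ...
fp32 = relu_kernel(np.array([-1.5, 0.5, 2.0], dtype=np.float32), np.float32(0))
# ... while the int8 path clamps at the (assumed) quantized zero point
int8 = relu_kernel(np.array([-20, 3, 100], dtype=np.int8), np.int8(-5))
```

In C++ the compiler instantiates one copy of the functor per element type, so the fp32 and int8 layers share a single maintained implementation.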
Force-pushed from 1ff055e to c8a294b (Compare)
Force-pushed from 80998b3 to 6f0162c (Compare)

vpisarev commented Aug 18, 2021
@SamFC10, could you please fix the merge conflicts once again? And then squash the commits? We will try to merge your pull request quickly.
jebastin-nadar commented Aug 18, 2021
Looks like I messed up something.
alalek commented Aug 18, 2021
You can roll back the changes:
jebastin-nadar commented Aug 19, 2021
vpisarev commented Aug 19, 2021
👍
dnn cleanup: On-fly-quantization removal #2498

On-fly-quantization was first introduced via #20228. We decided to remove it but keep the int8 layers implementation, because on-fly-quantization is less practical given that there are so many dedicated tools for model quantization.

### Pull Request Readiness Checklist

See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request

- [x] I agree to contribute to the project under Apache 2 License.
- [x] To the best of my knowledge, the proposed patch is not based on code under GPL or another license that is incompatible with OpenCV
- [x] The PR is proposed to the proper branch
- [x] There is a reference to the original bug report and related work
- [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable. Patch to opencv_extra has the same branch name.
- [x] The feature is well documented and sample code can be built with the project CMake
PR for GSoC'21 project on quantization in DNN module. This PR adds functions to quantize FP32 models and int8 versions of some layers along with tests for the new layers.
A second PR is planned later this summer to load 8-bit quantized models from other frameworks (ONNX/Tensorflow) and perform inference using int8 layers and weights without converting them to FP32 (as done currently).
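For readers unfamiliar with 8-bit quantization, the core transform behind "functions to quantize FP32 models" is the standard affine scheme (sketched below with assumed example scale/zero-point values, not ones taken from this PR):

```python
import numpy as np

def quantize_int8(x, scale, zero_point):
    """Affine quantization: q = clip(round(x / scale) + zero_point, -128, 127),
    mapping a float tensor onto signed 8-bit integers."""
    q = np.round(x / scale) + zero_point
    return np.clip(q, -128, 127).astype(np.int8)

def dequantize(q, scale, zero_point):
    """Inverse map back to float: x ≈ (q - zero_point) * scale."""
    return (q.astype(np.float32) - zero_point) * scale

x = np.array([-1.0, 0.0, 0.5, 2.0], dtype=np.float32)
scale, zp = 0.02, 10          # example calibration parameters (assumed)
q = quantize_int8(x, scale, zp)
x_hat = dequantize(q, scale, zp)
```

The scale and zero point per tensor are what calibration estimates; int8 layers then operate on `q` directly instead of round-tripping through FP32.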
Mentor: @vpisarev
Relates: #16633, #20188
Pull Request Readiness Checklist
See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request
Patch to opencv_extra has the same branch name.