It's possible to quantize ONNX networks to reduce their storage requirements and accelerate inference: https://www.onnxruntime.ai/docs/how-to/quantization.html
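For reference, a minimal sketch of producing such a model with ONNX Runtime's Python quantization API (file paths, the input name, and the input shape are placeholders); static quantization with `QuantFormat.QOperator` is what emits QLinearConv / QLinearMatMul nodes in the resulting graph:

```python
import numpy as np
from onnxruntime.quantization import (
    CalibrationDataReader, QuantFormat, QuantType, quantize_static,
)

class RandomCalibrationReader(CalibrationDataReader):
    """Feeds a few random tensors as calibration data (real samples are better)."""
    def __init__(self, input_name="input", shape=(1, 3, 224, 224), n=8):
        self._data = iter(
            [{input_name: np.random.rand(*shape).astype(np.float32)}
             for _ in range(n)]
        )

    def get_next(self):
        return next(self._data, None)

quantize_static(
    "model_fp32.onnx",                   # placeholder: original FP32 model
    "model_int8.onnx",                   # placeholder: quantized output
    RandomCalibrationReader(),
    quant_format=QuantFormat.QOperator,  # emit QLinearConv / QLinearMatMul
    weight_type=QuantType.QInt8,
)
```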
However, OpenCV 4.x (pre-5.0) is unable to load such networks, because it lacks support for the QLinearConv and QLinearMatMul layers that they contain.
It would be nice to add support for these layers to OpenCV. By default, the weights can be converted to FP32 (or maybe FP16), but the original INT8 weights should be preserved as well, since we will be adding fixed-point paths to our implementations of the convolution and fully-connected layers.
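For context, the ONNX QLinear ops use the affine quantization mapping `real = scale * (quantized - zero_point)` (as in DequantizeLinear). A minimal sketch of converting stored INT8 weights back to FP32 (the function name is illustrative, not an OpenCV API):

```python
import numpy as np

def dequantize(q: np.ndarray, scale: float, zero_point: int) -> np.ndarray:
    """Affine dequantization per the ONNX DequantizeLinear definition:
    real = scale * (quantized - zero_point)."""
    # Widen to int32 first so the subtraction cannot overflow int8.
    return (scale * (q.astype(np.int32) - zero_point)).astype(np.float32)

# Example: an INT8 weight tensor with scale 0.02 and zero point 0
w_int8 = np.array([[-128, 0, 127]], dtype=np.int8)
w_fp32 = dequantize(w_int8, scale=0.02, zero_point=0)
```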
For testing, here is the original ONNX model:
https://drive.google.com/file/d/1JW6_zrgzjeSZQcKKEDhTvp3aseNu0pe9/view?usp=sharing
and its quantized variant:
https://drive.google.com/file/d/1RHkF8pGMfo0covNR0_GQhB11JvrzogFO/view?usp=sharing
(provided by @SamFC10)
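A minimal repro sketch, assuming the two models above have been downloaded locally (file names are placeholders):

```python
import cv2

# The original FP32 model loads fine...
net = cv2.dnn.readNetFromONNX("model_fp32.onnx")

# ...but the quantized variant currently fails because the importer
# does not recognize the QLinearConv / QLinearMatMul nodes:
try:
    qnet = cv2.dnn.readNetFromONNX("model_int8.onnx")
except cv2.error as e:
    print("Failed to load quantized model:", e)
```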