quantize_dynamic
- torch.ao.quantization.quantize_dynamic(model, qconfig_spec=None, dtype=torch.qint8, mapping=None, inplace=False)
Converts a float model to a dynamic (i.e. weights-only) quantized model.
Replaces specified modules with dynamic weight-only quantized versions and outputs the quantized model.
For the simplest usage, provide the dtype argument, which can be float16 or qint8. By default, weight-only quantization is performed on layers with large weight sizes - i.e. Linear and RNN variants.
Fine-grained control is possible with qconfig_spec and mapping, which act similarly to quantize(). If qconfig_spec is provided as a dictionary of QConfig instances, the dtype argument is ignored.
- Parameters
model – input model
qconfig_spec –
Either:
A dictionary that maps from the name or type of a submodule to a quantization configuration. The qconfig applies to all submodules of a given module unless a qconfig is specified for a submodule (i.e. the submodule already has a qconfig attribute). Entries in the dictionary need to be QConfig instances.
A set of types and/or submodule names to apply dynamic quantization to, in which case the dtype argument is used to specify the bit-width
inplace – carry out model transformations in-place; the original module is mutated
mapping – maps the type of a submodule to the type of the corresponding dynamically quantized version with which the submodule needs to be replaced
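A minimal sketch of the simplest usage described above: passing a set of module types as qconfig_spec so that dtype controls the bit-width. The model architecture and sizes here are illustrative assumptions, not from the source.

```python
import torch
import torch.nn as nn

# A small float model containing Linear layers — the kind of module
# dynamic (weights-only) quantization targets by default.
model = nn.Sequential(
    nn.Linear(64, 128),
    nn.ReLU(),
    nn.Linear(128, 10),
)

# Simplest usage: qconfig_spec is a set of types, so the dtype argument
# selects the bit-width (qint8 here) for the weights of those modules.
qmodel = torch.ao.quantization.quantize_dynamic(
    model, qconfig_spec={nn.Linear}, dtype=torch.qint8
)

# The Linear submodules are replaced by dynamically quantized versions;
# inference runs with int8 weights and float activations.
x = torch.randn(1, 64)
out = qmodel(x)
```

With inplace=False (the default), the original float model is left untouched and a new quantized model is returned; passing inplace=True mutates model directly instead.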