Movatterモバイル変換

Home

Rate this Page

★★★★★

Quantization #

Created On: Oct 09, 2019 | Last Updated On: Aug 19, 2025

We are cetralizing all quantization related development totorchao, please checkout our new doc page:https://docs.pytorch.org/ao/stable/index.html

Plan for the existing quantization flows:1. Eager mode quantization (torch.ao.quantization.quantize,torch.ao.quantization.quantize_dynamic), please migrate to use torchao eager modequantize_ API instead

2. FX graph mode quantization (torch.ao.quantization.quantize_fx.prepare_fxtorch.ao.quantization.quantize_fx.convert_fx, please migrate to use torchao pt2e quantizationAPI instead (torchao.quantization.pt2e.quantize_pt2e.prepare_pt2e,torchao.quantization.pt2e.quantize_pt2e.convert_pt2e)

3. pt2e quantization has been migrated to torchao (pytorch/ao)seepytorch/ao#2259 for more details

We plan to deletetorch.ao.quantization in 2.10 if there are no blockers, or in the earliest PyTorch version until all the blockers are cleared.

Quantization API Reference (Kept since APIs are still public)#

TheQuantization API Reference contains documentationof quantization APIs, such as quantization passes, quantized tensor operations,and supported quantized modules and functions.

torch.ao.ns.fx.utils.compute_sqnr(x,y)[source]#

torch.ao.ns.fx.utils.compute_normalized_l2_error(x,y)[source]#

torch.ao.ns.fx.utils.compute_cosine_similarity(x,y)[source]#

On this page

Edit on GitHub

Show Source

PyTorch Libraries

Docs

Access comprehensive developer documentation for PyTorch

View Docs

Tutorials

Get in-depth tutorials for beginners and advanced developers

View Tutorials

Resources

Find development resources and get your questions answered

View Resources

To analyze traffic and optimize your experience, we serve cookies on this site. By clicking or navigating, you agree to allow our usage of cookies. As the current maintainers of this site, Facebook’s Cookies Policy applies. Learn more, including about available controls:Cookies Policy.

[8]ページ先頭