PerToken

class torch.ao.quantization.observer.PerToken

Represents per-token granularity in quantization.

This granularity type calculates a separate set of quantization parameters for each token, where a token is one vector along the last dimension of the tensor.

For example, if the input tensor has shape [2, 3, 4], then there are 6 tokens with 4 elements each, and we will calculate 6 sets of quantization parameters, one for each token.
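The counting in the example above can be sketched in plain Python, with no PyTorch dependency. This is only an illustration of the grouping: `num_tokens` and `per_token_scales` are hypothetical helpers, not part of `torch.ao.quantization`, and the symmetric absmax scale with `qmax=127` is an assumed scheme chosen for brevity.

```python
from math import prod


def num_tokens(shape):
    """Number of per-token groups: the product of every dimension but the last."""
    return prod(shape[:-1])


def per_token_scales(tokens, qmax=127):
    """One symmetric scale per token (absmax / qmax). Assumed scheme, for illustration."""
    return [max(abs(v) for v in tok) / qmax for tok in tokens]


shape = (2, 3, 4)
width = shape[-1]

# 24 sample values, split into consecutive runs of 4: one run per token.
values = [float(i - 12) for i in range(prod(shape))]
tokens = [values[i:i + width] for i in range(0, len(values), width)]

scales = per_token_scales(tokens)
print(num_tokens(shape), len(scales))
```

Each entry of `scales` corresponds to one token, so a [2, 3, 4] input ends up with 6 scales rather than one scale for the whole tensor.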

If the input tensor has only two dimensions, e.g. [8, 16], then this is equivalent to PerAxis(axis=0), which yields 8 sets of quantization parameters.
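The two-dimensional case can be checked with a small stdlib-only sketch. The helpers `per_token_groups` and `per_axis_groups` are hypothetical and model only how elements are grouped; they do not call the real `PerToken` or `PerAxis` classes.

```python
def per_token_groups(matrix):
    """Group a 2-D input per token: each row (last-dimension vector) is one group."""
    return [list(row) for row in matrix]


def per_axis_groups(matrix, axis):
    """Group a 2-D input per index along `axis`: rows for axis=0, columns for axis=1."""
    if axis == 0:
        return [list(row) for row in matrix]
    if axis == 1:
        return [list(col) for col in zip(*matrix)]
    raise ValueError("axis must be 0 or 1 for a 2-D input")


# An 8 x 16 input, matching the shape used in the text.
matrix = [[r * 16 + c for c in range(16)] for r in range(8)]

token_groups = per_token_groups(matrix)
axis0_groups = per_axis_groups(matrix, axis=0)

print(len(token_groups), token_groups == axis0_groups)
```

For comparison, `per_axis_groups(matrix, axis=1)` would produce 16 groups of 8 elements each, which is a different grouping from per-token.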
