torch.cuda.comm.reduce_add_coalesced

torch.cuda.comm.reduce_add_coalesced(inputs, destination=None, buffer_size=10485760)

Sum tensors from multiple GPUs.

Small tensors are first coalesced into a buffer to reduce the number of synchronizations.
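The coalescing idea can be illustrated without any GPU. The sketch below is a simplified illustration, not PyTorch's internal code: plain Python lists stand in for the tensors held by each device, and `coalesced_sum` is a hypothetical helper name. It concatenates each device's small lists into one flat buffer, sums the buffers in a single pass, and splits the result back into the original shapes. The `buffer_size` limit, which caps how much is packed into one buffer, is left out for brevity.

```python
from itertools import accumulate


def coalesced_sum(groups):
    """Elementwise-sum matching lists across groups via one flat buffer per group.

    `groups` holds one entry per (simulated) device; each entry is a list of
    flat lists of numbers standing in for that device's tensors.
    """
    lengths = [len(t) for t in groups[0]]
    # Coalesce: concatenate every small "tensor" of a group into one flat buffer.
    buffers = [[x for t in group for x in t] for group in groups]
    # One reduction over the buffers replaces one reduction per tensor.
    summed = [sum(vals) for vals in zip(*buffers)]
    # Split the summed buffer back into pieces of the original lengths.
    ends = list(accumulate(lengths))
    starts = [0] + ends[:-1]
    return [summed[s:e] for s, e in zip(starts, ends)]


device0 = [[1, 2, 3], [10, 20]]
device1 = [[4, 5, 6], [30, 40]]
print(coalesced_sum([device0, device1]))  # prints [[5, 7, 9], [40, 60]]
```

With two tensors per device, the per-tensor approach would need two cross-device reductions; the coalesced version needs one.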

Parameters

inputs (Iterable[Iterable[Tensor]]) – iterable of iterables that contain tensors from a single device.
destination (int, optional) – a device on which the output will be placed (default: current device).
buffer_size (int) – maximum size, in bytes, of the buffer used for coalescing (default: 10485760, i.e. 10 MiB).

Returns

A tuple of tensors containing an elementwise sum of each group of inputs, placed on the destination device.
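A minimal usage sketch follows, assuming a machine with PyTorch installed and at least two CUDA devices. The wrapper name `reduce_add_coalesced_demo` and the tensor shapes and values are illustrative choices, not part of the API. The sketch sends the result to device 0, which already holds one of the input groups, and returns None instead of failing when the hardware requirement is not met.

```python
try:
    import torch
    import torch.cuda.comm as comm
except ImportError:  # PyTorch is not installed in this environment
    torch = None


def reduce_add_coalesced_demo():
    """Sum two groups of tensors held on two GPUs; return None if not possible here."""
    if torch is None or not torch.cuda.is_available() or torch.cuda.device_count() < 2:
        return None

    # One inner list per device; position i in every list has the same shape.
    on_gpu0 = [torch.ones(3, device="cuda:0"), torch.full((2, 2), 2.0, device="cuda:0")]
    on_gpu1 = [torch.ones(3, device="cuda:1"), torch.full((2, 2), 3.0, device="cuda:1")]

    # The sums are placed on device 0; buffer_size keeps its default value.
    return comm.reduce_add_coalesced([on_gpu0, on_gpu1], destination=0)


sums = reduce_add_coalesced_demo()
if sums is None:
    print("Skipped: needs PyTorch with at least two CUDA devices.")
else:
    for t in sums:
        print(t.device, tuple(t.shape))
```

Each position in the returned tuple corresponds to the same position in the inner lists, so the first output is the sum of the two length-3 tensors and the second is the sum of the two 2x2 tensors.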
