torch.cuda.comm.gather#
- torch.cuda.comm.gather(tensors,dim=0,destination=None,*,out=None)[source]#
Gathers tensors from multiple GPU devices.
- Parameters
tensors (Iterable[Tensor]) – an iterable of tensors to gather.Tensor sizes in all dimensions other than
dimhave to match.dim (int,optional) – a dimension along which the tensors will beconcatenated. Default:
0.destination (torch.device,str, orint,optional) – the output device.Can be CPU or CUDA. Default: the current CUDA device.
out (Tensor,optional,keyword-only) – the tensor to store gather result.Its sizes must match those of
tensors, except fordim,where the size must equalsum(tensor.size(dim)fortensorintensors).Can be on CPU or CUDA.
Note
destinationmust not be specified whenoutis specified.- Returns
- If
destinationis specified, a tensor located on
destinationdevice, that is a result ofconcatenatingtensorsalongdim.
- If
- If
outis specified, the
outtensor, now containing results of concatenatingtensorsalongdim.
- If