Rate this Page
★★★★★
torch.cuda.jiterator._create_multi_output_jit_fn#
- torch.cuda.jiterator._create_multi_output_jit_fn(code_string,num_outputs,**kwargs)[source]#
Create a jiterator-generated cuda kernel for an elementwise op that supports returning one or more outputs.
- Parameters:
- Return type:
Example:
code_string="template <typename T> void my_kernel(T x, T y, T alpha, T& out) { out = -x + alpha * y; }"jitted_fn=create_jit_fn(code_string,alpha=1.0)a=torch.rand(3,device="cuda")b=torch.rand(3,device="cuda")# invoke jitted function like a regular python functionresult=jitted_fn(a,b,alpha=3.14)
Warning
This API is in beta and may change in future releases.
Warning
This API only supports up to 8 inputs and 8 outputs
On this page