Utilize GPUKernelArgs parameter for ew-collective, nd-conv, replace_slice. (#1346)
* Support GPUKernelArgs in Elementwise-collective and Nd-Convolution.
* Update op::ReplaceSlice to use GPUKernelArgs and unroll coordinate transform loop.
* Formatting.
* Moved function signature for global kernels back to emitter body.
* Formatting.