GPU kernels for reshape, GEMM, EW ADD/Mult, Maximum (#440)
* GPU kernels for reshape, GEMM, EW ADD/Mult, Maximum (A + B) * C test now with cuBLAS Additional gemm and gemv calls cmake updates for cuDNN calls memcpy wrappers in gpu_util Additional passing tests: aliased outputs, parameter, constant tensor memcopy
Showing
This diff is collapsed.
This diff is collapsed.
Please
register
or
sign in
to comment