Use cuda_reduce for int32 & int8 input_type instead of cudnn_reduce (#2070)
* add cuda reduce for product and max and added tests * a quick fix for empty reduce axis and 0 axis * adding min cuda reduce * add for min * fix bug and format * add another min test * adding sum to the mix and adding tests to intelGPU manifest * Incorporate Chris's first comment + clang * Some mods to the last commit * Addressed Bob's comments + added more tests int8 * Added more int8 tests + added tests to IntelGPU manifest * CI test failure debug attempt * clang * edit * Adding the CPU failing test to manifest * pulled changes from master to address travis ci build failure
Showing
This diff is collapsed.
Please
register
or
sign in
to comment