    nvgpu reduction optimization (#1455) · 6679c233
    Fenglei authored
    * add cuda reduce
    
    * clang format
    
    * fix bugs
    
    * fix bug
    
    * add 1d reduce
    
    * clang format
    
    * fix bugs
    
    * unroll loop
    
    * remove debug info
    
    * revert tests
    
    * unroll 1D reduce op
    
    * add comments
    
    * using cudnn for nd to scalar reduction
    
    * remove cuda 1d reduction since cudnn version is faster
    
    * remove 1D kernel
    
    * fix variable name
    
    * resolve Chris's comments
    
    * non_reduce_in_strides to non_reduce_strides
Name
autodiff
builder
codegen
descriptor
frontend
op
pass
pattern
runtime
type
CMakeLists.txt
assertion.hpp
axis_set.cpp
axis_set.hpp
axis_vector.cpp
axis_vector.hpp
coordinate.cpp
coordinate.hpp
coordinate_diff.cpp
coordinate_diff.hpp
coordinate_transform.cpp
coordinate_transform.hpp
cpio.cpp
cpio.hpp
except.hpp
file_util.cpp
file_util.hpp
function.cpp
function.hpp
graph_util.cpp
graph_util.hpp
log.cpp
log.hpp
ngraph.hpp
node.cpp
node.hpp
node_vector.hpp
placement.cpp
placement.hpp
serializer.cpp
serializer.hpp
shape.cpp
shape.hpp
strides.cpp
strides.hpp
util.cpp
util.hpp
uuid.hpp