• Fenglei's avatar
    nvgpu cuda reduce with stable sum (#2076) · 606f3f93
    Fenglei authored
    * add some helper function
    
    * update with new helper function
    
    * update reduce to nd with new helper function
    
    * update float sum to stable sum
    
    * fix bug
    
    * update all reduce to stable sum for float
    
    * fix bug and pass the sum stable test
    
    * remove debug info
    
    * style
    
    * update with shape
    
    * fix bug
    
    * add host parameters to cuda_emitter
    
    * clang format
    
    * fix bugs
    
    * add element::type support
    
    * format
    
    * add a cached value with datatype name
    
    * add init_reduce_value
    
    * unroll loop
    
    * optimization
    
    * remove the need for init_value
    
    * add memset kernel
    
    * add memcpy
    
    * working version
    
    * remove debug info
    
    * add comments, clean up code.
    
    * change in_idx to input_idx
    
    * fix bug
    
    * change args name for memset in emitter
    
    * pass element::Type instead of string
    
    * the op::reduce come with init value, add support
    
    * resolve codacy-bot comment
    
    * fix bug
    
    * resove codacy-bot comment
    
    * remove unused comments, resolve comments
    
    * cuda reduce for max, min, mul, reduce op init value, format
    
    * use type::info
    
    * use type info for numeric_limits
    
    * remove code from gpu_host_parameters
    
    * header
    
    * remvoe outdated comments
    
    * add helper to check if stable sum is needed
    
    * add stable sum test for double
    
    * remove extra line
    
    * consolidate helper functions
    
    * no need list now.
    
    * remove extra ;
    
    * clang format
    
    * style
    
    * add skip test for cpu and intelGPU side
    
    * add line between groups of headers
    
    * add two simple stable sum test for float and double
    
    * skip test for intelGPU
    606f3f93
Name
Last commit
Last update
..
ngraph Loading commit data...
resource Loading commit data...
tools Loading commit data...
CMakeLists.txt Loading commit data...