1. 07 Dec, 2018 1 commit
    • Backend API change pre-work (#2064) · e0933553
      Robert Kimball authored
      * change compile call to return Handle
      
      * make CPU require compile() before call()
      
      * fix unit tests to call compile() before call()
      
      * fix failing ops
      
      * update unit test
      
      * revert some changes
      
      * more fixups
      
      * more diff cleanup
      
      * a few more issues addressed
      
      * more fixes
      
      * update API
      
      * more updates
      
      * fix test_ops.py
      
      * fix
      
      * another attempt to fix
      
      * fix unit test
      
      * fix test error
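The commit above changes `compile` to return a handle and makes the CPU backend require `compile()` before `call()`. A minimal sketch of that calling convention follows. Every name in it (`ToyBackend`, `ToyFunction`, the `Handle` alias, and both method signatures) is a hypothetical stand-in for illustration, not the actual nGraph backend API.

```cpp
#include <cstddef>
#include <functional>
#include <stdexcept>
#include <utility>
#include <vector>

// Hypothetical toy types for illustration only; this is NOT the nGraph API.
using Handle = std::size_t;
using ToyFunction = std::function<int(int)>;

class ToyBackend
{
public:
    // "Compiling" stores the function and returns a handle for later calls.
    Handle compile(ToyFunction f)
    {
        m_compiled.push_back(std::move(f));
        return m_compiled.size() - 1;
    }

    // call() only accepts a handle previously returned by compile().
    int call(Handle h, int arg) const
    {
        if (h >= m_compiled.size())
        {
            throw std::runtime_error("call() invoked before compile()");
        }
        return m_compiled[h](arg);
    }

private:
    std::vector<ToyFunction> m_compiled;
};
```

In this sketch, calling `call()` with a handle that `compile()` never returned throws, which mirrors the "compile first" requirement in spirit only.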
  2. 06 Dec, 2018 1 commit
    • nvgpu cuda reduce with stable sum (#2076) · 606f3f93
      Fenglei authored
      * add some helper function
      
      * update with new helper function
      
      * update reduce to nd with new helper function
      
      * update float sum to stable sum
      
      * fix bug
      
      * update all reduce to stable sum for float
      
      * fix bug and pass the sum stable test
      
      * remove debug info
      
      * style
      
      * update with shape
      
      * fix bug
      
      * add host parameters to cuda_emitter
      
      * clang format
      
      * fix bugs
      
      * add element::type support
      
      * format
      
      * add a cached value with datatype name
      
      * add init_reduce_value
      
      * unroll loop
      
      * optimization
      
      * remove the need for init_value
      
      * add memset kernel
      
      * add memcpy
      
      * working version
      
      * remove debug info
      
      * add comments, clean up code.
      
      * change in_idx to input_idx
      
      * fix bug
      
      * change args name for memset in emitter
      
      * pass element::Type instead of string
      
      * op::reduce comes with an init value; add support for it
      
      * resolve codacy-bot comment
      
      * fix bug
      
      * resolve codacy-bot comment
      
      * remove unused comments, resolve comments
      
      * cuda reduce for max, min, mul, reduce op init value, format
      
      * use type::info
      
      * use type info for numeric_limits
      
      * remove code from gpu_host_parameters
      
      * header
      
      * remove outdated comments
      
      * add helper to check if stable sum is needed
      
      * add stable sum test for double
      
      * remove extra line
      
      * consolidate helper functions
      
      * no need list now.
      
      * remove extra ;
      
      * clang format
      
      * style
      
      * add skip test for cpu and intelGPU side
      
      * add line between groups of headers
      
      * add two simple stable sum test for float and double
      
      * skip test for intelGPU
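The commit above switches float reductions to a "stable sum", and an earlier commit in this log mentions Kahan summation for the interpreter. The host-side sketch below shows Kahan (compensated) summation next to a naive sum. It is an illustration of the general technique under the assumption that this is what "stable sum" refers to; it is not the CUDA kernel from the commit, and the function names are made up for this example.

```cpp
#include <cstddef>
#include <vector>

// Naive left-to-right accumulation: small addends can be lost entirely
// once the running sum is large.
float naive_sum(const std::vector<float>& values)
{
    float sum = 0.0f;
    for (float v : values)
    {
        sum += v;
    }
    return sum;
}

// Kahan (compensated) summation: carries the rounding error of each
// addition in a separate term and feeds it back into the next addend.
// Assumes IEEE-754 float arithmetic; flags such as -ffast-math may let
// the compiler optimize the compensation away.
float kahan_sum(const std::vector<float>& values)
{
    float sum = 0.0f;
    float compensation = 0.0f; // rounding error accumulated so far
    for (float v : values)
    {
        float y = v - compensation; // addend corrected by the previous error
        float t = sum + y;          // low-order bits of y may be lost here
        compensation = (t - sum) - y; // recover what was lost
        sum = t;
    }
    return sum;
}
```

For an input of `1e8f` followed by eighty `1.0f` values, the naive sum should stay at 100000000 because each `1.0f` is below half the float spacing (8) at that magnitude, while the compensated sum should reach 100000080, assuming strict IEEE-754 single precision.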
  3. 01 Dec, 2018 1 commit
    • Support arg_reduce for int32 for nvGPU (#2122) · 7188b5bd
      Ayan Moitra authored
      * Add argmReduce support for int32 + tests
      
      * add new line
      
      * add to intelGPU manifest
      
      * Address Fenglei's comment
      
      * address comments
      
      * Small change to emitter logic.
      
      * Chris's comments incorporated
      
      * minor edits + clang
      
      * edit
  4. 21 Nov, 2018 1 commit
    • Use cuda_reduce for int32 & int8 input_type instead of cudnn_reduce (#2070) · d36c180f
      Ayan Moitra authored
      * add cuda reduce for product and max and added tests
      
      * a quick fix for empty reduce axis and 0 axis
      
      * adding min cuda reduce
      
      * add for min
      
      * fix bug and format
      
      * add another min test
      
      * adding sum to the mix and adding tests to intelGPU manifest
      
      * Incorporate Chris's first comment + clang
      
      * Some mods to the last commit
      
      * Addressed Bob's comments + added more tests int8
      
      * Added more int8 tests + added tests to IntelGPU manifest
      
      * CI test failure debug attempt
      
      * clang
      
      * edit
      
      * Adding the CPU failing test to manifest
      
      * pulled changes from master to address travis ci build failure
  5. 16 Nov, 2018 1 commit
  6. 14 Nov, 2018 1 commit
  7. 13 Nov, 2018 1 commit
    • Add cudnn sum and test that exposes tf test failure for nvGPU (#2033) · cccdc304
      Ayan Moitra authored
      * Add cudnn sum and test that exposes failure
      
      * Add test to interpreter manifest and move test to sum file
      
      * change test name
      
      * Fails INTELGPU and no need to run on CPU
      
      * Comment incorporation + Kahan summation for interpreter
      
      * small oversight correction
      
      * resolve conflict
      
      * minor variable name edit
  8. 19 Oct, 2018 1 commit