1. 20 Aug, 2019 1 commit
  2. 09 Aug, 2019 1 commit
  3. 25 Jul, 2019 1 commit
  4. 19 Jul, 2019 1 commit
  5. 16 Apr, 2019 1 commit
  6. 18 Mar, 2019 1 commit
    • Robert Kimball's avatar
      Change floating point comparisons from == to all_close_f (#2620) · 56e160ba
      Robert Kimball authored
      * change float comparisons from == to all_close_f
      
      * style
      
      * address a few more direct float comparisons
      
      * add missing include
      
      * specify tightest tolerance for Broadcast and Reshape tests
      
      * Increased tightness of float testing
      
      Increased tightness of float testing via MIN_FLOAT_TOLERANCE_BITS parameter
      
      * style
      56e160ba
  7. 27 Feb, 2019 1 commit
    • Ayan Moitra's avatar
      Unit tests for relevant resnet50 integer ops (#2456) · 86394f10
      Ayan Moitra authored
      * Int unit tests that fail with bfloat
      
      * move tests out of single file
      
      * style
      
      * Incorporate Bob's comments
      
      * edits
      
      * Incorporate comments
      
      * style
      
      * edits
      
      * Add failing test to intel gpu manifest
      
      * comments incoprorated
      86394f10
  8. 25 Feb, 2019 1 commit
    • Sang Ik Lee's avatar
      Update mkl-dnn build script. (#2487) · 65ac0e68
      Sang Ik Lee authored
      Update TBB build script for Windows.
      
      Fix typo.
      
      Fix incorrect omp lib name on Windows.
      
      Fix incorrect tbb.dll path on Windows.
      
      Make LIBRARY and ARCHIVE output directory consistent.
      
      Function missing on Windows.
      
      Update test::util::all_close() to fix compilation issue on Windows
      
      Export CPU_Executable on Windows.
      
      Change nbench path for unit-test on Windows.
      
      Change copy to copy_if_different.
      
      Install CPU backend on Windows.
      
      Disable tools test on Windows.
      
      Disable two failing unit test on Windows CPU.
      
      Fix incorrect CPU backend install path on Windows.
      65ac0e68
  9. 22 Feb, 2019 1 commit
  10. 07 Jan, 2019 1 commit
    • gcwenger's avatar
      Simplified all_close_f interface and tightened default criteria (#2285) · 0eaa960c
      gcwenger authored
      * Simplified & tightened all_close_f parameters
      
      Removed specification of mantissa bits for all_close_f in favor
      of just specifying tolerance bits. Tightened up all_close_f default.
      Fixed LRN unit test which had insufficient result precision to pass
      tighter all_close_f tolerance.
      
      * Addressed PR comments.
      
      Reworked mantissa bit and tolerance constants.
      Clarified and improved graph comparison tolerance calculation flexibility.
      Clarified unit test tolerance testing.
      0eaa960c
  11. 03 Jan, 2019 1 commit
  12. 19 Dec, 2018 1 commit
  13. 07 Dec, 2018 1 commit
    • Robert Kimball's avatar
      Backend API change pre-work (#2064) · e0933553
      Robert Kimball authored
      * change compile call to return Handle
      
      * make CPU require compile() before call()
      
      * fix unit tests to call compile() before call()
      
      * fix failing ops
      
      * update unit test
      
      * revert some changes
      
      * more fixups
      
      * more diff cleanup
      
      * a few more issues addressed
      
      * more fixes
      
      * update API
      
      * more updates
      
      * fix test_ops.py
      
      * fix
      
      * another attempt to fix
      
      * fix unit test
      
      * fix test error
      e0933553
  14. 06 Dec, 2018 1 commit
    • Fenglei's avatar
      nvgpu cuda reduce with stable sum (#2076) · 606f3f93
      Fenglei authored
      * add some helper function
      
      * update with new helper function
      
      * update reduce to nd with new helper function
      
      * update float sum to stable sum
      
      * fix bug
      
      * update all reduce to stable sum for float
      
      * fix bug and pass the sum stable test
      
      * remove debug info
      
      * style
      
      * update with shape
      
      * fix bug
      
      * add host parameters to cuda_emitter
      
      * clang format
      
      * fix bugs
      
      * add element::type support
      
      * format
      
      * add a cached value with datatype name
      
      * add init_reduce_value
      
      * unroll loop
      
      * optimization
      
      * remove the need for init_value
      
      * add memset kernel
      
      * add memcpy
      
      * working version
      
      * remove debug info
      
      * add comments, clean up code.
      
      * change in_idx to input_idx
      
      * fix bug
      
      * change args name for memset in emitter
      
      * pass element::Type instead of string
      
      * the op::reduce come with init value, add support
      
      * resolve codacy-bot comment
      
      * fix bug
      
      * resove codacy-bot comment
      
      * remove unused comments, resolve comments
      
      * cuda reduce for max, min, mul, reduce op init value, format
      
      * use type::info
      
      * use type info for numeric_limits
      
      * remove code from gpu_host_parameters
      
      * header
      
      * remvoe outdated comments
      
      * add helper to check if stable sum is needed
      
      * add stable sum test for double
      
      * remove extra line
      
      * consolidate helper functions
      
      * no need list now.
      
      * remove extra ;
      
      * clang format
      
      * style
      
      * add skip test for cpu and intelGPU side
      
      * add line between groups of headers
      
      * add two simple stable sum test for float and double
      
      * skip test for intelGPU
      606f3f93
  15. 01 Dec, 2018 1 commit
    • Ayan Moitra's avatar
      Support arg_reduce for int32 for nvGPU (#2122) · 7188b5bd
      Ayan Moitra authored
      * Add argmReduce support for int32 +  tests
      
      * add new line
      
      * add to intelGPU manifest
      
      * Address Fenglei's comment
      
      * address comments
      
      * Small change to emitter logic.
      
      * Chris's comments incorporated
      
      * minor edits + clang
      
      * edit
      7188b5bd
  16. 21 Nov, 2018 1 commit
    • Ayan Moitra's avatar
      Use cuda_reduce for int32 & int8 input_type instead of cudnn_reduce (#2070) · d36c180f
      Ayan Moitra authored
      * add cuda reduce for product and max and added tests
      
      * a quick fix for empty reduce axis and 0 axis
      
      * adding min cuda reduce
      
      * add for min
      
      * fix bug and format
      
      * add another min test
      
      * adding sum to the mix and adding tests to intelGPU manifest
      
      * Incorporate Chris's first comment + clang
      
      * Some mods to the last commit
      
      * Addressed Bob's comments + added more tests int8
      
      * Added more int8 tests + added tests to IntelGPU manifest
      
      * CI test failure debug attempt
      
      * clang
      
      * edit
      
      * Adding the CPU failing test to manifest
      
      * pulled changes from master to address travis ci build failure
      d36c180f
  17. 16 Nov, 2018 1 commit
  18. 14 Nov, 2018 1 commit
  19. 13 Nov, 2018 1 commit
    • Ayan Moitra's avatar
      Add cudnn sum and test that exposes tf test failure for nvGPU (#2033) · cccdc304
      Ayan Moitra authored
      * Add cudnn sum and test that exposes failure
      
      * Add test to interpreter manifest and move test to sum file
      
      * change test name
      
      * Fails INTELGPU and no need to run on CPU
      
      * Comment incorporation + kahan summation for interpreter
      
      * small oversight correction
      
      * resolve conflict
      
      * minor variable name edit
      cccdc304
  20. 19 Oct, 2018 1 commit