1. 13 Nov, 2018 4 commits
    • Ayan Moitra's avatar
      Add cudnn sum and test that exposes tf test failure for nvGPU (#2033) · cccdc304
      Ayan Moitra authored
      * Add cudnn sum and test that exposes failure
      
      * Add test to interpreter manifest and move test to sum file
      
      * change test name
      
      * Fails INTELGPU and no need to run on CPU
      
      * Comment incorporation + kahan summation for interpreter
      
      * small oversight correction
      
      * resolve conflict
      
      * minor variable name edit
      cccdc304
    • Adam Rogowiec's avatar
      [ONNX] Fix MatMul op for vec @ tensor multiplication (#1969) · 76b8b4d4
      Adam Rogowiec authored
      * Add static keyword for helper function.
      
      * Fix MatMul for cases where left hand side is 1D vector.
      
      - Add unit-test for this case.
      
      * Add new line at the end of file.
      
      * Log warning when dealing with scalars
      
      * Apply clang-format
      
      * Review: fix spelling, rename test model.
      76b8b4d4
    • Amy Zhuang's avatar
      Ayzhuang/propagate cacheability (#1982) · 6e234d65
      Amy Zhuang authored
      * Add cacheablility propagation pass.
      
      * Use a functor to create op annotations.
      
      * Address PR feedback.
      
      * Address PR feedback.
      
      * Address PR feedback.
      6e234d65
    • mchrusci's avatar
      [ONNX CI] ONNX CI Improvements (#2026) · f0c17477
      mchrusci authored
      * Kill previous builds on PR update
      
      * Remove Jenkinsfile.groovy
      
      * Lower case method names
      
      * Fix method notify()
      
      * Added comment
      
      Previous build deletion workaround to be removed as soon as better, less security vulnerable solution is found.
      
      * Fix inconsistent stage names
      f0c17477
  2. 12 Nov, 2018 5 commits
  3. 11 Nov, 2018 2 commits
    • Fenglei's avatar
      nvgpu softmax cuda version (#2014) · be9f031e
      Fenglei authored
      * add softmax cuda support
      
      * optimize block size
      
      * remove debug info
      
      * remove debug
      
      * style
      
      * remove unused
      
      * remove cudnn softmax
      
      * format
      
      * using nullptr
      
      * move helper, add test
      
      * fix style
      
      * using all_close_f
      
      * using kahansum
      
      * style
      
      * remove commentted out code
      be9f031e
    • Fenglei's avatar
      add isfinite check for all_close (#2028) · 702d465a
      Fenglei authored
      * add isfinite check
      
      * style
      
      * output 5 diff and total diff
      
      * output limit of diff for all_close_f
      
      * dix bug
      
      * disable tests
      
      * remove failing unit test that does not make sense.
      702d465a
  4. 10 Nov, 2018 4 commits
  5. 09 Nov, 2018 11 commits
  6. 08 Nov, 2018 6 commits
  7. 07 Nov, 2018 8 commits
    • Jayaram Bobba's avatar
      IAT: Collapse dims for Dot ops (#1991) · e5d9b540
      Jayaram Bobba authored
      * Collapse dimensions for inputs to Dot
      
      * Remove eigen kernels for higher dimension dots since they will collapse to cblas_gemm kernels
      
      * Moved collapse dims pass after the fusion passes to prevent interference with fusion patterns. Use cblas_gemm for 2D dot
      e5d9b540
    • Chris Sullivan's avatar
      add dtype-generic load definitions and clean up nvrtc helpers (#1975) · f33317cc
      Chris Sullivan authored
      * Refactor include_helpers into an nvrtc specific helper file. Add templated define functions for coherent and noncoherent memory loads.
      
      * Format
      
      * const refs.
      
      * Remove cast of zero.
      f33317cc
    • Adam Procter's avatar
      af889535
    • Robert Kimball's avatar
      NOP backend (#1979) · 4918449c
      Robert Kimball authored
      * add nop backend
      
      * nop backend
      
      * fix flag name
      
      * add new switch to cmake output of switch settings
      
      * add new unit test to igpu manifest
      
      * remove redundant test
      4918449c
    • Robert Kimball's avatar
      address issues which surface with clang 6.0 (#1980) · 79802dcf
      Robert Kimball authored
      * address issues which surface with clang 6.0
      
      * revert changes due to new clang warning and disable new warning
      79802dcf
    • Adam Straw's avatar
      graph builders for quantize scale (#1976) · 8bd3846f
      Adam Straw authored
      * quantize scale passing unit tests
      
      * epsilon bump
      
      * finished with quantization scale
      
      * unit tests passing with convolution scale as builder
      
      * broadcasted constants and cleanup
      
      * api consistency for quant builders
      
      * code style
      
      * cleanup
      
      * newline at EOF
      
      * use requantization_scale
      
      * drop TF license as we are no longer using TF code directly
      8bd3846f
    • Amy Zhuang's avatar
      Do not fuse nodes if one node is predecessor of another node in horiz… (#1928) · 2a26558a
      Amy Zhuang authored
      * Do not fuse nodes if one node is predecessor of another node in horizontal fusion.
      
      * Add dead node check and remove predecessor check in horizontal fusion.
      2a26558a
    • Robert Kimball's avatar
      Add a real HybridBackend (#1998) · 45fba7b1
      Robert Kimball authored
      * wip
      
      * wip
      
      * wip
      
      * move hybrid wrapper to hybrid backend dir
      
      * move hybrid wrapper to correct namespace
      
      * wip
      
      * sorta working
      
      * remove debug from sorta working homogeneous hybrid backend
      
      * is_supported is supported for GPU
      
      * cleanup debug
      
      * more progress
      
      * remove debug
      
      * cleanup
      
      * turn off hybrid by default
      
      * revert change
      
      * revert
      
      * rename wrapper to backend
      
      * revert
      
      * address review comments
      
      * style
      45fba7b1