  1. 15 Nov, 2018 1 commit
  2. 14 Nov, 2018 5 commits
  3. 13 Nov, 2018 5 commits
  4. 12 Nov, 2018 5 commits
  5. 11 Nov, 2018 2 commits
    • nvgpu softmax cuda version (#2014) · be9f031e
      Fenglei authored
      * add softmax cuda support
      
      * optimize block size
      
      * remove debug info
      
      * remove debug
      
      * style
      
      * remove unused
      
      * remove cudnn softmax
      
      * format
      
      * using nullptr
      
      * move helper, add test
      
      * fix style
      
      * using all_close_f
      
      * using kahansum
      
      * style
      
      * remove commented-out code
    • add isfinite check for all_close (#2028) · 702d465a
      Fenglei authored
      * add isfinite check
      
      * style
      
      * output 5 diff and total diff
      
      * output limit of diff for all_close_f
      
      * fix bug
      
      * disable tests
      
      * remove failing unit test that does not make sense.
  6. 10 Nov, 2018 4 commits
  7. 09 Nov, 2018 11 commits
  8. 08 Nov, 2018 6 commits
  9. 07 Nov, 2018 1 commit
    • IAT: Collapse dims for Dot ops (#1991) · e5d9b540
      Jayaram Bobba authored
      * Collapse dimensions for inputs to Dot
      
      * Remove eigen kernels for higher dimension dots since they will collapse to cblas_gemm kernels
      
      * Moved collapse dims pass after the fusion passes to prevent interference with fusion patterns. Use cblas_gemm for 2D dot
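
The dimension collapsing described in the last commit above can be illustrated with a small sketch. It is not the code from that commit: the function name `collapse_dot_shapes` and the `reduction_axes_count` parameter are hypothetical, and the sketch assumes a Dot that contracts the trailing axes of the first input against the leading axes of the second. Under that assumption, the non-reduced axes of each input fold into a single dimension, so a higher-dimensional Dot becomes a plain 2D `[M, K] x [K, N]` product of the kind a GEMM kernel handles.

```python
from math import prod


def collapse_dot_shapes(a_shape, b_shape, reduction_axes_count):
    """Return (M, K, N) for the 2D product equivalent to Dot(A, B).

    Assumption (not taken from the commit): the last `reduction_axes_count`
    axes of A are contracted against the first `reduction_axes_count`
    axes of B.
    """
    n = reduction_axes_count
    split = len(a_shape) - n
    a_free, a_reduced = tuple(a_shape[:split]), tuple(a_shape[split:])
    b_reduced, b_free = tuple(b_shape[:n]), tuple(b_shape[n:])
    if a_reduced != b_reduced:
        raise ValueError("reduction axes of A and B do not match")
    # prod(()) is 1, so an input with no free axes collapses to size 1.
    return prod(a_free), prod(a_reduced), prod(b_free)


# A is 2x3x4, B is 4x5, one reduction axis: collapses to [6, 4] x [4, 5].
m, k, n = collapse_dot_shapes((2, 3, 4), (4, 5), 1)
```

The result of the 2D product would then be reshaped back to the free axes of A followed by the free axes of B, `(2, 3, 5)` in this example.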