1. 16 Nov, 2018 1 commit
  2. 15 Nov, 2018 4 commits
  3. 14 Nov, 2018 5 commits
  4. 13 Nov, 2018 5 commits
  5. 12 Nov, 2018 5 commits
  6. 11 Nov, 2018 2 commits
    • nvgpu softmax cuda version (#2014) · be9f031e
      Fenglei authored
      * add softmax cuda support
      
      * optimize block size
      
      * remove debug info
      
      * remove debug
      
      * style
      
      * remove unused
      
      * remove cudnn softmax
      
      * format
      
      * using nullptr
      
      * move helper, add test
      
      * fix style
      
      * using all_close_f
      
      * using kahansum
      
      * style
      
      * remove commented-out code
    • add isfinite check for all_close (#2028) · 702d465a
      Fenglei authored
      * add isfinite check
      
      * style
      
      * output 5 diff and total diff
      
      * output limit of diff for all_close_f
      
      * fix bug
      
      * disable tests
      
      * remove failing unit test that does not make sense.
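      To illustrate why an isfinite check matters in a closeness test: any ordered comparison involving NaN evaluates to false, so a check written as "difference greater than tolerance" never fires for NaN and a bad result can pass silently. The sketch below is a minimal illustration, not the project's actual `all_close`. The name `all_close_sketch`, the tolerance formula, the default tolerances, and the choice to treat any non-finite element as a mismatch are all assumptions.

```cpp
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical helper: returns true only if every pair of elements is
// finite and within atol + rtol * |b[i]| of each other.
bool all_close_sketch(const std::vector<float>& a,
                      const std::vector<float>& b,
                      float rtol = 1e-5f,
                      float atol = 1e-8f)
{
    if (a.size() != b.size())
    {
        return false;
    }
    for (std::size_t i = 0; i < a.size(); ++i)
    {
        // Reject NaN and infinity explicitly; the tolerance test below
        // would otherwise let NaN through.
        if (!std::isfinite(a[i]) || !std::isfinite(b[i]))
        {
            return false;
        }
        if (std::fabs(a[i] - b[i]) > atol + rtol * std::fabs(b[i]))
        {
            return false;
        }
    }
    return true;
}
```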
  7. 10 Nov, 2018 4 commits
  8. 09 Nov, 2018 11 commits
  9. 08 Nov, 2018 3 commits
    • Address potential bug in cudnnGetReductionWorkspaceSize (#1990) · dfc20454
      Chris Sullivan authored
      * When CUDNN_DATA_TYPE == CUDNN_DATA_DOUBLE, it appears that the cudnn calculated workspace size is incorrect.
      Adding a temporary fix here until the underlying issue is found.
      
      * Add softmax test illustrating bug in cudnn impl.
      
      * disable new unit test on Intel GPU
    • nvgpu cuda reduce (#1988) · 32398641
      Fenglei authored
      * change reduce using cuda, add support for AND, OR
      
      * fix bug and format
      
      * remove unused code
      
      * style
      
      * change reduce_op to reduce_func to avoid shadowing, thanks Ayan.
      
      * using dynamic_pointer_cast
    • [ONNX CI] ONNX CI fixes (#2024) · 77899668
      mchrusci authored