1. 18 Nov, 2018 1 commit
    • Support to selectively enable/disable passes through env variable (#2049) · 880594ba
      Jayaram Bobba authored
      * Support to selectively enable/disable passes through env variable
      
      * Address compiler warning about zero arg macros
      
      * Move pass selection options to PassConfig object
      
      * remove unnecessary header include
      
      * use default copy constructor
      
      * Address PR feedback
      
      * switch to map to keep Xcode clang happy; it doesn't seem to have a hashing function for string
  2. 16 Nov, 2018 3 commits
  3. 15 Nov, 2018 4 commits
  4. 14 Nov, 2018 5 commits
  5. 13 Nov, 2018 5 commits
  6. 12 Nov, 2018 5 commits
  7. 11 Nov, 2018 2 commits
    • nvgpu softmax cuda version (#2014) · be9f031e
      Fenglei authored
      * add softmax cuda support
      
      * optimize block size
      
      * remove debug info
      
      * remove debug
      
      * style
      
      * remove unused
      
      * remove cudnn softmax
      
      * format
      
      * using nullptr
      
      * move helper, add test
      
      * fix style
      
      * using all_close_f
      
      * using kahansum
      
      * style
      
      * remove commented out code
    • add isfinite check for all_close (#2028) · 702d465a
      Fenglei authored
      * add isfinite check
      
      * style
      
      * output 5 diff and total diff
      
      * output limit of diff for all_close_f
      
      * fix bug
      
      * disable tests
      
      * remove failing unit test that does not make sense.
  8. 10 Nov, 2018 4 commits
  9. 09 Nov, 2018 11 commits