1. 07 Aug, 2018 1 commit
  2. 06 Aug, 2018 3 commits
  3. 05 Aug, 2018 4 commits
  4. 04 Aug, 2018 2 commits
  5. 03 Aug, 2018 15 commits
  6. 02 Aug, 2018 12 commits
  7. 01 Aug, 2018 3 commits
    • Louis Feng's avatar
      More efficient sum for some cases (#1251) · f8941a12
      Louis Feng authored
      * hacking to support dot of 3 by 2 inputs with gemm_batch.
      
      * clean up.
      
      * testing inplace reshape.
      
      * fixed a compile error.
      
      * added comments on todo.
      
      * check for output.
      
      * check for annotation.
      
      * more optimizations WIP.
      
      * sum simd.
      
      * moved parallel for
      
      * testing sum vectorization.
      
      * fixed merge errors.
      
      * sum wip.
      
      * more logic.
      
      * sum refactor and clean up.
      
      * clean up.
      
      * removed unrelated changes.
      
      * removed related changes from merge.
      
      * fixed clang compile errors.
      f8941a12
    • shssf's avatar
    • Fenglei's avatar
      move onehot and reverse op to cuda_emitter (#1266) · cb84305e
      Fenglei authored
      * move to cuda_emiiter
      
      * fix bug, clang format
      
      * size_t to uint32_t
      
      * reverse_axes
      
      * add rank back, clang format
      
      * remove unused code and file
      
      * remove unused code and file
      
      * manually merge with master
      cb84305e