1. 08 Mar, 2018 14 commits
    • add sign op, fix constant bug · dd5a6769
      fenglei.tian authored
    • clang format · b5414ba5
      fenglei.tian authored
    • Merge branch 'tfl/gpu_emitter_template' of… · 6204a154
      Fenglei Tian authored
      Merge branch 'tfl/gpu_emitter_template' of github.com:NervanaSystems/private-ngraph-cpp into tfl/gpu_emitter_template
    • namespace · 2e94fe52
      Fenglei Tian authored
    • clang format · d7039a34
      fenglei.tian authored
    • resolve conflict · 5b5bb51b
      Fenglei Tian authored
    • merge master and resolve conflict · 0a77e3d9
      Fenglei Tian authored
    • Abstraction for GPU unary elementwise ops (#587) · 529362b5
      Chris Sullivan authored
      * straightforward gpu.cos implementation following previous patterns prior to refactor
      
      * Generalized unary elementwise gpu op impl. New unary elementwise ops can
      be added to the type annotations in gpu_cuda_kernel_ops.hpp. Next step
      is to refactor the llvm interface in gpu_emitters.hpp for similar generality.
      
      * Added gpu_emitter.hpp:EmitUnaryElementwise.
      
      Function adds cuda kernel based on ngraph::op::op_type::description.
      This can service all unary elementwise ops run on the gpu.
      
      * The following elementwise unary ops now use the EmitUnaryElementwise emitter:
      * GPU.abs
      * GPU.acos
      * GPU.asin
      * GPU.atan
      * GPU.ceiling
      * GPU.cos
      * GPU.cosh
      * GPU.exp
      * GPU.floor
      * GPU.log
      * GPU.not
      * GPU.sign
      * GPU.sin
      * GPU.sinh
      * GPU.tan
      * GPU.tanh
      Unary elementwise ops Sign and Not need extra consideration.
      
      * tanh test changed to test::all_close for fp comparison (also done for tan in commit 65fa7c6de34c8277fe2a4801644f6bb64574f4ff).
      
      * GPU backend skips added for recent softmax test and updated aliased output test that uses op::Constant.
      
      * code format update
      
      * changed cuda builder interface names to unary/binary/arbitrary, added impl. note to gpu_cuda_kernel_ops, cleaned code format
      
      * updated ngraph-cpp reference
      
      * Fixing incorrect github conflict resolution.
      
      * Added GPU emitter for op::Result.
      For now it simply copies the output tensor.
      
      All but 3 tests now pass. The remaining
      failing tests are:
      * GPU.dot_0_0
      * GPU.dot_matrix_2x0_0x2
      * GPU.dot_2x0_0
      
      * Removed call to handle memory aliasing in gpu_external_function.
      
      * fix gpu emitter bug that will return in the middle of function
      
      * Merge pull request #609 from NervanaSystems/tfl/fix_return_bug
      
      fix gpu emitter bug that will return in the middle of function
      
      * GPU backend skips added for recent softmax test and updated aliased output test that uses op::Constant.
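
The generalized unary elementwise emitter described in this commit can be illustrated with a minimal sketch. Everything named here is hypothetical: `Abs`, `Cos`, `Tanh`, `CudaOpName`, and `emit_unary_elementwise_kernel` are stand-ins chosen for illustration, not the actual contents of gpu_cuda_kernel_ops.hpp or gpu_emitter.hpp. The sketch assumes float tensors and that each op maps to a single CUDA math function, which is presumably why Sign and Not need extra consideration.

```cpp
#include <sstream>
#include <string>

// Hypothetical stand-ins for ngraph op types (not the real ngraph classes).
struct Abs {};
struct Cos {};
struct Tanh {};

// Type annotation: maps an op type to the CUDA math function that implements it.
// The primary template is left undefined, so an unannotated op fails to compile.
template <typename Op>
struct CudaOpName;

template <>
struct CudaOpName<Abs>
{
    static const char* name() { return "fabsf"; }
};

template <>
struct CudaOpName<Cos>
{
    static const char* name() { return "cosf"; }
};

template <>
struct CudaOpName<Tanh>
{
    static const char* name() { return "tanhf"; }
};

// Builds CUDA source text for a one-thread-per-element kernel that applies
// the annotated math function. Only the function name differs between ops.
template <typename Op>
std::string emit_unary_elementwise_kernel(const std::string& kernel_name)
{
    std::ostringstream src;
    src << "extern \"C\" __global__ void " << kernel_name
        << "(const float* in, float* out, size_t n)\n"
        << "{\n"
        << "    size_t tid = blockIdx.x * blockDim.x + threadIdx.x;\n"
        << "    if (tid < n)\n"
        << "    {\n"
        << "        out[tid] = " << CudaOpName<Op>::name() << "(in[tid]);\n"
        << "    }\n"
        << "}\n";
    return src.str();
}
```

Under these assumptions, supporting another unary op only requires one more `CudaOpName` specialization; `emit_unary_elementwise_kernel<Cos>("cuda_cos")` returns source text that a runtime compiler could then build and launch.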
    • remove unused variable · 998d7c6b
      fenglei.tian authored
    • fix merge bug and apply clang format · 2e295d27
      fenglei.tian authored
    • resolve conflict when merge master · 809dda4f
      Fenglei Tian authored
    • Merge pull request #599 from NervanaSystems/tfl/gpu_fix_constant_bug · a02aab01
      Fenglei authored
      Fix constant bug on GPU
    • Robert Kimball's avatar
      61fa9d55
    • GPU op::Result implementation (#611) · 905cafd2
      Chris Sullivan authored
      * Added GPU emitter for op::Result.
      For now it simply copies the output tensor.
      
      All but 3 tests now pass. The remaining
      failing tests are:
      * GPU.dot_0_0
      * GPU.dot_matrix_2x0_0x2
      * GPU.dot_2x0_0
      
      * Removed call to handle memory aliasing in gpu_external_function.
      
      * fix gpu emitter bug that will return in the middle of function
      
      * Merge pull request #609 from NervanaSystems/tfl/fix_return_bug
      
      fix gpu emitter bug that will return in the middle of function
      
      * GPU backend skips added for recent softmax test and updated aliased output test that uses op::Constant.
  2. 07 Mar, 2018 10 commits
  3. 06 Mar, 2018 13 commits
  4. 05 Mar, 2018 3 commits