1. 13 Jul, 2018 4 commits
    • Backend/API: Implementation of ADD and MUL operations in the compile() (#1200) · 2c345798
      shssf authored
      * Backend/API: Implementation of ADD and MUL operations in the compile method for IntelGPU
      
      * Branch merge conflicts resolved
      
      * Parameters number check moved to function. RESULT operation handling added.
    • Louis Feng's avatar
      268853d0
    • Fix incorrect hash strings for softmax and 1d maxpool. (#1195) · 4659d60d
      Chris Sullivan authored
      * Bug fix in softmax cache parameters.
      
      * Additional bug fix for maxpool1d cache parameters.
      
      * Formatting.
      
      * Use nthreads in primitive hash.
    • gpu reshape optimization (#1174) · b5e69eaa
      Fenglei authored
      * add gpu_timer to external function
      
      * compiled version
      
      * working version
      
      * using block_begin and block_end
      
      * add the missing ';'
      
      * move slice to cuda emitter
      
      * change size_t to uint32_t in kernel
      
      * working version
      
      * change block size from 1 to 64
      
      * fix bugs
      
      * nthreads needs to be size_t in broadcast op
      
      * add rank to kernel name hash
      
      * change reshape to cuda_emitter
      
      * fix bugs
      
      * bug, remove rank from kernel
      
      * clang format
      
      * update slice in convolution
      
      * resolve index conflict
      
      * change align to align_to_blocksize, add overflow check
      
      * add grid size check and fix pool merge bug
      
      * code style, change names
      
      * fix merge conflict
      
      * change kernel_runner to kernel_launch
  2. 12 Jul, 2018 4 commits
  3. 11 Jul, 2018 2 commits
  4. 10 Jul, 2018 1 commit
  5. 09 Jul, 2018 4 commits
  6. 08 Jul, 2018 2 commits
  7. 07 Jul, 2018 4 commits
  8. 06 Jul, 2018 4 commits
  9. 05 Jul, 2018 4 commits
  10. 04 Jul, 2018 1 commit
  11. 03 Jul, 2018 6 commits
  12. 02 Jul, 2018 4 commits
    • move sigmoid to core fusion (#1132) · d05b5e39
      Sandeep authored
      * declare sigmoid for core fusion
      
      * add simple test for sigmoid
      
      * info fusion status
      
      * cp op as main op
      
      * builds as expected
      
      * move sigmoid fusion code
      
      * add reference kernel
      
      * sigmoid bprop reference kernel and clang-format
      
      * add delta to bprop
      
      * fprop called
      
      * compiles bprop
      
      * move tests
      
      * serializer support
      
      * address comments in code
      
      * add doc
      
      * naming similar to core ops
      
      * fix failing test
      
      * fix failing test
      
      * address clang issue
      
      * more changes
      
      * change test macro
    • L.S. Cook's avatar
      18e58ea9
    • MKLDNN BoundedRelu implementation for Relu6 (#1179) · eaa6091c
      Pruthvi authored
      * 1. Added MKLDNN BoundedRelu op support for Relu6
      2. CpuLayout && CPU assignment pass for BoundedRelu Op
      3. Unit test: interpreter vs. CPU for BoundedReluOp
      4. MKLDNN and default emitter code for BoundedReluOp
      
      * Removed Debug prints
      
      * 1. Added support for boundedrelu to work on any constant literal
      2. unit test case for rank2, rank3, rank4 for bounded relu without serialized graph
      
      * Removed is_six() method
    • Conv+bias shape check for better error detection (#1176) · e42e5815
      Louis Feng authored
      * Reshape bias to 1D for conv + bias bprop fusion
      
      * Reshape goe2 back to 2D before replacing
      
      * added shape checks to validate conv+bias op.
      
      * removed conv+bias backprop merge for separate PR review.
      
      * fixed conv_bias_bprop test.
      
      * minor changes to error messages.