1. 28 Jun, 2018 6 commits
    • Support dimshuffle/transpose with MKLDNN (#1129) · 846f6bfe
      Nishant Patel authored
      * Reshape 4d
      
      * Support dimshuffles/transpose with MKLDNN
      
      * Addressing PR Feedback
      
      * Use Eigen for 3D dimshuffles
    • - Added workspace for rnn fprop kernel (#1153) · d861ba32
      Pruthvi authored
      - fixes segfault issue for GNMT model execution through ngraph-mxnet
    • working generate_adjoints (#1173) · aa36865c
      Matthew Brookhart authored
    • enable cudnn datatype support (#1122) · eef2b19d
      Fenglei authored
      * enable multi-datatype support for cuDNN. refactor binary ops using cuDNN
      
      * fix bugs
      
      * add tests to skip list that CUDNN does not support
      
      * no int support in cuDNN for backward pooling
      
      * no GPU.dot_4d_5d_multi_axis_big_fp64_VERY_SLOW test anymore
      
      * clang format
      
      * throw if datatype is int8 or int32 for backward pooling
      
      * comments
      
      * fix list in unit_test.manifest
      
      * add type support for alpha, beta
      
      * fix bugs
      
      * datatype support for alpha, beta
      
      * missing ()
      
      * clang format
      
      * batchnorm backward bug fix
      
      * remove debug info
      
      * change member function name to snake case. remove comments
      
      * use nullptr instead of NULL
      
      * code style, use cuDNN everywhere in comments
      
      * add cudnn host parameters memory manager.
      
      * change name to allocate_by_datatype
      
      * compiled
      
      * debug
      
      * fix bug: use list instead of vector, because a vector's element addresses can change whenever it reallocates on resize
      
      * add CUDNN_DATA_UINT8 and CUDNN_DATA_UINT8x4
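The list-versus-vector fix noted in the commit above can be illustrated with a small sketch. This is not the code from the commit: the function names and the `float` element type are hypothetical, chosen only to show why a pointer into a `std::vector` stops being valid once the vector grows, while a pointer into a `std::list` element stays valid.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <list>
#include <vector>

// Hypothetical helper: returns true if the first element of a std::vector
// is still at the same address after `extra` more elements are appended.
bool vector_first_element_stable(std::size_t extra)
{
    std::vector<float> params;
    params.push_back(1.0f);
    // Record the address as an integer so the stale pointer is never used.
    const auto before = reinterpret_cast<std::uintptr_t>(&params.front());
    for (std::size_t i = 0; i < extra; ++i)
    {
        params.push_back(0.0f); // may reallocate and move every element
    }
    const auto after = reinterpret_cast<std::uintptr_t>(&params.front());
    return before == after;
}

// Same experiment with std::list, whose nodes are allocated individually
// and are never moved by later insertions.
bool list_first_element_stable(std::size_t extra)
{
    std::list<float> params;
    params.push_back(1.0f);
    const auto before = reinterpret_cast<std::uintptr_t>(&params.front());
    for (std::size_t i = 0; i < extra; ++i)
    {
        params.push_back(0.0f); // allocates a new node, existing nodes stay put
    }
    const auto after = reinterpret_cast<std::uintptr_t>(&params.front());
    return before == after;
}
```

Under these assumptions, a manager that hands out pointers to host-side parameters would need node-based storage such as `std::list` (or a `std::vector` reserved to its final size up front) for those pointers to remain usable.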
    • constant broadcast folding (#1139) · 35b04e6a
      Adam Straw authored
      * constant broadcast folding
      
      * code review feedback
    • Add extra hash parameters to broadcast and max pool (#1163) · 13f00048
      Chris Sullivan authored
      * Move maxpool and avgpool into CudaKernelBuilder and add cache parameters to kernel name for broadcast which are required for correct lookup.
      
      * Styling.
      
      * Add space before avg_pool.
  2. 27 Jun, 2018 5 commits
  3. 26 Jun, 2018 10 commits
  4. 25 Jun, 2018 4 commits
  5. 23 Jun, 2018 1 commit
  6. 22 Jun, 2018 2 commits
  7. 21 Jun, 2018 2 commits
  8. 20 Jun, 2018 3 commits
  9. 19 Jun, 2018 4 commits
    • add check to make sure we don't replace unreachable nodes (#1039) · 85f04dfb
      Nick Korovaiko authored
      * add assert to make sure we don't replace unreachable nodes
      
      * fix unittest failures
      
      * sparsity fix
    • Bob/cmake (#1118) · 4847b2de
      Robert Kimball authored
      * fix mkldnn rpath
      
      * fix compile warning
      
      * close backends when exiting
      
      * set the output directory of backends to the ngraph output directory
      
      * Aprocter/patch patch (#1119)
      
      * Move more rpath stuff inside if(NOT APPLE)
      
      * fix repatch problem with mkldnn library
      
      * add updated patch command for older versions of cmake
    • Loop Kernel Op + Tests (#1028) · 96295aaa
      Nick Korovaiko authored
      * loop kernel + tests
      
      * remove commented out code
      
      * remove commented code; add comments
      
      * copy_with_new_args +test
      
      * add comment
      
      * fix comp errors
    • Minor bug fix in function outlining (#1056) · 5203a301
      Jayaram Bobba authored
      * Move to depth-first serialization of graph for better cache behavior
      
      * Added comment
      
      * Force 64 byte stack alignment to avoid crashes from unaligned AVX loads/stores
      
      * Revert "Force 64 byte stack alignment to avoid crashes from unaligned AVX loads/stores"
      
      This reverts commit 84346420fbd0fbd5d05a4a1e8f5fae12bdc7348b.
      
      * revert to breadth-first serialization
  10. 18 Jun, 2018 3 commits