Commits · 846f6bfea8e4df64eb4cae8805bb1f63b52ebe89 · submodule / ngraph

28 Jun, 2018 6 commits

Support dimshuffle/transpose with MKLDNN (#1129) · 846f6bfe

Nishant Patel authored Jun 28, 2018

* Reshape 4d

* Support dimshuffles/transpose with MKLDNN

* Addressing PR Feedback

* Use Eigen for 3D dimshuffles

846f6bfe

- Added workspace for rnn fprop kernel (#1153) · d861ba32
Pruthvi authored Jun 28, 2018
```
- fixes segfault issue for GNMT model execution through ngraph-mxnet
```
d861ba32
working generate_adjoints (#1173) · aa36865c
Matthew Brookhart authored Jun 28, 2018

aa36865c

enable cudnn datatype support (#1122) · eef2b19d

Fenglei authored Jun 28, 2018

* enable multi datatype support for cuDNN. refactor binary ops using cuDNN

* fix bugs

* add tests to skip list that CUDNN does not support

* no int support on cuDNN for backward pooling

* no GPU.dot_4d_5d_multi_axis_big_fp64_VERY_SLOW test anymore

* clang format

* throw if datatype is int8 or int32 for backward pooling

* comments

* fix list in unit_test.manifest

* add type support for alpha, beta

* fix bugs

* datatype support for alpha, beta

* missing ()

* clang format

* batchnorm backward bug fix

* remove debug info

* change member function name to snake case. remove comments

* use nullptr instead of NULL

* code style, use cuDNN everywhere in comments

* add cudnn host parameters memory manager.

* change name to allocate_by_datatype

* compiled

* debug

* fix bug: use list instead of vector, since a vector's element addresses change each time it resizes

* add CUDNN_DATA_UINT8 and CUDNN_DATA_UINT8x4

eef2b19d
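
The list-versus-vector fix mentioned in this commit rests on a general C++ container property, which the following minimal sketch illustrates. The helper names are hypothetical and are not taken from the nGraph sources; the sketch only shows that growing a `std::vector` past its capacity moves its elements, while `std::list` nodes stay where they are.

```cpp
#include <cstddef>
#include <cstdint>
#include <list>
#include <vector>

// Illustrative only: these helpers are hypothetical and not part of nGraph.

// Records the address of the first element, grows the vector past its
// current capacity (which forces a reallocation), and reports whether the
// first element still lives at the recorded address.
bool vector_address_survives_growth()
{
    std::vector<int> values{42};
    const auto before = reinterpret_cast<std::uintptr_t>(&values.front());
    const std::size_t old_capacity = values.capacity();
    while (values.size() <= old_capacity)
    {
        values.push_back(0);
    }
    const auto after = reinterpret_cast<std::uintptr_t>(&values.front());
    return before == after;
}

// Same experiment with std::list: appending never relocates existing nodes,
// so a pointer to the first element remains valid.
bool list_address_survives_growth()
{
    std::list<int> values{42};
    const int* before = &values.front();
    for (int i = 0; i < 1000; ++i)
    {
        values.push_back(i);
    }
    return before == &values.front() && *before == 42;
}
```

Under these assumptions, any code that hands out pointers to stored host parameters would need a node-based container (or a pre-reserved vector) to keep those pointers valid.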

constant broadcast folding (#1139) · 35b04e6a
Adam Straw authored Jun 28, 2018
```
* constant broadcast folding

* code review feedback
```
35b04e6a
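
As a rough illustration of what folding a broadcast of a constant means, the sketch below materializes the broadcast result once, so that it could stand in for the original constant-plus-broadcast pair. The function `fold_broadcast_rows` is a hypothetical helper written for this example; it is not the pass added in this commit and does not use the nGraph API.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical helper, not nGraph's API: broadcasts a constant of shape {n}
// along a new leading axis of size `rows`, returning the folded constant of
// shape {rows, n} in row-major order.
std::vector<float> fold_broadcast_rows(const std::vector<float>& constant,
                                       std::size_t rows)
{
    std::vector<float> folded;
    folded.reserve(constant.size() * rows);
    for (std::size_t r = 0; r < rows; ++r)
    {
        // Each output row is a copy of the original constant.
        folded.insert(folded.end(), constant.begin(), constant.end());
    }
    return folded;
}
```

For example, calling `fold_broadcast_rows({1.0f, 2.0f}, 3)` yields the six values of a 3x2 constant whose rows are all `{1, 2}`.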

Add extra hash parameters to broadcast and max pool (#1163) · 13f00048

Chris Sullivan authored Jun 28, 2018

* Move maxpool and avgpool into CudaKernelBuilder and add cache parameters to kernel name for broadcast which are required for correct lookup.

* Styling.

* Add space before avg_pool.

13f00048

27 Jun, 2018 5 commits

add gpu timer (#1143) · b69f0734

Fenglei authored Jun 27, 2018

* add gpu_timer to external function

* compiled version

* working version

* using block_begin and block_end

* add the missing ';'

b69f0734

get_output_elements (#1154) · 4db318a3
Nick Korovaiko authored Jun 27, 2018
```
* get_output_elements

* fix comp error

* address scott's feedback
```
4db318a3
Properly setting OC for Group Convolution (#1161) · f7a34a02
Nick Korovaiko authored Jun 27, 2018
```
* group conv fix

* group conv fix

* fix typo
```
f7a34a02

MKLDNN Softmax (#1113) · bb06c80b

Pruthvi authored Jun 27, 2018

* 1. Added mkldnn support for Softmax
2. layout assignment for mkldnn softmax

* added assert to check softmax axis for mkldnn

bb06c80b

onnx [1]: add importer cmakes (#1145) · b3f0a474

Artur Wojcik authored Jun 27, 2018

* onnx: add importer cmakes
* onnx: use file(DOWNLOAD ...) command to download onnx.proto
* onnx: add Protobuf minimal required version

b3f0a474

26 Jun, 2018 10 commits

remove unused file (#1159) · e4db82ec
Robert Kimball authored Jun 26, 2018

e4db82ec
remove debug code (#1158) · 2c71cffe
Robert Kimball authored Jun 26, 2018

2c71cffe
make sure ngraph name is correct (#1157) · 2f9faecd
Robert Kimball authored Jun 26, 2018

2f9faecd
Updates towards building on windows native (#1156) · ed112464
Robert Kimball authored Jun 26, 2018
```
* cmake runs for interpreter

* more updates towards building on windows
```
ed112464

Convolution sum fusion (#1146) · 82ee0a77

Jayaram Bobba authored Jun 26, 2018

* inplace compute

* fix warnings

* Initial support for convolution sum fusion

* Added in-place support for conv sum fusion and test cases

* reverting spurious changes

* Bug fix to account for inplace input in conv sum fusion

* fix compilation error

* Addressed PR feedback

82ee0a77

use empty consistently instead of size == 0 checks (#1126) · f7069237
Nick Korovaiko authored Jun 26, 2018

f7069237
Backend/API: First IntelGPU backend based on clDNN with empty functions (#1116) · ab325ce6
shssf authored Jun 26, 2018
```
* First IntelGPU backend based on clDNN with empty functions

* Backend/API:Conflicts resolved and comments addressed
```
ab325ce6

Leona/patternmatchdoc (#1057) · a2732033

L.S. Cook authored Jun 26, 2018

* editing how to execute computation file for clarity and linenos

* Add placeholder for runtime docs

* Update section on backends, interpreter, and FPGA options

* add updated master to fix python_ci

* Weird autosummary issue reverted

* Clarify new section

* fix up docs

* Update pattern matcher doc based on Nik's presentation slides WIP

* Update doc structure and examples

* remove old folder

* Fix broken Tensorview refs

* . helping people document code more efficiently

* PR review edits

* Finish PR review comment fixes so far

* split patternmatcher PR

* small fixes to PM docs

* remove mark tags from source code

* Final PR cleanup edits

a2732033

Moved maxpool padding to GPUAllocator, changed pad_required bool to include… · 7758cf5d

Chris Sullivan authored Jun 26, 2018

Moved maxpool padding to GPUAllocator, changed pad_required bool to include asymmetric padding check, and removed an error in gpu_emitter where allocation was happening twice for temporary memory (merge failure). (#1152)

7758cf5d

OS X support (#1098) · 5395a378

Igor Kaplounenko authored Jun 26, 2018

* updated to work with llvm 8.1 that tensorflow is built with

* sane extensions on the mac

* not doing rpath on apple

* apply style

5395a378

25 Jun, 2018 4 commits

check if file exists in mnist_loader (#1127) · 009e5bb1
Nick Korovaiko authored Jun 25, 2018

009e5bb1

inplace compute (#1141) · 88aa9e9c

Nick Korovaiko authored Jun 25, 2018

* inplace compute

* fix warnings

* address bob's feedback

* bob's feedback 2

* bobs feedback 3

* address bob's feedback 4

88aa9e9c

Fix build for MacOS (#1112) · e2e814e3

Robert Kimball authored Jun 25, 2018

* remove reference to ngraph core code from codegen. add stand-alone implementations of needed functions

* fixed potential pointer leak

* clean up file_util

* more file util cleanup, removing unused functions

* interpreter works on mac

* CPU and INTERPRETER build and pass unit tests on macos

* move get_directory to file_util

* cleanup

e2e814e3

Switch to using has_class for trivial op::Skip predicates (#1148) · d18a9faf
Nick Korovaiko authored Jun 25, 2018
```
* switch to using has_class for op::Skip

* apply format
```
d18a9faf

23 Jun, 2018 1 commit
- move is_unreachable to ngrapH_util.cpp (#1144) · 1ebf4e6a
  Nick Korovaiko authored Jun 23, 2018
  
  1ebf4e6a
22 Jun, 2018 2 commits
- replace maxpool + broadcast with broadcast of appropriate shapes (#1142) · f15877e2
  Nick Korovaiko authored Jun 22, 2018
  
  f15877e2
- refactor cache_prop to reuse bprop inputs (#1134) · 3b49dd1a
  Matthew Brookhart authored Jun 22, 2018
  
  3b49dd1a
21 Jun, 2018 2 commits
- Constant folding for Reshapes (#1130) · b9a77a9d
  Adam Straw authored Jun 21, 2018
```
* adding constant propagation pass

* adding test/constant_propagation.cpp

* template make_constant_reshape function

* code review feedback

* add missing files
```
  b9a77a9d
- remove dlclose in backend destructor as it is causing trouble with python bindings (#1133) · 9be92aae
  Robert Kimball authored Jun 21, 2018
  
  9be92aae
20 Jun, 2018 3 commits
- serialize logic for reverse_sequence (#1125) · 9d66d9a7
  Nick Korovaiko authored Jun 20, 2018
```
* serialize logic for reverse_sequence

* Added serializer support for Softmax
```
  9d66d9a7
- Fix two bugs with concat for 0-size tensors (#1120) · 22e783ff
  Adam Procter authored Jun 20, 2018
```
* Fix bug with concat for 0-size tensors

* Simplify test for zero-length axes, per PR comments
```
  22e783ff
- Boncheolgu/fix doc (#1123) · 9441ea0c
  Scott Cyphers authored Jun 20, 2018
```
* [doc] Fix code snippet in derive-for-training

* Fix another code snippet in derive-for-training
```
  9441ea0c
19 Jun, 2018 4 commits

add check to make sure we don't replace unreachable nodes (#1039) · 85f04dfb
Nick Korovaiko authored Jun 19, 2018
```
* add assert to make sure we don't replace unreachable nodes

* fix unittest failures

* sparsity fix
```
85f04dfb

Bob/cmake (#1118) · 4847b2de

Robert Kimball authored Jun 19, 2018

* fix mkldnn rpath

* fix compile warning

* close backends when exiting

* set output directory of backends to the ngraph output directory

* Aprocter/patch patch (#1119)

* Move more rpath stuff inside if(NOT APPLE)

* fix repatch problem with mkldnn library

* add updated patch command for older versions of cmake

4847b2de

Loop Kernel Op + Tests (#1028) · 96295aaa

Nick Korovaiko authored Jun 19, 2018

* loop kernel + tests

* remove commented out code

* remove commented code; add comments

* copy_with_new_args +test

* add comment

* fix comp errors

96295aaa

Minor bug fix in function outlining (#1056) · 5203a301

Jayaram Bobba authored Jun 19, 2018

* Move to depth-first serialization of graph for better cache behavior

* Added comment

* Force 64 byte stack alignment to avoid crashes from unaligned AVX loads/stores

* Revert "Force 64 byte stack alignment to avoid crashes from unaligned AVX loads/stores"

This reverts commit 84346420fbd0fbd5d05a4a1e8f5fae12bdc7348b.

* revert to breadth-first serialization

5203a301

18 Jun, 2018 3 commits
- Merge pull request #1108 from NervanaSystems/jmenon/dex2 · 4135f59d
  Jayaram Bobba authored Jun 18, 2018
```
DEX Part 2
```
  4135f59d
- Merge branch 'master' into jmenon/dex2 · b3c8b5ea
  Jayaram Bobba authored Jun 18, 2018
  
  b3c8b5ea
- making arg getters constant (#1121) · 291d927c
  Nick Korovaiko authored Jun 18, 2018
  
  291d927c