1. 30 Jun, 2018 1 commit
    • LoopKernel Collector (#1128) · 784735d6
      Nick Korovaiko authored
      * collector
      
      * keeping track of inputs; simplifying a merging strategy; adding LKGraph
      
      * LoopKernel Collector
      
      * address feedback
      
      * address feedback 2
      
      * address feedback 3
  2. 29 Jun, 2018 4 commits
  3. 28 Jun, 2018 8 commits
    • Reshape bias to 1D for CPU fusion of conv+bias bprop (#1151) · 1574031c
      Nishant Patel authored
      * Reshape bias to 1D for conv + bias bprop fusion
      
      * Reshape goe2 back to 2D before replacing
    • check cudnn version (#1175) · cf3e2992
      Fenglei authored
    • Support dimshuffle/transpose with MKLDNN (#1129) · 846f6bfe
      Nishant Patel authored
      * Reshape 4d
      
      * Support dimshuffles/transpose with MKLDNN
      
      * Addressing PR Feedback
      
      * Use Eigen for 3D dimshuffles
    • Added workspace for RNN fprop kernel (#1153) · d861ba32
      Pruthvi authored
      - fixes a segfault issue for GNMT model execution through ngraph-mxnet
    • working generate_adjoints (#1173) · aa36865c
      Matthew Brookhart authored
    • enable cudnn datatype support (#1122) · eef2b19d
      Fenglei authored
      * enable multi-datatype support for cuDNN; refactor binary ops using cuDNN
      
      * fix bugs
      
      * add tests that cuDNN does not support to the skip list
      
      * no int support in cuDNN for backward pooling
      
      * no GPU.dot_4d_5d_multi_axis_big_fp64_VERY_SLOW test anymore
      
      * clang format
      
      * throw if datatype is int8 or int32 for backward pooling
      
      * comments
      
      * fix list in unit_test.manifest
      
      * add type support for alpha, beta
      
      * fix bugs
      
      * datatype support for alpha, beta
      
      * missing ()
      
      * clang format
      
      * batchnorm backward bug fix
      
      * remove debug info
      
      * change member function name to snake case. remove comments
      
      * use nullptr instead of NULL
      
      * code style, use cuDNN everywhere in comments
      
      * add cudnn host parameters memory manager.
      
      * change name to allocate_by_datatype
      
      * compiled
      
      * debug
      
      * fix bug: use list instead of vector, since vector element addresses change each time it resizes (illustrated after this commit)
      
      * add CUDNN_DATA_UINT8 and CUDNN_DATA_UINT8x4
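      Note on the list-vs-vector fix above: this is a general C++ pitfall rather than anything cuDNN-specific. If host-side parameter values are handed out by address, storing them in a std::vector means any reallocation on growth can move them, while std::list nodes keep stable addresses. A minimal standalone sketch (not nGraph code) of the difference:

        #include <cstdint>
        #include <iostream>
        #include <list>
        #include <vector>

        int main()
        {
            // Record the address of the first vector element, then grow the vector.
            std::vector<int> vec{1};
            auto vec_addr = reinterpret_cast<std::uintptr_t>(&vec.front());
            for (int i = 0; i < 1000; i++)
                vec.push_back(i); // may reallocate and move all elements

            // Record the address of the first list node's value, then grow the list.
            std::list<int> lst{1};
            auto lst_addr = reinterpret_cast<std::uintptr_t>(&lst.front());
            for (int i = 0; i < 1000; i++)
                lst.push_back(i); // nodes are allocated individually and never move

            std::cout << std::boolalpha
                      << "vector element moved: "
                      << (vec_addr != reinterpret_cast<std::uintptr_t>(&vec.front())) << "\n"
                      << "list element moved:   "
                      << (lst_addr != reinterpret_cast<std::uintptr_t>(&lst.front())) << "\n";
        }

      Any pointer handed to a long-lived descriptor therefore stays valid only with list-style (node-stable) storage, which is presumably what motivated the switch.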
    • constant broadcast folding (#1139) · 35b04e6a
      Adam Straw authored
      * constant broadcast folding (see the sketch after this commit)
      
      * code review feedback
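      A rough sketch of what "constant broadcast folding" means here: a Broadcast op whose input is already a Constant can be replaced at compile time with a Constant that holds the broadcast values, so no broadcast runs at execution time. The types below are hypothetical stand-ins, not the nGraph pass itself:

        #include <cstddef>
        #include <functional>
        #include <numeric>
        #include <vector>

        struct FoldedConstant
        {
            std::vector<std::size_t> shape; // target (broadcast) shape
            std::vector<float> values;      // materialized element values
        };

        // Fold broadcast(scalar, target_shape) into a single materialized constant.
        FoldedConstant fold_scalar_broadcast(float scalar,
                                             const std::vector<std::size_t>& target_shape)
        {
            std::size_t count = std::accumulate(target_shape.begin(),
                                                target_shape.end(),
                                                std::size_t{1},
                                                std::multiplies<std::size_t>());
            return FoldedConstant{target_shape, std::vector<float>(count, scalar)};
        }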
    • Add extra hash parameters to broadcast and max pool (#1163) · 13f00048
      Chris Sullivan authored
      * Move maxpool and avgpool into CudaKernelBuilder and add cache parameters to the broadcast kernel name, which are required for correct lookup (see the sketch after this commit).
      
      * Styling.
      
      * Add space before avg_pool.
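      The cache-parameter point above reflects a common pattern in JIT kernel caches: if compiled kernels are cached by name alone, two broadcasts that differ only in shape or axes collide and the wrong kernel is returned, so every codegen-affecting parameter has to be folded into the key. The helper below is a generic illustration with made-up names, not the nGraph GPU backend API:

        #include <cstddef>
        #include <sstream>
        #include <string>
        #include <vector>

        // Build a cache key that includes everything the generated kernel depends on.
        std::string broadcast_cache_key(const std::string& dtype,
                                        const std::vector<std::size_t>& result_shape,
                                        const std::vector<std::size_t>& broadcast_axes)
        {
            std::ostringstream key;
            key << "broadcast_" << dtype;
            for (std::size_t d : result_shape)
                key << "_s" << d; // fold the result shape into the key
            for (std::size_t a : broadcast_axes)
                key << "_a" << a; // fold the broadcast axes into the key
            return key.str();
        }

      Looking such a key up in a map of compiled kernels can never confuse two broadcasts that would generate different code.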
  4. 27 Jun, 2018 5 commits
  5. 26 Jun, 2018 10 commits
  6. 25 Jun, 2018 4 commits
  7. 23 Jun, 2018 1 commit
  8. 22 Jun, 2018 2 commits
  9. 21 Jun, 2018 2 commits
  10. 20 Jun, 2018 3 commits