Commits · a7c5eb01b9c4e911da3d3b2008b9b135bf5ac1a4 · submodule / ngraph

05 Jul, 2018 3 commits
- make logical ops input type aware (#1203) · a7c5eb01
  Nick Korovaiko authored Jul 05, 2018
  
  a7c5eb01
- fix bugs in align_to_block_size function (#1191) · f1ebcd3e
  Fenglei authored Jul 05, 2018
```
* extra *block_size

* change grid_size to threads
```
  f1ebcd3e
- fix namespace error in macro (#1194) · af956916
  Yixing Lao authored Jul 05, 2018
  
  af956916
04 Jul, 2018 1 commit
- [ONNX] add 'Add' operator (#1192) · 15d743f1
  Artur Wojcik authored Jul 04, 2018
  
  15d743f1
03 Jul, 2018 6 commits
- Update documentation link to new ngraph-tf (#1185) · 08cabb12
  Adam Procter authored Jul 03, 2018
  
  08cabb12
- Batch dot operation for rank 3 multiply with rank 2 tensors (#1180) · 238ce788
  Louis Feng authored Jul 03, 2018
```
* hacking to support dot of 3 by 2 inputs with gemm_batch.

* clean up.
```
  238ce788
- nbench cleanup (#1183) · 9d09c7e5
  Robert Kimball authored Jul 03, 2018
```
* nbench cleanup

* update style
```
  9d09c7e5
- TF-flavoured group convolution (#1182) · b6bc86bf
  Nick Korovaiko authored Jul 03, 2018
```
* tf group convolution

* change perms
```
  b6bc86bf
- [Py] API helper function broadcast_to (#1170) · 2fc0bbb4
  tsocha authored Jul 03, 2018
  
  2fc0bbb4
- onnx [2]: add core wrappers (#1169) · c086eb2d
  Artur Wojcik authored Jul 03, 2018
```
* onnx: add core wrappers
Signed-off-by: Artur Wojcik <artur.wojcik@intel.com>

* onnx: add '\n' at end of files
Signed-off-by: Artur Wojcik <artur.wojcik@intel.com>

* onnx: fix compilation with clang
Signed-off-by: Artur Wojcik <artur.wojcik@intel.com>

* onnx: fix code style
Signed-off-by: Artur Wojcik <artur.wojcik@intel.com>
```
  c086eb2d
02 Jul, 2018 5 commits

move sigmoid to core fusion (#1132) · d05b5e39

Sandeep authored Jul 02, 2018

* declare sigmoid for core fusion

* add simple test for sigmoid

* info fusion status

* cp op as main op

* builds as expected

* move sigmoid fusion code

* add reference kernel

* sigmoid bprop reference kernel and clang-format

* add delta to bprop

* fprop called

* compiles bprop

* move tests

* serializer support

* address comments in code

* add doc

* naming similar to core ops

* fix failing test

* fix failing test

* address clang issue

* more changes

* change test macro

d05b5e39

include additional dependency for clang format (#1181) · 18e58ea9
L.S. Cook authored Jul 02, 2018

18e58ea9

MKLDNN BoundedRelu implementation for Relu6 (#1179) · eaa6091c

Pruthvi authored Jul 02, 2018

* 1. Added MKLDNNN BoundedRelu op support for Relu6
2. CpuLayout && CPU assignment pass for BoundedRelu Op
3. Unit test inter v/s CPU for BoundedReluOp
4. MKLDNN and default emitter code for BoundedReluOp

* Removed Debug prints

* 1. Added support for boundedrelu to work on any constant literal
2. unit test case for rank2, rank3, rank4 for bounded relu without serialized graph

* Removed is_six() method

eaa6091c

Conv+bias shape check for better error detection (#1176) · e42e5815

Louis Feng authored Jul 02, 2018

* Reshape bias to 1D for conv + bias bprop fusion

* Reshape goe2 back to 2D before replacing

* added shape checks to validate conv+bias op.

* removed conv+bias backprop merge for separate PR review.

* fixed conv_bias_bprop test.

* minor changes to error messages.

e42e5815

gpu slice optimization (#1172) · f243d035

Fenglei authored Jul 02, 2018

* add gpu_timer to external function

* compiled version

* working version

* using block_begin and block_end

* add the missing '
;'

* move slice to cuda emiter

* change size_t to uint32_t in kernel

* working version

* change block size from 1 to 64

* fix bugs

* nthreads need to be size_t in broadcast op

* add rank to kernel name hash

* update slice in convolution

* resolve index conflict

* change align to align_to_blocksize, add overflow check

* add gird size check and fix pool merge bug

* code style, change names

f243d035

30 Jun, 2018 2 commits

Pruthvi/fix rnn output (#1135) · c4c24cb0

Pruthvi authored Jun 30, 2018

* - Fixed replace output for the multi layer recurrent cell state tensor output
- Modified rnn add_output to consider direction and n_layer while calculating the output size for mkldnn dst_layer and dst_iter

* fix unit test failure

c4c24cb0

LoopKernel Collector (#1128) · 784735d6

Nick Korovaiko authored Jun 30, 2018

* collector

* keeping track of inputs; simplifying a merging stratey; adding LKGraph

* LoopKernel Collector

* address feedback

* address feedback 2

* address feedback 3

784735d6

29 Jun, 2018 4 commits

Customizable handler for logger function (#1177) · 47ad79fd
Yixing Lao authored Jun 29, 2018
```
* add lambda handler support for logger
* reuse logger function
```
47ad79fd

Nd convolution via blocked GEMM for C{d1,...,dn}N layout (#1131) · ae45c984

Chris Sullivan authored Jun 29, 2018

* Added blank convolution kernel and refactored coordinate transform kernel helper.

* Added op::Reshape to the CUDAEmitter.

* Added 2-Nd tiled convolution.

* Bug fixes with data_dilation and filter loop. Still need to add test for coverage of register tiling.

* Styling.

* Removed some comments and code added for testing.

* Some tests became enabled in merge, removing them.

ae45c984

IntelGPUBackend: create_tensor functionality implementation with Intel clDNN (#1168) · 3a43bdac
shssf authored Jun 29, 2018
```
* IntelGPUBackend: create_tensor

* 9 tests are passes. List updated
```
3a43bdac
workaround for depthwise convolution (#1178) · 09adba0c
Nick Korovaiko authored Jun 29, 2018
```
* workaround for depthwise convolution

* fixe error msg
```
09adba0c

28 Jun, 2018 8 commits

Reshape bias to 1D for cpufusion of conv+bias bprop (#1151) · 1574031c
Nishant Patel authored Jun 28, 2018
```
* Reshape bias to 1D for conv + bias bprop fusion

* Reshape goe2 back to 2D before replacing
```
1574031c
check cudnn version (#1175) · cf3e2992
Fenglei authored Jun 28, 2018

cf3e2992

Support dimshuffle/transpose with MKLDNN (#1129) · 846f6bfe

Nishant Patel authored Jun 28, 2018

* Reshape 4d

* Support dimshuffles/transpose with MKLDNN

* Addressing PR Feedback

* Use Eigen for 3D dimshuffles

846f6bfe

- Added workspace for rnn fprop kernel (#1153) · d861ba32
Pruthvi authored Jun 28, 2018
```
- fixes segfault issue for GNMT model execution through ngraph-mxnet
```
d861ba32
working generate_adjoints (#1173) · aa36865c
Matthew Brookhart authored Jun 28, 2018

aa36865c

enable cudnn datatype support (#1122) · eef2b19d

Fenglei authored Jun 28, 2018

* enable multi datatpye support for Cudnn. refactor binary ops using cudnn

* fix bugs

* add tests to skip list that CUDNN does not support

* not int support on cudnn for backward pooling

* no GPU.dot_4d_5d_multi_axis_big_fp64_VERY_SLOW test anymore

* clang format

* throw if datatype is int8 or int32 for backward pooling

* comments

* fix list in unit_test.manifest

* add type support for alpha, beta

* fix bugs

* datatype support for alpha, beta

* missing ()

* clang format

* batchnorm backward bug fix

* remove debug info

* change member function name to snake case. remove comments

* use nullptr instead of NULL

* code style, use cuDNN everywhere in comments

* add cudnn host parameters memory manager.

* change name to allocate_by_datatype

* compiled

* debug

* fix bug: using list instead of vector, vector address will change each time it resize

* add CUDNN_DATA_UINT8 and CUDNN_DATA_UINT8x4

eef2b19d

constant broadcast folding (#1139) · 35b04e6a
Adam Straw authored Jun 28, 2018
```
* constant broadcast folding

* code review feedback
```
35b04e6a

Add extra hash parameters to broadcast and max pool (#1163) · 13f00048

Chris Sullivan authored Jun 28, 2018

* Move maxpool and avgpool into CudaKernelBuilder and add cache parameters to kernel name for broadcast which are required for correct lookup.

* Styling.

* Add space before avg_pool.

13f00048

27 Jun, 2018 5 commits

add gpu timer (#1143) · b69f0734

Fenglei authored Jun 27, 2018

* add gpu_timer to external function

* compiled version

* working version

* using block_begin and block_end

* add the missing '
;'

b69f0734

get_output_elements (#1154) · 4db318a3
Nick Korovaiko authored Jun 27, 2018
```
* get_get_output_elements

* fix comp error

* address scott's feedback
```
4db318a3
Properly setting OC for Group Convolution (#1161) · f7a34a02
Nick Korovaiko authored Jun 27, 2018
```
* group conv fix

* group conv fix

* fix typo
```
f7a34a02

MKLDNN Softmax (#1113) · bb06c80b

Pruthvi authored Jun 27, 2018

* 1. Added mkldnn support for Softmax
2. layout assignment for mkldnn softmax

* added assert to check softmax axis for mkldnn

bb06c80b

onnx [1]: add importer cmakes (#1145) · b3f0a474

Artur Wojcik authored Jun 27, 2018

* onnx: add importer cmakes
* onnx: use file(DOWNLOAD ...) command to download onnx.proto
* onnx: add Protobuf minimal required version

b3f0a474

26 Jun, 2018 6 commits
- remove unused file (#1159) · e4db82ec
  Robert Kimball authored Jun 26, 2018
  
  e4db82ec
- remove debug code (#1158) · 2c71cffe
  Robert Kimball authored Jun 26, 2018
  
  2c71cffe
- make sure ngraph name is correct (#1157) · 2f9faecd
  Robert Kimball authored Jun 26, 2018
  
  2f9faecd
- Updates towards building on windows native (#1156) · ed112464
  Robert Kimball authored Jun 26, 2018
```
* cmake runs for interpreter

* more updates towards building on windows
```
  ed112464
- Convolution sum fusion (#1146) · 82ee0a77
  Jayaram Bobba authored Jun 26, 2018
```
* inplace compute

* fix warnings

* Initial support for convolution sum fusion

* Added in-place support for conv sum fusion and test cases

* reverting spurious changes

* Bug fix to account for inplace input in conv sum fusion

* fix compilation error

* Addressed PR feedback
```
  82ee0a77
- use empty consistently instead of size == 0 checks (#1126) · f7069237
  Nick Korovaiko authored Jun 26, 2018
  
  f7069237