Commits · 88b1ff33a0c0c1dcea1661aab25d9d9856609949 · submodule / ngraph

18 Jul, 2018 3 commits
- Pool tests updated to check all backends (#1245) · e2255fbd
  Robert Kimball authored 6 years ago
```
* make pool test check backends other than CPU

* more unit test cleanup
```
  e2255fbd
- Fix incorrect divide-by-zero test (#1243) · 7c7c5d62
  Jaikrishnan Menon authored 6 years ago
  
  7c7c5d62
- CPU Loop Kernel Fusion optimization (#1190) · e3ad1b31
  Nick Korovaiko authored 6 years ago
```
* cpu loop kernel fusion pass

*  remove extra code

* bounded relu test

* address scotts feedback
```
  e3ad1b31
17 Jul, 2018 1 commit

Added more convolution variants to DEX (#1223) · 9bb0b653

Jayaram Bobba authored 6 years ago

* CPU Direct Execution: Implement ConvertLayout and refactor

* CPU Direct Execution: Implement Convolution

* 1) Adds computation reuse to direct execution
2) Add avg_pool, broadcast and convolution_bias to direct execution
3) Moved some computation reuse utility functions to graph_utils

* Use lists instead of vectors to avoid reallocation overheads

* - Added convolution variants to direct execution
- Removed ConvolutionBiasRelu, use ConvolutionBias instead
- Reduced code duplication by moving functionality to mkldnn_emitter
  from cpu_emitter

* Style fix

* Moved mkldnn build_convolution to a templated method

* Style fix

* refactored mkldnn conv bprop builders

* Style fix

9bb0b653

14 Jul, 2018 1 commit
- move long building tests to the be the first tests built with the hope of… · cce0c224
  Robert Kimball authored 6 years ago
```
move long building tests to the be the first tests built with the hope of reducing build time. (#1229)
```
  cce0c224
13 Jul, 2018 1 commit
- get_subgraph_outputs (towards checking that intermediate nodes in a matched graph not used) (#1207) · 83e7dba5
  Nick Korovaiko authored 6 years ago
```
* get_subgraph_outputs

* simplify the condition
```
  83e7dba5
12 Jul, 2018 2 commits

Added reshape and broadcast to CSE (#1221) · cf568ef9

Louis Feng authored 6 years ago

* reshape inplace without copy data if possible.

* added reshape and broadcast to CSE.

* Fixed debug messages.

cf568ef9

Bob/backend list (#1220) · 8e1954d0

Robert Kimball authored 6 years ago

* open only the unversioned library but check that it is built against the correct version of ngraph

* review comments

8e1954d0

11 Jul, 2018 1 commit
- Disabeled RNN fusion pass in IA transformer (#1217) · 4cd2c602
  Pruthvi authored 6 years ago
  
  4cd2c602
09 Jul, 2018 2 commits

Liveness optimizations (#1210) · 0c721561

Robert Kimball authored 6 years ago

* Faster liveness.

Memory manager optimized for non-sharing of tensors.
Add pass manager profiler.

* Move pass profiler to a separate PR

* Move Memory Layout optimizations to a separate PR

* use find instead of count

0c721561

Cache functions so the backend does not need to recompile (#1209) · ffe3a631
Robert Kimball authored 6 years ago
```
* Cache some generated functions in backwards tests to speed performance

* more caching
```
ffe3a631

07 Jul, 2018 1 commit

New backend construction/destruction API (#1171) · ad4dd5b0

Robert Kimball authored 6 years ago

* complete the new backend construction/destruction API
* close each dlopen
* don't close libraries for now as it causes python to segfault

ad4dd5b0

06 Jul, 2018 2 commits

Use mkldnn reorder only for transpose/dimshuffles. (#1188) · 5be99c0a

Nishant Patel authored 6 years ago

* Usage of mkldnn reshape updated

* update reshape condition for mkldnn

* Add a test case and order in which conditions are checked

5be99c0a

Collect matched nodes (#1166) · e07637c0
Nick Korovaiko authored 6 years ago
```
* collect matched nodes

* clear m_matched_list

* tests

* address feedback
```
e07637c0

03 Jul, 2018 2 commits
- Batch dot operation for rank 3 multiply with rank 2 tensors (#1180) · 238ce788
  Louis Feng authored 6 years ago
```
* hacking to support dot of 3 by 2 inputs with gemm_batch.

* clean up.
```
  238ce788
- nbench cleanup (#1183) · 9d09c7e5
  Robert Kimball authored 6 years ago
```
* nbench cleanup

* update style
```
  9d09c7e5
02 Jul, 2018 3 commits

move sigmoid to core fusion (#1132) · d05b5e39

Sandeep authored 6 years ago

* declare sigmoid for core fusion

* add simple test for sigmoid

* info fusion status

* cp op as main op

* builds as expected

* move sigmoid fusion code

* add reference kernel

* sigmoid bprop reference kernel and clang-format

* add delta to bprop

* fprop called

* compiles bprop

* move tests

* serializer support

* address comments in code

* add doc

* naming similar to core ops

* fix failing test

* fix failing test

* address clang issue

* more changes

* change test macro

d05b5e39

MKLDNN BoundedRelu implementation for Relu6 (#1179) · eaa6091c

Pruthvi authored 6 years ago

* 1. Added MKLDNNN BoundedRelu op support for Relu6
2. CpuLayout && CPU assignment pass for BoundedRelu Op
3. Unit test inter v/s CPU for BoundedReluOp
4. MKLDNN and default emitter code for BoundedReluOp

* Removed Debug prints

* 1. Added support for boundedrelu to work on any constant literal
2. unit test case for rank2, rank3, rank4 for bounded relu without serialized graph

* Removed is_six() method

eaa6091c

Conv+bias shape check for better error detection (#1176) · e42e5815

Louis Feng authored 6 years ago

* Reshape bias to 1D for conv + bias bprop fusion

* Reshape goe2 back to 2D before replacing

* added shape checks to validate conv+bias op.

* removed conv+bias backprop merge for separate PR review.

* fixed conv_bias_bprop test.

* minor changes to error messages.

e42e5815

30 Jun, 2018 2 commits

Pruthvi/fix rnn output (#1135) · c4c24cb0

Pruthvi authored 6 years ago

* - Fixed replace output for the multi layer recurrent cell state tensor output
- Modified rnn add_output to consider direction and n_layer while calculating the output size for mkldnn dst_layer and dst_iter

* fix unit test failure

c4c24cb0

LoopKernel Collector (#1128) · 784735d6

Nick Korovaiko authored 6 years ago

* collector

* keeping track of inputs; simplifying a merging stratey; adding LKGraph

* LoopKernel Collector

* address feedback

* address feedback 2

* address feedback 3

784735d6

28 Jun, 2018 2 commits
- Support dimshuffle/transpose with MKLDNN (#1129) · 846f6bfe
  Nishant Patel authored 6 years ago
```
* Reshape 4d

* Support dimshuffles/transpose with MKLDNN

* Addressing PR Feedback

* Use Eigen for 3D dimshuffles
```
  846f6bfe
- constant broadcast folding (#1139) · 35b04e6a
  Adam Straw authored 6 years ago
```
* constant broadcast folding

* code review feedback
```
  35b04e6a
26 Jun, 2018 3 commits

remove unused file (#1159) · e4db82ec
Robert Kimball authored 6 years ago

e4db82ec

Convolution sum fusion (#1146) · 82ee0a77

Jayaram Bobba authored 6 years ago

* inplace compute

* fix warnings

* Initial support for convolution sum fusion

* Added in-place support for conv sum fusion and test cases

* reverting spurious changes

* Bug fix to account for inplace input in conv sum fusion

* fix compilation error

* Addressed PR feedback

82ee0a77

OS X support (#1098) · 5395a378

Igor Kaplounenko authored 6 years ago

* updated to work with llvm 8.1 that tensorflow is built with

* sane extensions on the mac

* not doing rpath on apple

* apply style

5395a378

25 Jun, 2018 2 commits

inplace compute (#1141) · 88aa9e9c

Nick Korovaiko authored 6 years ago

* inplace compute

* fix warnings

* address bob's feedback

* bob's feedback 2

* bobs feedback 3

* address bob's feedback 4

88aa9e9c

Fix build for MacOS (#1112) · e2e814e3

Robert Kimball authored 6 years ago

* remove reference to ngraph core code from codegen. add stand-alone implementations of needed funcions

* fixed potential pointer leak

* clean up file_util

* more file util cleanup, removing unused functions

* interpreter works on mac

* CPU and INTERPRETER build and pass unmit tests on macos

* move get_directory to file_util

* cleanup

e2e814e3

22 Jun, 2018 1 commit
- refactor cache_prop to reuse bprop inputs (#1134) · 3b49dd1a
  Matthew Brookhart authored 6 years ago
  
  3b49dd1a
21 Jun, 2018 1 commit

Constant folding for Reshapes (#1130) · b9a77a9d

Adam Straw authored 6 years ago

* adding constant propagation pass

* adding test/constant_propagation.cpp

* template make_constant_reshape function

* code review feedback

* add missing files

b9a77a9d

20 Jun, 2018 1 commit
- Fix two bugs with concat for 0-size tensors (#1120) · 22e783ff
  Adam Procter authored 6 years ago
```
* Fix bug with concat for 0-size tensors

* Simplify test for zero-length axes, per PR comments
```
  22e783ff
19 Jun, 2018 2 commits

Bob/cmake (#1118) · 4847b2de

Robert Kimball authored 6 years ago

* fix mkldnn rpath

* fix compile warning

* close backends when exiting

* set backend output directory of backends to the ngraph output directory

* Aprocter/patch patch (#1119)

* Move more rpath stuff inside if(NOT APPLE)

* fix repatch problem with mkldnn library

* add updated patch command for older versions of cmake

4847b2de

Loop Kernel Op + Tests (#1028) · 96295aaa

Nick Korovaiko authored 6 years ago

* loop kernel + tests

* remove commented out code

* remove commented code; add comments

* copy_with_new_args +test

* add comment

* fix comp errors

96295aaa

16 Jun, 2018 2 commits

Strided Convolution (#1058) · 94844d13

Nick Korovaiko authored 6 years ago

* optimized strided convolutions

* clean up debug messages

* format fixes

* more tests

* even more tests

* adapt to resnet-50.v1

* fix format errors; remove changes from diff PRs

94844d13

enable cse for reduction ops (#1030) · 656dfa55
Nick Korovaiko authored 6 years ago
```
* enable cse for reduction ops

* reduction tests
```
656dfa55

15 Jun, 2018 2 commits

move tbb test from backend_test to cpu_test because it is CPU only (#1102) · 7d6a0d1c
Robert Kimball authored 6 years ago

7d6a0d1c

RNN fusion across layers (#1085) · f75b8006

Pruthvi authored 6 years ago

* - Added graph pass for fusing RNN op across layer
- Added test case for inter v/s cpu for verifying layer fused RNN
- more sanity checks in the RNN fusion graph pass
- added support to replace the recurrent cell state correctly in the fused RNN op

* Fixed multi layer rnn fusion unit test failure

* Addressed PR comments

f75b8006

13 Jun, 2018 3 commits

Ubuntu 18 build support (#1101) · 838ba3f1

Robert Kimball authored 6 years ago

* backend libraries now found in tree

dynamically read header search paths

fix running from install

838ba3f1

Group Convolution (#1041) · 4a2c3c9c

Nick Korovaiko authored 6 years ago

*  group conv init

* add GroupConvolution op; refine checks in fusion logic

* add an emitter, cpu assigment

* cpu_layout

* add checks to algebraic simplification

* updating emitter logic for groupconvolution

* working before refactoring

* moving primitive creation logic to mkldnn_emitter

* group convolution graph test

* rename an opt

* address jbobba's feedback

4a2c3c9c

gpu deconvolution (#1099) · 40069d27

Fenglei authored 6 years ago

* add pad_dilation function

* add dilation to gpu_emitter

* add CoordinateDiff constructor to GPUShape

* remove unecessary cast

* working version for forward

* forward working

* forward test all pass

* deconvolution forward

* backward data dilation

* forward test passed

* initial to 0

* fix bug for get_padded_shape and clang format

* code style, change variable names

* refactor convolution conditions

* fix bug padding_below_diff

* change pad_dilation to pad_dynamic, compare to pad

* remove passed convolution test from skip list, clang format

* change pad to use GPUShape

40069d27