Commits · 05a404a8a75ca430b110b170961154ca944c9c48 · submodule / ngraph

31 Oct, 2018 5 commits

Change Backend::create to return std::unique_ptr<Backend> (#1909) · 05a404a8

Robert Kimball authored 6 years ago

* create unique_ptr backend

* unit test cleanup

* address more code that was recently added

* change from reference to pointer when passing backend to reduce the number of lines changed.

* fix build error

* fix python wrapper

* style

* more specific treatment for unique_ptr

05a404a8

IntelGPU backend: Remove nodes "Result" from cldnn::topology (#1952) · b5beac87
Sergey Shalnov authored 6 years ago

b5beac87

[ONNX] Generic N-dimensional MatMul operation. (#1921) · 92c1cc19

Adam Rogowiec authored 6 years ago

* [WIP] Draft for matmul

* Numpy-style broadcasting for matrix multiplication.

* 3D matrix multiplication with one big Dot/slice/concat.

* Generic ND matmul implementation using slice/dot/concat pattern.

* Code formatting.

* remove unused header

* Add missing header

* Utility reshape-like functions.

* Use utility functions.

* Review comments.

* Code format

* Use if/else instead of ternary operator for readability.

* Remove unused function overloading

* Utility function expanding tensor shape with empty axes.

* Use helper functions.

* Use type for auto variable initializer to fix Centos build

* Fix Centos build errors.

92c1cc19

[PlaidML] Specialize within namespaces (for Linux) (#1948) · 61df6725
Rob Earhart authored 6 years ago

61df6725
Fix call to reference quantize (#1944) · 46d0376f
Nishant Patel authored 6 years ago

46d0376f

30 Oct, 2018 4 commits

pass FunctionInstance to interpreter op execution (#1947) · 5cfc7e92
Robert Kimball authored 6 years ago

5cfc7e92
[ONNX] Support for legacy broadcasting rules (#1924) · 8ef1ec04
Michał Karzyński authored 6 years ago

8ef1ec04

Gauri/groupconv batchnorm (#1900) · c637d629

gaurides authored 6 years ago

* Initial implementation of GroupConv+BatchNorm fusion

* Added GroupConv+BatchNorm with Relu fusion

* Added changes to fuse with BoundedRelu

* Changed BoundedRelu to Relu

* Added test; Code cleanup

* Code formatting

* Removed dead code

* Added test cases and other misc

* Bug fix in group conv callback and general cleanup

* Address PR feedback

* Minor edit to comment. MKLDNN divides both input and output channels by groups

* Style fixes and PR feedback

c637d629

[ONNX] Add ArgMin/Max operators (#1898) · 2a49f1c8

Michał Karzyński authored 6 years ago

* Add ArgMin operator

* Add ArgMax and a basic test case

* Rename variables

* Apply workaround for problems with Reshape on i64

* Review comments

* Review comments

2a49f1c8

29 Oct, 2018 5 commits

IntelGPU backend: Add Slice to dump (#1936) · fe5673de
Sergey Shalnov authored 6 years ago

fe5673de
IntelGPU backend: Implemented some clDNN controls (#1934) · 8306d049
Sergey Shalnov authored 6 years ago

8306d049

Add PlaidML backend (#1888) · f0acb7da

Rob Earhart authored 6 years ago

* Add PlaidML backend

* CR comments

Used m_ prefix for members; removed trailing underscores
Updated license headers
Moved associated header inclusions to project blocks
Wrapped comments to 100 chars
Added missing newlines between functions
Removed nested namespaces in operation implementations

* Add earhart to CODEOWNERS

* Rebase updates

* style

f0acb7da

Support TopK for NvidiaGPU backend (#1908) · d901446d

Ayan Moitra authored 6 years ago

* fresh commit for the changes

* Working topk on ndims for nvGPU

* fix

* clang

* Added unit test, improved kernel hash and Bob's comment

* int64 test+clang

* Moved argReduce and topk tests to a separate file

* TopK unsupported for IntelGPU

* addressed Fenglei and Chris's comments

* addressed Fenglei and Chris's comments

d901446d

IntelGPU backend: Profile data improved (#1932) · 239322e0
Sergey Shalnov authored 6 years ago
```
* IntelGPU backend: Profile data improved

* PR1932. Comments addressed
```
239322e0

28 Oct, 2018 1 commit
- IntelGPU backend: Improved dump graph functionality (#1935) · ffa20eee
  Sergey Shalnov authored 6 years ago
  
  ffa20eee
27 Oct, 2018 3 commits
- IntelGPU backend: Fix profile data for nbench (#1931) · 36544d1b
  shssf authored 6 years ago
  
  36544d1b
- using create_gpu_buffer and free_gpu_buffer (#1930) · dc4320af
  Fenglei authored 6 years ago
  
  dc4320af
- Move to TBB2019 and bug fix to capture functor (#1917) · 981bf4f2
  Jayaram Bobba authored 6 years ago
```
* Move to TBB2019 and bug fix to capture functor

* Change to use TBB release tag

* remove lightweight from codegen

* Enable TBB flow graph tracing
```
  981bf4f2
26 Oct, 2018 8 commits

nvgpu concat split (#1894) · 58bd00de

Fenglei authored 6 years ago

* add split concat

* fix bug

* fix bug

* fix bug

* add test

* fix test bug

* add comments

* format

* return intead of check processed

* remove .back() since it's not vector anymore.

* format

* change to paramter tests based on Geoff's comments

* types-> type

* change split size to 256

58bd00de

IntelGPU backend: Add graph dump ability (#1925) · bbf66498
shssf authored 6 years ago

bbf66498

Reenabled Chris's nvcc building (#1903) · 4e08d9aa

gcwenger authored 6 years ago

* Reenabled Chris's nvcc building. Improved support for build paths and variations of cuda 8/9 + clang/gcc

* Improved build messages based on feedback

4e08d9aa

Add builder for {de}quantize to make API's consistent and support {de}quantize with mkldnn (#1839) · 6b36a480

Nishant Patel authored 6 years ago

* Add builder for {de}quantize

* Add declaration in header

* Add mkldnn support for {de}quantize

* Add support for {de}quantize with mkldnn

* Add Dex support

* Generalizing some api's and adding a test case for DQ in backend_test.in.cpp

* Unify scale between ngraph and mkldnn

* Check for nullptrs

* PR feedback

* fix unit test failure

* Adding tests for builder and deleting the backend tests

* curly braces

* test rename

6b36a480

DEX Debugger (#1798) · fc5842d9

Nick Korovaiko authored 6 years ago

* gdb-like interface + tests

* fix not being able to run call twice without call

* fix continue bug

* fix enables; rename kontinue to resume

* switch from lists of functors,enables to vector

* address scott's feedback

* adding a debugger object

* address jayarams feedback

fc5842d9

fix ztde concat (#1918) · 8a041166
Nick Korovaiko authored 6 years ago

8a041166
argmin/max fix (#1922) · 8df14206
Nick Korovaiko authored 6 years ago

8df14206
Move SigmoidBackprop to BinaryElementwiseArithmetic (#1914) · fb3f9e95
Adam Procter authored 6 years ago

fb3f9e95

25 Oct, 2018 5 commits
- Fix for AllReduce partial shape/type validation (#1913) · 08483fbd
  Adam Procter authored 6 years ago
  
  08483fbd
- Implement partial shape/type validation for TopK (#1912) · 759f79c0
  Adam Procter authored 6 years ago
  
  759f79c0
- address complaint when compiling on debian (#1916) · 7246875e
  Robert Kimball authored 6 years ago
  
  7246875e
- Remove unused variable. · 7bdb11de
  amy.zhuang authored 6 years ago
  
  7bdb11de
- m_direct_execution is used but not defined when NGRAPH_DEX_ONLY=TRUE (#1910) · c62f2b23
  Chris Sullivan authored 6 years ago
```
* m_direct_execution is used but not defined when NGRAPH_DEX_ONLY=TRUE

* keep the ifdef and move m_direct_execution out of the ifdef
```
  c62f2b23
24 Oct, 2018 9 commits

Rename two variables. · fe456412
amy.zhuang authored 6 years ago

fe456412
No in place concat if input format differs from output format. · 87197ec3
amy.zhuang authored 6 years ago

87197ec3

ArgReduce 64 bit indices (#1862) · 9f0589a8

Chris Sullivan authored 6 years ago

* Update ArgReduce to handle i64 indices.

* Formatting.

* Add throw for output types other than int32/64.

* Add output type to hash.

* Add type to throw.

* Interpreter doesn't currently support 64bit output indices for argmin/max and so disabling this test [JIRA:NGRAPH-3183].

9f0589a8

Partial Shapes and Types, Part 4λ: Convolution and backprops (#1890) · ccfcf4f9

Adam Procter authored 6 years ago

* Implement partial shape/type propagation for Convolution; fail for want of unit tests

* Implement unit tests for partial shapes/types for Convolution

ccfcf4f9

fix Klockwork warnings CPU part 1 (#1902) · 0d693fc3
Nick Korovaiko authored 6 years ago
```
* fix Klockwork warnings CPU part 1

* fix spelling error

* fix a typo
```
0d693fc3
[ONNX] Gemm fix (#1877) · 92c1d504
Adam Rogowiec authored 6 years ago
```
* Fix gemm `input_c` broadcasting.

* Comments.

* Add comment
```
92c1d504
Enable Trigonometric ops (#1879) · 835ecad9
tsocha authored 6 years ago

835ecad9
[ONNX] Non-linear ops (#1864) · a804c3d7
tsocha authored 6 years ago
```
* [ONNX] Non-linear ops

* Style check
```
a804c3d7

Cache and use fprop stats in cudnn batchnorm bprop (#1841) · fbc3a940

Chris Sullivan authored 6 years ago

* Temp bn update commit.

* Add CUDNNBatchNorm which adds two additional outputs to batchnorm, the batch mean and batch inv variance.
The batch mean is the same as the output mean if the cummulative average factor is 1.0. Add BatchNormCache pass which replaces all BatchNorm ops that are inputs to BatchNormBackprop
with CUDNNBatchNorm which outputs the saved batch statistics directly to the backprop step.

* Updated bn cache pass, removed extra tests, added test checking that provided stats are used in bprop instead of batch stats.
This test was disabled for interpreter as the reference kernel needs to be updated to use provided statistics.

* Formatting.

* Update to new batch norm API.

* CUDNNBatchNorm -> BatchNormTrainingWithStats

* new line

* Preprocess input variance into BN denominator for cudnn (#1885)

* Add explicit cuda kernel to calculate what cuDNN describes as the inverse
variance. In reality, the backward cudnn kernel for BN requires 1.0f / sqrt(variance + eps),
which is the batchnorm denominator for each channel (a numerically stable inverse stddev).

This introduces op annotations for batch norm backprop and updates the cudnn_emitter to support the insertion of this cuda kernel when required.

* Disable second test on INTERPRETER.

fbc3a940