Commits · f0acb7daadf2fd42b0ad8c4d4a7e2607f43399d6 · submodule / ngraph

29 Oct, 2018 3 commits

Rob Earhart authored Oct 29, 2018

* Add PlaidML backend

* CR comments

Used m_ prefix for members; removed trailing underscores
Updated license headers
Moved associated header inclusions to project blocks
Wrapped comments to 100 chars
Added missing newlines between functions
Removed nested namespaces in operation implementations

* Add earhart to CODEOWNERS

* Rebase updates

* style

f0acb7da

Support TopK for NvidiaGPU backend (#1908) · d901446d

Ayan Moitra authored Oct 29, 2018

* fresh commit for the changes

* Working topk on ndims for nvGPU

* fix

* clang

* Added unit test, improved kernel hash and Bob's comment

* int64 test+clang

* Moved argReduce and topk tests to a separate file

* TopK unsupported for IntelGPU

* addressed Fenglei and Chris's comments

* addressed Fenglei and Chris's comments

d901446d

IntelGPU backend: Profile data improved (#1932) · 239322e0
Sergey Shalnov authored Oct 29, 2018
```
* IntelGPU backend: Profile data improved

* PR1932. Comments addressed
```
239322e0

28 Oct, 2018 1 commit
- IntelGPU backend: Improved dump graph functionality (#1935) · ffa20eee
  Sergey Shalnov authored Oct 28, 2018
  
  ffa20eee
27 Oct, 2018 4 commits
- IntelGPU backend: Fix profile data for nbench (#1931) · 36544d1b
  shssf authored Oct 27, 2018
  
  36544d1b
- using create_gpu_buffer and free_gpu_buffer (#1930) · dc4320af
  Fenglei authored Oct 27, 2018
  
  dc4320af
- remove js (#1923) · 96875313
  L.S. Cook authored Oct 27, 2018
  
  96875313
- Move to TBB2019 and bug fix to capture functor (#1917) · 981bf4f2
  Jayaram Bobba authored Oct 27, 2018
```
* Move to TBB2019 and bug fix to capture functor

* Change to use TBB release tag

* remove lightweight from codegen

* Enable TBB flow graph tracing
```
  981bf4f2
26 Oct, 2018 10 commits

nvgpu concat split (#1894) · 58bd00de

Fenglei authored Oct 26, 2018

* add split concat

* fix bug

* fix bug

* fix bug

* add test

* fix test bug

* add comments

* format

* return instead of check processed

* remove .back() since it's not vector anymore.

* format

* change to parameterized tests based on Geoff's comments

* types-> type

* change split size to 256

58bd00de

IntelGPU backend: Add graph dump ability (#1925) · bbf66498
shssf authored Oct 26, 2018

bbf66498

Reenabled Chris's nvcc building (#1903) · 4e08d9aa

gcwenger authored Oct 26, 2018

* Reenabled Chris's nvcc building. Improved support for build paths and variations of cuda 8/9 + clang/gcc

* Improved build messages based on feedback

4e08d9aa

Add builder for {de}quantize to make APIs consistent and support {de}quantize with mkldnn (#1839) · 6b36a480

Nishant Patel authored Oct 26, 2018

* Add builder for {de}quantize

* Add declaration in header

* Add mkldnn support for {de}quantize

* Add support for {de}quantize with mkldnn

* Add Dex support

* Generalizing some APIs and adding a test case for DQ in backend_test.in.cpp

* Unify scale between ngraph and mkldnn

* Check for nullptrs

* PR feedback

* fix unit test failure

* Adding tests for builder and deleting the backend tests

* curly braces

* test rename

6b36a480

Documentation update for BatchNorm Ops (#1927) · 1c53fd36
L.S. Cook authored Oct 26, 2018

1c53fd36

DEX Debugger (#1798) · fc5842d9

Nick Korovaiko authored Oct 26, 2018

* gdb-like interface + tests

* fix not being able to run call twice without call

* fix continue bug

* fix enables; rename kontinue to resume

* switch from lists of functors,enables to vector

* address Scott's feedback

* adding a debugger object

* address Jayaram's feedback

fc5842d9

fix ztde concat (#1918) · 8a041166
Nick Korovaiko authored Oct 26, 2018

8a041166
[ONNX CI] Multibranch Pipeline Jenkinsfile (#1915) · 40f5c049
mchrusci authored Oct 26, 2018

40f5c049
argmin/max fix (#1922) · 8df14206
Nick Korovaiko authored Oct 26, 2018

8df14206
Move SigmoidBackprop to BinaryElementwiseArithmetic (#1914) · fb3f9e95
Adam Procter authored Oct 26, 2018

fb3f9e95

25 Oct, 2018 10 commits
- Fix for AllReduce partial shape/type validation (#1913) · 08483fbd
  Adam Procter authored Oct 25, 2018
  
  08483fbd
- Implement partial shape/type validation for TopK (#1912) · 759f79c0
  Adam Procter authored Oct 25, 2018
  
  759f79c0
- address complaint when compiling on debian (#1916) · 7246875e
  Robert Kimball authored Oct 25, 2018
  
  7246875e
- Merge pull request #1774 from NervanaSystems/ayzhuang/in-place-concat · 65600444
  Jayaram Bobba authored Oct 25, 2018
```
Add in place concat optimization.
```
  65600444
- Merge branch 'master' into ayzhuang/in-place-concat · c5f4db5d
  Robert Kimball authored Oct 25, 2018
  
  c5f4db5d
- update git_tags to handle the case where there are no labels in the repo (#1906) · cf241d26
  Robert Kimball authored Oct 25, 2018
  
  cf241d26
- Merge branch 'ayzhuang/in-place-concat' of… · a3115a03
  amy.zhuang authored Oct 25, 2018
```
Merge branch 'ayzhuang/in-place-concat' of https://github.com/NervanaSystems/ngraph into ayzhuang/in-place-concat
```
  a3115a03
- Remove unused variable. · 7bdb11de
  amy.zhuang authored Oct 25, 2018
  
  7bdb11de
- Merge branch 'master' into ayzhuang/in-place-concat · 3a2bfd7e
  Matthew Brookhart authored Oct 25, 2018
  
  3a2bfd7e
- m_direct_execution is used but not defined when NGRAPH_DEX_ONLY=TRUE (#1910) · c62f2b23
  Chris Sullivan authored Oct 25, 2018
```
* m_direct_execution is used but not defined when NGRAPH_DEX_ONLY=TRUE

* keep the ifdef and move m_direct_execution out of the ifdef
```
  c62f2b23
24 Oct, 2018 12 commits

Merge branch 'ayzhuang/in-place-concat' of… · c468bd71

amy.zhuang authored Oct 24, 2018

Merge branch 'ayzhuang/in-place-concat' of https://github.com/NervanaSystems/ngraph into ayzhuang/in-place-concat

c468bd71

Rename two variables. · fe456412
amy.zhuang authored Oct 24, 2018

fe456412
Merge branch 'master' into ayzhuang/in-place-concat · ce2df863
Amy Zhuang authored Oct 24, 2018

ce2df863
No in place concat if input format differs from output format. · 87197ec3
amy.zhuang authored Oct 24, 2018

87197ec3

ArgReduce 64 bit indices (#1862) · 9f0589a8

Chris Sullivan authored Oct 24, 2018

* Update ArgReduce to handle i64 indices.

* Formatting.

* Add throw for output types other than int32/64.

* Add output type to hash.

* Add type to throw.

* Interpreter doesn't currently support 64bit output indices for argmin/max and so disabling this test [JIRA:NGRAPH-3183].

9f0589a8

Partial Shapes and Types, Part 4λ: Convolution and backprops (#1890) · ccfcf4f9

Adam Procter authored Oct 24, 2018

* Implement partial shape/type propagation for Convolution; fail for want of unit tests

* Implement unit tests for partial shapes/types for Convolution

ccfcf4f9

fix Klocwork warnings CPU part 1 (#1902) · 0d693fc3
Nick Korovaiko authored Oct 24, 2018
```
* fix Klocwork warnings CPU part 1

* fix spelling error

* fix a typo
```
0d693fc3
[ONNX] Gemm fix (#1877) · 92c1d504
Adam Rogowiec authored Oct 24, 2018
```
* Fix gemm `input_c` broadcasting.

* Comments.

* Add comment
```
92c1d504
[ONNX CI] Update GitHub credentials (#1905) · 4552d024
mchrusci authored Oct 24, 2018

4552d024
Enable Trigonometric ops (#1879) · 835ecad9
tsocha authored Oct 24, 2018

835ecad9
[ONNX] Non-linear ops (#1864) · a804c3d7
tsocha authored Oct 24, 2018
```
* [ONNX] Non-linear ops

* Style check
```
a804c3d7

Cache and use fprop stats in cudnn batchnorm bprop (#1841) · fbc3a940

Chris Sullivan authored Oct 24, 2018

* Temp bn update commit.

* Add CUDNNBatchNorm which adds two additional outputs to batchnorm, the batch mean and batch inv variance.
The batch mean is the same as the output mean if the cumulative average factor is 1.0. Add BatchNormCache pass which replaces all BatchNorm ops that are inputs to BatchNormBackprop
with CUDNNBatchNorm which outputs the saved batch statistics directly to the backprop step.

* Updated bn cache pass, removed extra tests, added test checking that provided stats are used in bprop instead of batch stats.
This test was disabled for interpreter as the reference kernel needs to be updated to use provided statistics.

* Formatting.

* Update to new batch norm API.

* CUDNNBatchNorm -> BatchNormTrainingWithStats

* new line

* Preprocess input variance into BN denominator for cudnn (#1885)

* Add explicit cuda kernel to calculate what cuDNN describes as the inverse
variance. In reality, the backward cudnn kernel for BN requires 1.0f / sqrt(variance + eps),
which is the batchnorm denominator for each channel (a numerically stable inverse stddev).

This introduces op annotations for batch norm backprop and updates the cudnn_emitter to support the insertion of this cuda kernel when required.

* Disable second test on INTERPRETER.

fbc3a940
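
The preprocessing step described in commit fbc3a940 (turning a per-channel variance into the batchnorm denominator `1.0f / sqrt(variance + eps)`) can be sketched on the host as below. This is only an illustration of the arithmetic, not the CUDA kernel added by the commit; the function name `inv_stddev_from_variance` and its signature are hypothetical.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical helper (not part of nGraph): converts per-channel variance into
// the batchnorm denominator 1 / sqrt(variance + eps), i.e. a numerically
// stable inverse standard deviation, one value per channel.
std::vector<float> inv_stddev_from_variance(const std::vector<float>& variance, float eps)
{
    // A positive eps keeps the result finite when a channel's variance is 0.
    assert(eps > 0.0f);

    std::vector<float> out(variance.size());
    for (std::size_t c = 0; c < variance.size(); ++c)
    {
        out[c] = 1.0f / std::sqrt(variance[c] + eps);
    }
    return out;
}
```

In this sketch each channel is independent, so a GPU version would presumably map one thread to one channel; that mapping is an assumption and is not taken from the commit.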