Commits · dc4320af73c3f4903b5fe28ef70163a6fcd308e8 · submodule / ngraph

26 Oct, 2018 2 commits

Fenglei authored 6 years ago

* add split concat

* fix bug

* fix bug

* fix bug

* add test

* fix test bug

* add comments

* format

* return intead of check processed

* remove .back() since it's not vector anymore.

* format

* change to paramter tests based on Geoff's comments

* types-> type

* change split size to 256

58bd00de

Add builder for {de}quantize to make API's consistent and support {de}quantize with mkldnn (#1839) · 6b36a480

Nishant Patel authored 6 years ago

* Add builder for {de}quantize

* Add declaration in header

* Add mkldnn support for {de}quantize

* Add support for {de}quantize with mkldnn

* Add Dex support

* Generalizing some api's and adding a test case for DQ in backend_test.in.cpp

* Unify scale between ngraph and mkldnn

* Check for nullptrs

* PR feedback

* fix unit test failure

* Adding tests for builder and deleting the backend tests

* curly braces

* test rename

6b36a480

24 Oct, 2018 2 commits

ArgReduce 64 bit indices (#1862) · 9f0589a8

Chris Sullivan authored 6 years ago

* Update ArgReduce to handle i64 indices.

* Formatting.

* Add throw for output types other than int32/64.

* Add output type to hash.

* Add type to throw.

* Interpreter doesn't currently support 64bit output indices for argmin/max and so disabling this test [JIRA:NGRAPH-3183].

9f0589a8

Cache and use fprop stats in cudnn batchnorm bprop (#1841) · fbc3a940

Chris Sullivan authored 6 years ago

* Temp bn update commit.

* Add CUDNNBatchNorm which adds two additional outputs to batchnorm, the batch mean and batch inv variance.
The batch mean is the same as the output mean if the cummulative average factor is 1.0. Add BatchNormCache pass which replaces all BatchNorm ops that are inputs to BatchNormBackprop
with CUDNNBatchNorm which outputs the saved batch statistics directly to the backprop step.

* Updated bn cache pass, removed extra tests, added test checking that provided stats are used in bprop instead of batch stats.
This test was disabled for interpreter as the reference kernel needs to be updated to use provided statistics.

* Formatting.

* Update to new batch norm API.

* CUDNNBatchNorm -> BatchNormTrainingWithStats

* new line

* Preprocess input variance into BN denominator for cudnn (#1885)

* Add explicit cuda kernel to calculate what cuDNN describes as the inverse
variance. In reality, the backward cudnn kernel for BN requires 1.0f / sqrt(variance + eps),
which is the batchnorm denominator for each channel (a numerically stable inverse stddev).

This introduces op annotations for batch norm backprop and updates the cudnn_emitter to support the insertion of this cuda kernel when required.

* Disable second test on INTERPRETER.

fbc3a940

22 Oct, 2018 3 commits

add support for Quantize round mode (#1859) · 51104813

Adam Straw authored 6 years ago

* added half_toward_zero; all previous tests passing

* all rounding modes added with unit tests

* fix cpu emitter

* round mode doc

* round out round modes

* doc typo

* using  names for round modes

* use ceil/floor for rounding functions instead of round/nearbyint

* clean up doc

* equidistant

51104813

BatchNorm splitting into ops (2nd try) (#1828) · 1beec46b

Nick Korovaiko authored 6 years ago

* split bn into bn_inference bn_training

* fix warnings

* Add GPU support for the new BN ops (#1569)

* Add GPU support and change batchnorm_globalstats test to use BNInference.

* Changed test back to using BNTraining for global stats and updated cudnn backend to account for it.

* Fix issues in merge with master.

* Formatting.

* CPU fixes

* remove 5-arg training BN for now

* more fixes

* python batchnorm changes

* fix onnx_import

* fix a call BatchNormInference c-tor

* yet another fix to BatchNormInference c-tor

* AND yet another fix to batchnorm_inference c-tor

* ops.py

* address adam's feedback

* Remove unnecessary parameter/argument.

* remove batch_norm_training_relu_with_global_stats

* remove bn_relu (training)

1beec46b

move unit tests out of backend_test.in.cpp (#1880) · e07147f8
Robert Kimball authored 6 years ago

e07147f8

19 Oct, 2018 1 commit
- Move unit tests out of backend_test.in.cpp (#1865) · 925e7b27
  Robert Kimball authored 6 years ago
```
* comparisons

* move more unit test out of backend_test.in.cpp

* move more tests

* move more tests
```
  925e7b27
14 Oct, 2018 1 commit

Improved AvgPool unit test coverage. Fixed small bug that was revealed. (#1813) · 67844320

gcwenger authored 6 years ago

* Improved AvgPool unit test coverage. Fixed small bug that was revealed.

* Renamed disabled unit tests to reflect new names.

* Ran clang-format on backend_test.in.cpp to fix format.

* Renamed cpu_results->backend_results in two unit tests.

67844320

12 Oct, 2018 1 commit

Support ArgMin and ArgMax for NVGPU Backend (#1737) · 6f30b32b

Ayan Moitra authored 6 years ago

* Project initialization commit

* Added unit tests for 3D tensors for argmax

* Refactored reduce to be used by argmax argmin. argmax argmin still has some issues. WIP

* [WIP]First working version of ArgMax ArgMin

* added reduce buffer for the cudnn api calls

* added reduce buffer for the cudnn api calls

* Further modifications. Using rvalues to pass enums to build reduce method

* more unit tests added

* Incorporate Fenglei's comments

* Incorporating Chris's first set of comments

* small change to test file

* Resolving clang issue that was causing argmin test to fail

* Incorporate Chris's  comments

* clang format issue

6f30b32b

09 Oct, 2018 1 commit
- enable some unit tests that were disabled. (#1766) · 369b95e3
  Robert Kimball authored 6 years ago
  
  369b95e3
08 Oct, 2018 3 commits

Remove a redundant declaration. · 9c564e8c
amy.zhuang authored 6 years ago

9c564e8c
Add in place concat optimization. · 2efc0065
amy.zhuang authored 6 years ago

2efc0065

Update pad on nvpgu (#1759) · 40ff77bd

Chris Sullivan authored 6 years ago

* Add pad with fill operator using the outward-in index pattern.

* Remove static pad and rename build_pad_dynamic -> build_pad. Update maxpool 1d padding.

* Formatting.

* Split build_pad_dynamic into build_pad and build_pad_fill.

* Add test coverage for fixed bug in op::Pad for gpu.

40ff77bd

04 Oct, 2018 1 commit

nvgpu maxpool bug fix (#1741) · 0051f201

Fenglei authored 6 years ago

* add a test failed on gpu, pass on cpu

* fixed bug

* get datatype size

* add descript for test

* update comment

* update comments and name

0051f201

02 Oct, 2018 1 commit
- IntelGPU backend: Datatype workaround for NCF model (#1729) · 0e008cc5
  shssf authored 6 years ago
  
  0e008cc5
29 Sep, 2018 1 commit
- Rename runtime::TensorView to runtime::Tensor (#1699) · 5fc7cf65
  Robert Kimball authored 6 years ago
```
* rename files

* rename runtime TensorView to Tensor

* rename HostTensorView to HostTensor
```
  5fc7cf65
28 Sep, 2018 3 commits
- IntelGPU backend: Use custom eltwise kernel for signed integers (#1716) · fd80d8ee
  shssf authored 6 years ago
  
  fd80d8ee
- IntelGPU backend: Avoid scalar to matrix operation in clDNN (#1715) · 8d70e2a3
  shssf authored 6 years ago
  
  8d70e2a3
- add nGraph dequantize op (#1700) · f6e4323f
  Adam Straw authored 6 years ago
```
* add ngraph dequantize op

* use a floating point offset

* code format

* reminder to fix serializer

* add serializer support

* add dequantize test cases

* cleanup and code format

* fix build warning for implicit conversion
```
  f6e4323f
26 Sep, 2018 1 commit

add nGraph quantize op (#1661) · d640fac3

Adam Straw authored 6 years ago

* adding nGraph Quantize op

* unit test failing for floating point exception

* unit test working in float

* unit test working in uint8

* improved type checking and polished unit test - passing

* quantized axes working

* inclusive project method

* add round mode

* TODO cleanup

* code format

* adding serializer support - fails build

* add serializer support

* make CPU quantize op work; new tests for int8, clamp)

* fix build failure

* fix GPU build issue

* fix GPU unit test manifest

* use quantized offset

* add is_quantized field to element::Type

* add reduce function to coordinate.hpp

d640fac3

18 Sep, 2018 2 commits

Fix bug in cpu_layout: explicitly handle , add test for coverage. (#1621) · 4782e060
Chris Sullivan authored 6 years ago

4782e060

nvgpu optimize reshape v3 (#1617) · 84de3bf4

Fenglei authored 6 years ago

* pass args instead of pointer to array

* add 3d tiled reshpae

* working version

* add shared mem version of 2d, 3d reshape

* remove unused code

* style

* resolve commits

* add test for 3D reshape, some 3D reshape will be treat as 2D

84de3bf4

13 Sep, 2018 1 commit

Handle unsupported op in nbench (#1531) · fe676f72

Robert Kimball authored 6 years ago

* add unsupported_op exception

* unsupported_op test

* add printout of unsupported op in model

* fix GPU dispatcher check

* fix test designation

* catch exceptions on single file runs too

* add unsupported_op exception where needed

* remove unsupported_op class

* add unassigned op exception

* add unit test

* catch unsupported op in nbench

* add cpu test back

* update all latest merges

* mode change

fe676f72

12 Sep, 2018 1 commit

Add in_place support for ReplaceSlice (#1559) · bb6de284

gaurides authored 6 years ago

* Add in_place suport for ReplaceSlice

* Add emit_replace_slice_inplace kernel

* changed file permissions to original

* Formatted code using maint/apply-code-format.sh

* Removed data type check and removed dead code

* Removed setting mkldnn_op(true). ReplaceSlice is not mkldnn op

bb6de284

07 Sep, 2018 1 commit
- IntelGPU backend: Reshape operation optimization (#1566) · 3609cc74
  shssf authored 6 years ago
  
  3609cc74
06 Sep, 2018 1 commit

TopK (w/ArgMax, ArgMin python wrapper) (#1560) · 3548772b

Sang Ik Lee authored 6 years ago

* Implement TopK.

* Update python wrappers for TopK, ArgMin and ArgMax.

* Address some reviewer comments.

* Add type property check tests for TopK.
Set correct TopK behavior for K==0.

* TopK: Add 1d and 3d unit tests.

* Address more reviewer comments.

* Apply code style.

3548772b

04 Sep, 2018 2 commits

nvgpu reduce to scalar optimization (#1491) · 5f40d957

Fenglei authored 6 years ago

* add cuda reduce

* clang format

* fix bugs

* fix bug

* add 1d reduce

* clang format

* fix bugs

* unroll loop

* remove debug info

* revert tests

* unroll 1D reduce op

* add comments

* using cudnn for nd to scalar reduction

* remove cuda 1d reduction since cudnn version is faster

* remove 1D kernel

* fix bugs

* 1d multi block size

* remove debug

* change kernel name

* add reduce to scalar optimization, add test

* fix bugs and tune parameters

* clang format

* update comments

* update comments

* update comments

* clang format

* update comments

* remove wrong comments, apply clang format

* resolve Bob's comment

* clang format

* pass shared mem size from cuLaunchKernel, set unroll loop size through host code

* remove unused code.clang format

* change reduce to thread with shfl for each warp first

* add seed

* unroll size

5f40d957

IntelGPU backend: Sum operation optimization (#1545) · ed22bf6c

shssf authored 6 years ago

* IntelGPU backend: Sum operation optimization

* PR1545. Comments addressed. Test added. Helper function refactored.

ed22bf6c

03 Sep, 2018 1 commit
- TEST: simple test with one constant to two outputs (#1537) · b9cbd039
  shssf authored 6 years ago
  
  b9cbd039
29 Aug, 2018 1 commit

Change license header to use single-line comment (#1508) · a17ec605

Robert Kimball authored 6 years ago

* use line comments instead of multiline comments for license header

* update more

* update new files

* more header updates

* style

a17ec605

27 Aug, 2018 1 commit
- normalize comments (#1492) · 9c48c327
  Robert Kimball authored 6 years ago
```
* normalize comments

* address review comments
```
  9c48c327
22 Aug, 2018 1 commit
- ArgMax (#1453) · 822aa81d
  Nick Korovaiko authored 6 years ago
```
* argmax

* manifests and serailizer
```
  822aa81d
21 Aug, 2018 1 commit

ArgMin (#1435) · 951e77b4

Nick Korovaiko authored 6 years ago

* argmin

* address feedbacka argmin

* add new lines

*  addnew lines

* address adam's nitpicks

* scott's feedback

* fix unit tests

951e77b4

13 Aug, 2018 2 commits

enable parameter validation for all unit tests (#1385) · 24b41844
Robert Kimball authored 6 years ago
```
* enable parameter validation for all unit tests
```
24b41844

Remove validation checks from performance critical code paths and ski… (#1327) · af1201fd

Jayaram Bobba authored 6 years ago

* Remove validation checks from performance critical code paths and skip layout propagation to inputs

* Add templated call method to backend for cases where users need input validation

* Added missing return

* fix python api compile error due to ngraph api change.

* disable parameter validation in python api

* make validating call a separate call rather than templated

af1201fd

08 Aug, 2018 1 commit
- add missing unit tests (#1373) · 104fd3ee
  Robert Kimball authored 6 years ago
  
  104fd3ee
02 Aug, 2018 1 commit

LRN (#1282) · 237c4803

Nick Korovaiko authored 6 years ago

* lrn init

* fix comment

* mkldnn lrn (#1295)

* add serializer + fix compiler warnings

237c4803

26 Jul, 2018 1 commit

IntelGPU backend: broadcast operation (#1252) · d4349db8

shssf authored 6 years ago

* IntelGPUBackend: Broadcast operation

* IntelGPUBackend: more tests for Broadcast operation

* Move macro to static C function in Broadcast tests

d4349db8

18 Jul, 2018 1 commit
- Pool tests updated to check all backends (#1245) · e2255fbd
  Robert Kimball authored 6 years ago
```
* make pool test check backends other than CPU

* more unit test cleanup
```
  e2255fbd