Commits · 94d93423a3c0d43683195dc0febbf6c3f28e92fa · submodule / ngraph

27 Jan, 2019 1 commit
- add unit test to check that set_parameters_and_results is called · b7e2fd7f
  Robert Kimball authored 6 years ago
  
  b7e2fd7f
23 Jan, 2019 1 commit
- Divert dynamic offset to reference (#2339) · ab7f23d3
  Nishant Patel authored 6 years ago
```
* Divert dynamic offset to reference

* update gpu manifest

* Avoid dynamic cast
```
  ab7f23d3
18 Jan, 2019 1 commit
- Fix a CoordinateTransform corner case exposed by ONNX unit tests · 4ffa0d83
  Adam Procter authored 6 years ago
  
  4ffa0d83
17 Jan, 2019 1 commit

Retire FunctionCall, Reduce, ReduceWindow, SelectAndScatter (#2223) · 18d0993e

Adam Procter authored 6 years ago

* Retire Reduce, ReduceWindow, SelectAndScatter

* Remove lingering AnyAllReplacement code

* Remove apparently-now-unused macro

* Remove lingering op/reduce.hpp includes

* Remove FunctionCall

* Update Python stuff to remove FunctionCall, Reduce

* Add Any/All tests to GPU manifest

* Remove deleted .hpp #include from gpu_compiled_function.cpp

* Restore reduce_function.cpp since that is where the All/Any implementations ended up residing

* Add reduce_function.cpp back into CMakeLists.txt

* Remove #include of deleted reduce.hpp

* Re-remove AnyAllReplacement from GPU passes

* Remove deleted #includes from gpu_internal_function.cpp

* Remove git conflict stuff (whoops)

* Add newline at EOF, per review comment

* Address flake8 complaint about unused import

18d0993e

07 Jan, 2019 1 commit

Simplified all_close_f interface and tightened default criteria (#2285) · 0eaa960c

gcwenger authored 6 years ago

* Simplified & tightened all_close_f parameters

Removed specification of mantissa bits for all_close_f in favor
of just specifying tolerance bits. Tightened up all_close_f default.
Fixed LRN unit test which had insufficient result precision to pass
tighter all_close_f tolerance.

* Addressed PR comments.

Reworked mantissa bit and tolerance constants.
Clarified and improved graph comparison tolerance calculation flexibility.
Clarified unit test tolerance testing.

0eaa960c

03 Jan, 2019 1 commit
- update licenses for 2019 (#2275) · ba299b93
  Robert Kimball authored 6 years ago
```
* update licenses for 2019

* style
```
  ba299b93
19 Dec, 2018 2 commits

Make explicit compile call in unit tests (#2224) · 7693f74e

Robert Kimball authored 6 years ago

* make validate public

* move compile call outside of call for unit tests

* fix compile error

* one more error

7693f74e

Enable in place slice when slice's arg is function input. (#2214) · 02d4aa59

Amy Zhuang authored 6 years ago

* Enable in place slice when slice's arg is function input.

* Add a corner case.

* Add codegen support.

* Set the correct offset for in-place-slice when there is a chain of in place ops starting from parameter.

02d4aa59

11 Dec, 2018 1 commit

Windows build support (#2177) · 9234cc69

Robert Kimball authored 6 years ago

* files pulled from bob/winbuild

* fix compile problems

* fix a few windows build errors

* add windows file to exclude from git

* add comment why change was made

* revert obsolete change

* more cleanup

* building interpreter and unit test on windows with DLLs

* Add flag for windows to export all symbols. Short term fix.

* enable MD build

* address warnings

* dump all windows build results to a single directory

* fix windows backend dll open issue

* remove debug

* fix file iterator for windows

* fix merge error

* fix test failure

* change header from h to hpp in hopes of making python happy

* address more linux build issues

* fix visibility enable

9234cc69

07 Dec, 2018 1 commit

Backend API change pre-work (#2064) · e0933553

Robert Kimball authored 6 years ago

* change compile call to return Handle

* make CPU require compile() before call()

* fix unit tests to call compile() before call()

* fix failing ops

* update unit test

* revert some changes

* more fixups

* more diff cleanup

* a few more issues addressed

* more fixes

* update API

* more updates

* fix test_ops.py

* fix

* another attempt to fix

* fix unit test

* fix test error

e0933553

05 Dec, 2018 1 commit

Support for 5D batchnorm (#2055) · d4f8bfdc

Pruthvi authored 6 years ago

* - modified cpu_assignment pass to support bn with input 5D
- added test cases for 5D bn and 5D bn+relu

* - Address PR comments
- used mkldnn_utils to validate bn for mkldnn

* fix compilation error

* Addressed PR comments
- added helpers in mkldnn_utils for assigning ngraph Op as MKLDNN op
- helper funnction for bn mkldnn assignment

* fix clang error

d4f8bfdc

28 Nov, 2018 1 commit

Cyphers/bnorm back (#2129) · 403a09ce

Scott Cyphers authored 6 years ago

* Fix batchnorm argument order, cleanup some comments, fix backprop

* Merge error

* Clean up training function, organize inference test

* BatchNormInference tests

* Training case

* Training test

* Fix autodiff BatchNorm test

* Cleanup

* Move file to doc checkout

* Update disabled test name in igpu manifest
Fix unnused variable

* Unit tests disables

* Review comments

403a09ce

21 Nov, 2018 1 commit

Use cuda_reduce for int32 & int8 input_type instead of cudnn_reduce (#2070) · d36c180f

Ayan Moitra authored 6 years ago

* add cuda reduce for product and max and added tests

* a quick fix for empty reduce axis and 0 axis

* adding min cuda reduce

* add for min

* fix bug and format

* add another min test

* adding sum to the mix and adding tests to intelGPU manifest

* Incorporate Chris's first comment + clang

* Some mods to the last commit

* Addressed Bob's comments + added more tests int8

* Added more int8 tests + added tests to IntelGPU manifest

* CI test failure debug attempt

* clang

* edit

* Adding the CPU failing test to manifest

* pulled changes from master to address travis ci build failure

d36c180f

16 Nov, 2018 1 commit

Move ParameterVector and ResultVector to the ngraph namespace (#2054) · 803c38aa

Robert Kimball authored 6 years ago

* Move ParameterVector and ResultVector to the ngraph namespace where they belong

* update python wrapper

* more python fixes

* style

* Update setup.py

* fix some new code

803c38aa

15 Nov, 2018 1 commit

adding i32 as quantized type (#2050) · a9686f94

Adam Straw authored 6 years ago

* adding i32 as quantized type

* code format

* mask gpu unit tests

* unused variable

* intel gpu unit test manifest

* fix typo in unit test manifest

a9686f94

11 Nov, 2018 1 commit

nvgpu softmax cuda version (#2014) · be9f031e

Fenglei authored 6 years ago

* add softmax cuda support

* optimize block size

* remove debug info

* remove debug

* style

* remove unused

* remove cudnn softmax

* format

* using nullptr

* move helper, add test

* fix style

* using all_close_f

* using kahansum

* style

* remove commentted out code

be9f031e

09 Nov, 2018 2 commits

Add experimental ShapeOf op (#2023) · 3a47eafc

Adam Procter authored 6 years ago

* Add ShapeOf op

* Helps to check in the source files

* Add shape_of_scalar to unit test manifests

* Add missing include to gpu_emitter.cpp

* Change 'this op is experimental' wording per @indie's suggestion

* New idea: let's try not mallocing 300 terabytes

* Update interpreter implementation

3a47eafc

Add in-place-slice optimization for CPU backend. (#1967) · 65355a17

Amy Zhuang authored 6 years ago

* Add in-place-slice optimization for CPU backend.

* Modify slice emitter function for in place slice.

* Allow arg node to have multiple outputs for in place slice.

* Remove unused variable.

* Add CPUExecutionContext argument to slice builder.

* Address PR feedback: move computation out of the functor.

* Move size computation out of the functor for in place concat.

65355a17

08 Nov, 2018 1 commit

Address potential bug in cudnnGetReductionWorkspaceSize (#1990) · dfc20454

Chris Sullivan authored 6 years ago

* When CUDNN_DATA_TYPE == CUDNN_DATA_DOUBLE, it appears that the cudnn calculated workspace size is incorrect.
Adding a temporary fix here until the underlying issue is found.

* Add softmax test illustrating bug in cudnn impl.

* disable new unit test in intel GPU

dfc20454

07 Nov, 2018 1 commit

NOP backend (#1979) · 4918449c

Robert Kimball authored 6 years ago

* add nop backend

* nop backend

* fix flag name

* add new switch to cmake output of switch settings

* add new unit test to igpu manifest

* remove redundant test

4918449c

02 Nov, 2018 1 commit
- Update unhandled op tests to support late validation (#1961) · b00530a5
  Adam Procter authored 6 years ago
  
  b00530a5
01 Nov, 2018 1 commit

Dropout for CPU (#1942) · 5b994011

Nick Korovaiko authored 6 years ago

* generate mask

* add codegen+dex

* states in context

* CPU dropout

* remove dead code

* remove dead code

* remove state.cpp

* change perms, add state.cpp

* address bobs feedback

* restore igpu unit-test manifest after a bad merge

* better error msgs

* throw on GPUs to keep a compiler happy

* address more feedback

* fix tests

5b994011

29 Oct, 2018 1 commit

Support TopK for NvidiaGPU backend (#1908) · d901446d

Ayan Moitra authored 6 years ago

* fresh commit for the changes

* Working topk on ndims for nvGPU

* fix

* clang

* Added unit test, improved kernel hash and Bob's comment

* int64 test+clang

* Moved argReduce and topk tests to a separate file

* TopK unsupported for IntelGPU

* addressed Fenglei and Chris's comments

* addressed Fenglei and Chris's comments

d901446d

26 Oct, 2018 2 commits

nvgpu concat split (#1894) · 58bd00de

Fenglei authored 6 years ago

* add split concat

* fix bug

* fix bug

* fix bug

* add test

* fix test bug

* add comments

* format

* return intead of check processed

* remove .back() since it's not vector anymore.

* format

* change to paramter tests based on Geoff's comments

* types-> type

* change split size to 256

58bd00de

Add builder for {de}quantize to make API's consistent and support {de}quantize with mkldnn (#1839) · 6b36a480

Nishant Patel authored 6 years ago

* Add builder for {de}quantize

* Add declaration in header

* Add mkldnn support for {de}quantize

* Add support for {de}quantize with mkldnn

* Add Dex support

* Generalizing some api's and adding a test case for DQ in backend_test.in.cpp

* Unify scale between ngraph and mkldnn

* Check for nullptrs

* PR feedback

* fix unit test failure

* Adding tests for builder and deleting the backend tests

* curly braces

* test rename

6b36a480

24 Oct, 2018 2 commits

ArgReduce 64 bit indices (#1862) · 9f0589a8

Chris Sullivan authored 6 years ago

* Update ArgReduce to handle i64 indices.

* Formatting.

* Add throw for output types other than int32/64.

* Add output type to hash.

* Add type to throw.

* Interpreter doesn't currently support 64bit output indices for argmin/max and so disabling this test [JIRA:NGRAPH-3183].

9f0589a8

Cache and use fprop stats in cudnn batchnorm bprop (#1841) · fbc3a940

Chris Sullivan authored 6 years ago

* Temp bn update commit.

* Add CUDNNBatchNorm which adds two additional outputs to batchnorm, the batch mean and batch inv variance.
The batch mean is the same as the output mean if the cummulative average factor is 1.0. Add BatchNormCache pass which replaces all BatchNorm ops that are inputs to BatchNormBackprop
with CUDNNBatchNorm which outputs the saved batch statistics directly to the backprop step.

* Updated bn cache pass, removed extra tests, added test checking that provided stats are used in bprop instead of batch stats.
This test was disabled for interpreter as the reference kernel needs to be updated to use provided statistics.

* Formatting.

* Update to new batch norm API.

* CUDNNBatchNorm -> BatchNormTrainingWithStats

* new line

* Preprocess input variance into BN denominator for cudnn (#1885)

* Add explicit cuda kernel to calculate what cuDNN describes as the inverse
variance. In reality, the backward cudnn kernel for BN requires 1.0f / sqrt(variance + eps),
which is the batchnorm denominator for each channel (a numerically stable inverse stddev).

This introduces op annotations for batch norm backprop and updates the cudnn_emitter to support the insertion of this cuda kernel when required.

* Disable second test on INTERPRETER.

fbc3a940

22 Oct, 2018 3 commits

add support for Quantize round mode (#1859) · 51104813

Adam Straw authored 6 years ago

* added half_toward_zero; all previous tests passing

* all rounding modes added with unit tests

* fix cpu emitter

* round mode doc

* round out round modes

* doc typo

* using  names for round modes

* use ceil/floor for rounding functions instead of round/nearbyint

* clean up doc

* equidistant

51104813

BatchNorm splitting into ops (2nd try) (#1828) · 1beec46b

Nick Korovaiko authored 6 years ago

* split bn into bn_inference bn_training

* fix warnings

* Add GPU support for the new BN ops (#1569)

* Add GPU support and change batchnorm_globalstats test to use BNInference.

* Changed test back to using BNTraining for global stats and updated cudnn backend to account for it.

* Fix issues in merge with master.

* Formatting.

* CPU fixes

* remove 5-arg training BN for now

* more fixes

* python batchnorm changes

* fix onnx_import

* fix a call BatchNormInference c-tor

* yet another fix to BatchNormInference c-tor

* AND yet another fix to batchnorm_inference c-tor

* ops.py

* address adam's feedback

* Remove unnecessary parameter/argument.

* remove batch_norm_training_relu_with_global_stats

* remove bn_relu (training)

1beec46b

move unit tests out of backend_test.in.cpp (#1880) · e07147f8
Robert Kimball authored 6 years ago

e07147f8

19 Oct, 2018 1 commit
- Move unit tests out of backend_test.in.cpp (#1865) · 925e7b27
  Robert Kimball authored 6 years ago
```
* comparisons

* move more unit test out of backend_test.in.cpp

* move more tests

* move more tests
```
  925e7b27
14 Oct, 2018 1 commit

Improved AvgPool unit test coverage. Fixed small bug that was revealed. (#1813) · 67844320

gcwenger authored 6 years ago

* Improved AvgPool unit test coverage. Fixed small bug that was revealed.

* Renamed disabled unit tests to reflect new names.

* Ran clang-format on backend_test.in.cpp to fix format.

* Renamed cpu_results->backend_results in two unit tests.

67844320

12 Oct, 2018 1 commit

Support ArgMin and ArgMax for NVGPU Backend (#1737) · 6f30b32b

Ayan Moitra authored 6 years ago

* Project initialization commit

* Added unit tests for 3D tensors for argmax

* Refactored reduce to be used by argmax argmin. argmax argmin still has some issues. WIP

* [WIP]First working version of ArgMax ArgMin

* added reduce buffer for the cudnn api calls

* added reduce buffer for the cudnn api calls

* Further modifications. Using rvalues to pass enums to build reduce method

* more unit tests added

* Incorporate Fenglei's comments

* Incorporating Chris's first set of comments

* small change to test file

* Resolving clang issue that was causing argmin test to fail

* Incorporate Chris's  comments

* clang format issue

6f30b32b

09 Oct, 2018 1 commit
- enable some unit tests that were disabled. (#1766) · 369b95e3
  Robert Kimball authored 6 years ago
  
  369b95e3
08 Oct, 2018 3 commits

Remove a redundant declaration. · 9c564e8c
amy.zhuang authored 6 years ago

9c564e8c
Add in place concat optimization. · 2efc0065
amy.zhuang authored 6 years ago

2efc0065

Update pad on nvpgu (#1759) · 40ff77bd

Chris Sullivan authored 6 years ago

* Add pad with fill operator using the outward-in index pattern.

* Remove static pad and rename build_pad_dynamic -> build_pad. Update maxpool 1d padding.

* Formatting.

* Split build_pad_dynamic into build_pad and build_pad_fill.

* Add test coverage for fixed bug in op::Pad for gpu.

40ff77bd

04 Oct, 2018 1 commit

nvgpu maxpool bug fix (#1741) · 0051f201

Fenglei authored 6 years ago

* add a test failed on gpu, pass on cpu

* fixed bug

* get datatype size

* add descript for test

* update comment

* update comments and name

0051f201

02 Oct, 2018 1 commit
- IntelGPU backend: Datatype workaround for NCF model (#1729) · 0e008cc5
  shssf authored 6 years ago
  
  0e008cc5
29 Sep, 2018 1 commit
- Rename runtime::TensorView to runtime::Tensor (#1699) · 5fc7cf65
  Robert Kimball authored 6 years ago
```
* rename files

* rename runtime TensorView to Tensor

* rename HostTensorView to HostTensor
```
  5fc7cf65