- 31 Oct, 2018 5 commits
-
-
Robert Kimball authored
* create unique_ptr backend * unit test cleanup * address more code that was recently added * change from reference to pointer when passing backend to reduce the number of lines changed. * fix build error * fix python wrapper * style * more specific treatment for unique_ptr
-
Sergey Shalnov authored
-
Adam Rogowiec authored
* [WIP] Draft for matmul * Numpy-style broadcasting for matrix multiplication. * 3D matrix multiplication with one big Dot/slice/concat. * Generic ND matmul implementation using slice/dot/concat pattern. * Code formatting. * remove unused header * Add missing header * Utility reshape-like functions. * Use utility functions. * Review comments. * Code format * Use if/else instead of ternary operator for readability. * Remove unused function overloading * Utility function expanding tensor shape with empty axes. * Use helper functions. * Use type for auto variable initializer to fix Centos build * Fix Centos build errors.
-
Rob Earhart authored
-
Nishant Patel authored
-
- 30 Oct, 2018 4 commits
-
-
Robert Kimball authored
-
Michał Karzyński authored
-
gaurides authored
* Initial implementation of GroupConv+BatchNorm fusion * Added GroupConv+BatchNorm with Relu fusion * Added changes to fuse with BoundedRelu * Changed BoundedRelu to Relu * Added test; Code cleanup * Code formatting * Removed dead code * Added test cases and other misc * Bug fix in group conv callback and general cleanup * Address PR feedback * Minor edit to comment. MKLDNN divides both input and output channels by groups * Style fixes and PR feedback
-
Michał Karzyński authored
* Add ArgMin operator * Add ArgMax and a basic test case * Rename variables * Apply workaround for problems with Reshape on i64 * Review comments * Review comments
-
- 29 Oct, 2018 5 commits
-
-
Sergey Shalnov authored
-
Sergey Shalnov authored
-
Rob Earhart authored
* Add PlaidML backend * CR comments Used m_ prefix for members; removed trailing underscores Updated license headers Moved associated header inclusions to project blocks Wrapped comments to 100 chars Added missing newlines between functions Removed nested namespaces in operation implementations * Add earhart to CODEOWNERS * Rebase updates * style
-
Ayan Moitra authored
* fresh commit for the changes * Working topk on ndims for nvGPU * fix * clang * Added unit test, improved kernel hash and Bob's comment * int64 test+clang * Moved argReduce and topk tests to a separate file * TopK unsupported for IntelGPU * addressed Fenglei and Chris's comments * addressed Fenglei and Chris's comments
-
Sergey Shalnov authored
* IntelGPU backend: Profile data improved * PR1932. Comments addressed
-
- 28 Oct, 2018 1 commit
-
-
Sergey Shalnov authored
-
- 27 Oct, 2018 3 commits
-
-
shssf authored
-
Fenglei authored
-
Jayaram Bobba authored
* Move to TBB2019 and bug fix to capture functor * Change to use TBB release tag * remove lightweight from codegen * Enable TBB flow graph tracing
-
- 26 Oct, 2018 8 commits
-
-
Fenglei authored
* add split concat * fix bug * fix bug * fix bug * add test * fix test bug * add comments * format * return intead of check processed * remove .back() since it's not vector anymore. * format * change to paramter tests based on Geoff's comments * types-> type * change split size to 256
-
shssf authored
-
gcwenger authored
* Reenabled Chris's nvcc building. Improved support for build paths and variations of cuda 8/9 + clang/gcc * Improved build messages based on feedback
-
Nishant Patel authored
* Add builder for {de}quantize * Add declaration in header * Add mkldnn support for {de}quantize * Add support for {de}quantize with mkldnn * Add Dex support * Generalizing some api's and adding a test case for DQ in backend_test.in.cpp * Unify scale between ngraph and mkldnn * Check for nullptrs * PR feedback * fix unit test failure * Adding tests for builder and deleting the backend tests * curly braces * test rename
-
Nick Korovaiko authored
* gdb-like interface + tests * fix not being able to run call twice without call * fix continue bug * fix enables; rename kontinue to resume * switch from lists of functors,enables to vector * address scott's feedback * adding a debugger object * address jayarams feedback
-
Nick Korovaiko authored
-
Nick Korovaiko authored
-
Adam Procter authored
-
- 25 Oct, 2018 5 commits
-
-
Adam Procter authored
-
Adam Procter authored
-
Robert Kimball authored
-
amy.zhuang authored
-
Chris Sullivan authored
* m_direct_execution is used but not defined when NGRAPH_DEX_ONLY=TRUE * keep the ifdef and move m_direct_execution out of the ifdef
-
- 24 Oct, 2018 9 commits
-
-
amy.zhuang authored
-
amy.zhuang authored
-
Chris Sullivan authored
* Update ArgReduce to handle i64 indices. * Formatting. * Add throw for output types other than int32/64. * Add output type to hash. * Add type to throw. * Interpreter doesn't currently support 64bit output indices for argmin/max and so disabling this test [JIRA:NGRAPH-3183].
-
Adam Procter authored
* Implement partial shape/type propagation for Convolution; fail for want of unit tests * Implement unit tests for partial shapes/types for Convolution
-
Nick Korovaiko authored
* fix Klockwork warnings CPU part 1 * fix spelling error * fix a typo
-
Adam Rogowiec authored
* Fix gemm `input_c` broadcasting. * Comments. * Add comment
-
tsocha authored
-
tsocha authored
* [ONNX] Non-linear ops * Style check
-
Chris Sullivan authored
* Temp bn update commit. * Add CUDNNBatchNorm which adds two additional outputs to batchnorm, the batch mean and batch inv variance. The batch mean is the same as the output mean if the cummulative average factor is 1.0. Add BatchNormCache pass which replaces all BatchNorm ops that are inputs to BatchNormBackprop with CUDNNBatchNorm which outputs the saved batch statistics directly to the backprop step. * Updated bn cache pass, removed extra tests, added test checking that provided stats are used in bprop instead of batch stats. This test was disabled for interpreter as the reference kernel needs to be updated to use provided statistics. * Formatting. * Update to new batch norm API. * CUDNNBatchNorm -> BatchNormTrainingWithStats * new line * Preprocess input variance into BN denominator for cudnn (#1885) * Add explicit cuda kernel to calculate what cuDNN describes as the inverse variance. In reality, the backward cudnn kernel for BN requires 1.0f / sqrt(variance + eps), which is the batchnorm denominator for each channel (a numerically stable inverse stddev). This introduces op annotations for batch norm backprop and updates the cudnn_emitter to support the insertion of this cuda kernel when required. * Disable second test on INTERPRETER.
-