Commits · v0.11.0 · submodule / ngraph

11 Dec, 2018 3 commits
- fix crash in ReshapeConvertLayout (#2205) · bf14a381
  gaurides authored Dec 11, 2018
```
* fix crash in ngraph-tf test conv_ops_test.Conv2DTest.testConv2DKernelSmallerThanStrideSame

* fix file perms

* correct checks
```
  bf14a381
- Fix TF test failures on Mac. (#2210) · 1640d21e
  Amy Zhuang authored Dec 11, 2018
```
* Bug fixes to unordered map checks

* No in-place slice for non-native MKLDNN layouts

* is_op
```
  1640d21e
- is_op (#2203) · c9eef901
  Nick Korovaiko authored Dec 11, 2018
  
  c9eef901
10 Dec, 2018 1 commit

Harryk remove winml ref (#2204) · 90aa7336

harryskim authored Dec 10, 2018

* Removed winml from stack diagram

* Removed winml from full stack diagram

* Update README.md

* update the diagram without winml

* Changed sentence about WinML

* Removed duplication

90aa7336

08 Dec, 2018 4 commits

change all_close tests to return gtest AssertionResult instead of bool (#2195) · fcdfc4ce

Robert Kimball authored Dec 08, 2018

* change all_close tests to return gtest AssertionResult instead of bool to allow for better error messages

* change throw to return error

* address PR comments and fix compile error

fcdfc4ce

reenable mkldnn convolution for large padding (#2168) · 15d9b658

Jayaram Bobba authored Dec 08, 2018

* reenable mkldnn convolution for large padding

* specify precision tolerance to unit test

* pass tolerance values to all_close

15d9b658

move GPU specific test to GPU only (#2191) · 40dda4eb

Robert Kimball authored Dec 08, 2018

* move GPU specific test to GPU only

* fix unit test invocation

* fix compile error

* fix compile error

* style

* fix runtime error

40dda4eb

make GOE extend from util::Op (#2153) · 453a6a3c
Nick Korovaiko authored Dec 08, 2018
```
* make GOE extend from util::Op

* fix build breaks
```
453a6a3c

07 Dec, 2018 6 commits

Update slice kernels (#2180) · a16c4961

Jayaram Bobba authored Dec 07, 2018

* initial commit for update slice op

* Finished up update_slice fusion and added codegen support

* style fixes

* Added unit test for in-place update-slice strided

* change pattern name

a16c4961

Backend API change pre-work (#2064) · e0933553

Robert Kimball authored Dec 07, 2018

* change compile call to return Handle

* make CPU require compile() before call()

* fix unit tests to call compile() before call()

* fix failing ops

* update unit test

* revert some changes

* more fixups

* more diff cleanup

* a few more issues addressed

* more fixes

* update API

* more updates

* fix test_ops.py

* fix

* another attempt to fix

* fix unit test

* fix test error

e0933553

IntelGPU backend: Fix memory copy into zero tensors (#2192) · c95bdf64
Sergey Shalnov authored Dec 07, 2018

c95bdf64

Support for all_close_f w/ doubles (#2184) · 125f7242

gcwenger authored Dec 07, 2018

* Double support for all_close_f

* all_close_f uses fixed number of mantissa bits now. Simplified testing code.

* Initialize test data members in constructor to values which will cause test failure. Setup then sets them correctly.

* Reduce info printed out during all_close_f unit tests.

125f7242

Update TBB from 2019_U1 to 2019_U2. (#2154) · 91c4b553
Sang Ik Lee authored Dec 07, 2018

91c4b553
re-enable quantize_clamp_int32 test on CPU (#2090) · bba2b3bd
Adam Straw authored Dec 07, 2018
```
* re-enable quantize_clamp_int32 test on CPU

* MLKDNN typo
```
bba2b3bd

06 Dec, 2018 14 commits

QCBiasAdd and QCBiasSignedAdd for mkldnn (#2062) · 1f40160d

Nishant Patel authored Dec 06, 2018

* Quantize the bias to int32

* Bias scale fix

* mnist works

* Quantize Bias

* Introduce Quantize op in the graph to quantize bias & feedback

* Add QuantizedConvBiasAdd

* Comments and some refactoring

* Add test case with float bias and enable int32 as quantized type in ngraph

* Change shape of scale from Shape{} to Shape{1} in the backend

* Add QuantizedConvBiasSignedAdd

* Fix Layouts, clean up and a test case for QCBA

* Test case for QCBSA

* cleanup mkldnn_emitter.hpp

* fix build error

* Constant fold

1f40160d

IntelGPU backend: Allow more cases for clDNN gemm (#2187) · 4034a0c2
Sergey Shalnov authored Dec 06, 2018

4034a0c2

DEX Loop Kernel (updated) (#2156) · 8fc481a3

Nick Korovaiko authored Dec 06, 2018

* one output

passing tests

clean up

fix build breaks

* move generators into a separate file

8fc481a3

add a throw in lieu of a return stmt (#2183) · 56980738
Nick Korovaiko authored Dec 06, 2018

56980738
an env var to disable individual fusions (#2185) · 504e78f8
Nick Korovaiko authored Dec 06, 2018
```
* an env var to disable individual fusions

* fix env var name
```
504e78f8
Give Fusions Names (#2178) · a09d5f88
Nick Korovaiko authored Dec 06, 2018
```
* give fusions names

* fix build breaks

* fix perms
```
a09d5f88
Abort messages in Matcher to better understand cases where we fail to match (#2179) · 06916cbc
Nick Korovaiko authored Dec 06, 2018
```
*  abort messages in matcher.cpp

* style fixes
```
06916cbc

Graph comparison - isolated per op testing (#2144) · 1feb49f1

gcwenger authored Dec 06, 2018

* Isolated per op testing when comparing graphs for better determination of source of accuracy divergence.

* Improve clarity of comment

1feb49f1

[Py] Update README for PyPI (#2151) · 8a9cf8aa

Michał Karzyński authored Dec 06, 2018

* Update README for PyPI

* Update README for PyPI

* Remove redundant newlines

* Fix links

8a9cf8aa

[Py] setup.py code style formatting. (#2164) · 8249bf9f

Adam Rogowiec authored Dec 06, 2018

* Uniform quotes style .

* Fix comment style.

* Check setup.py with flake8.

- Fix flake8 errors.

* Move function out of class scope.

* Fix function paramter list

* Fix formatting.

8249bf9f

nvgpu cuda reduce with stable sum (#2076) · 606f3f93

Fenglei authored Dec 06, 2018

* add some helper function

* update with new helper function

* update reduce to nd with new helper function

* update float sum to stable sum

* fix bug

* update all reduce to stable sum for float

* fix bug and pass the sum stable test

* remove debug info

* style

* update with shape

* fix bug

* add host parameters to cuda_emitter

* clang format

* fix bugs

* add element::type support

* format

* add a cached value with datatype name

* add init_reduce_value

* unroll loop

* optimization

* remove the need for init_value

* add memset kernel

* add memcpy

* working version

* remove debug info

* add comments, clean up code.

* change in_idx to input_idx

* fix bug

* change args name for memset in emitter

* pass element::Type instead of string

* the op::reduce come with init value, add support

* resolve codacy-bot comment

* fix bug

* resove codacy-bot comment

* remove unused comments, resolve comments

* cuda reduce for max, min, mul, reduce op init value, format

* use type::info

* use type info for numeric_limits

* remove code from gpu_host_parameters

* header

* remvoe outdated comments

* add helper to check if stable sum is needed

* add stable sum test for double

* remove extra line

* consolidate helper functions

* no need list now.

* remove extra ;

* clang format

* style

* add skip test for cpu and intelGPU side

* add line between groups of headers

* add two simple stable sum test for float and double

* skip test for intelGPU

606f3f93

Fix compiler error GCC with 7.1 (#2155) · 4b0445d1
Fabian Boemer authored Dec 06, 2018

4b0445d1

Pruthvi/fix rnn precision (#1874) · 73da681a

Pruthvi authored Dec 06, 2018

* - Added reorder support for rnn weights_layer/iter

* i) fixed compilation issues ii) working but still observing precision error

* i) fixed failing rnn unit test for DEX ii) refactored workspace in RNN mkldnn emitter

* i) added support for src reorder to TNC from NTC

* reorder support for rnn output fron NTC to TNC

* - added support for rnn weight reorder ldgoi -> ldigo
- code refactor for lstm/rnn kernel in mkldnn emitter

* - refactor rnn mkldnnn kernel, change variable names

* fix RNN codegen kernel

* disbale layer rnn fusion pass, to test CI

* method to validate recurrent rnn inputs

* add correlated macthes for Recurrent RNN PM

* - simplify reorder logic for rnn_weights
- fix graph pattern for fusing rnn cell across time steps

* do weights reorders in rnn timesteps fusion

* refactored LSTM graph pass

* - Bug fix for finding the lstm inputs determenstically
- Refactored LSTM graph pass to single pass
- made changes to LSTM RNN time step fusion graph pass

* - use replace_node instead of replace_output in Lstm_step_wise fusion graph pass

* fix compilation error

* Fix GNMT rnn fusion

* check if the node is in use before replacing in RNN graph passes

*  i) fix style ii) fix topo sort issue in RNN graph pass

* style fix

* fix bug in simplify_concat pass

* replaces Lstm1 -> {GOE1, GOE2} -> {Slice1, Slice2} -> Concat -> Lstm2 with Lstm1 -> Lstm2

* cse for convert layout

* addressed PR comments

* - optimization pass to remove  Lstm1 -> {GOE1, GOE2} -> {Slice1, Slice2} -> Lstm2
- conditional fusing of LSTM cells only for the decoder

* made changes to multi layer RNN fusion callback

* fix asserts in RNN op

* - added support to fuse layers when slc=dlc for RNN cells
- bug fix on the sanity checks for RNN Op

* - support RNN layer fusion till slc = dlc
- bug fixes in multi layer rnn fusion call back

* capture reshape in the RNN weights

* Addressed PR comments

* - added comments in multi layer PM call back
- fuse only if slc == DLC across layers

* restore deleted 3_lstm_cell_forward.json file

* fix typo

* fix failing unit tets

* When processing in place slice, do not change the offset of the slice node if the argument pointer comes from function input.

* Address PR feedback: process in place slice after propagating in place input.

* Set INTERMEDIATE role before propagating in place input.

* Do not add temporaries to the variable name map before propagating in place input in codegen.

* Fix a bug in codegen.

* Fix a bug in codegen slice.

* reenable disabled rnn unit test

* fix compiler error

* - bug fix in the slicing logic for the layer fused rnn cell
- fix failing rnn unit test

* - Addressed PR comments
- removed redundant checks from the rnn graph pass
- simplified rnn call back replace node logic

* - added new multilayer rnn *.json file
- fix test case

* [PRIVATE BRANCH] Style fixes (#2080)

* Style fixes

* change order of lstm gates

* [PRIVATE BRANCH] Jbobba/rnn fusion review (#2113)

* Style fixes for single-layer RNN fusion

* Style fixes to multi-layer RNN

* style fix

* disable GPU test

73da681a

fix failing bn test (#2175) · 86b783c6
Pruthvi authored Dec 06, 2018
```
* fix fialing bn test

* fix style
```
86b783c6

05 Dec, 2018 9 commits
- Jbobba/fix squeeze padded layouts (#2136) · 05c7fbe4
  Jayaram Bobba authored Dec 05, 2018
```
* fix expand layout for padded dimensions

* enable squeeze padded layouts
```
  05c7fbe4
- remove reshapes on both hands of a binary op (#2157) · f2038de2
  Nick Korovaiko authored Dec 05, 2018
  
  f2038de2
- fix address sanitizer issue in batchnorm fprop and bprop kernel (#2159) · 9e885d05
  Pruthvi authored Dec 05, 2018
  
  9e885d05
- Support for 5D batchnorm (#2055) · d4f8bfdc
  Pruthvi authored Dec 05, 2018
```
* - modified cpu_assignment pass to support bn with input 5D
- added test cases for 5D bn and 5D bn+relu

* - Address PR comments
- used mkldnn_utils to validate bn for mkldnn

* fix compilation error

* Addressed PR comments
- added helpers in mkldnn_utils for assigning ngraph Op as MKLDNN op
- helper funnction for bn mkldnn assignment

* fix clang error
```
  d4f8bfdc
- Make MKLDNN_ENABLE_CONCURRENT_EXEC ON to use concurrent scratchpad in mkldnn. (#2170) · c5dd80be
  Amy Zhuang authored Dec 05, 2018
  
  c5dd80be
- Merge pull request #2174 from NervanaSystems/bob/ext2 · fffc5679
  Robert Kimball authored Dec 05, 2018
```
Bob/ext2
```
  fffc5679
- Merge branch 'master' into master · 39f56f10
  Chris Sullivan authored Dec 05, 2018
  
  39f56f10
- Revert "Bug fix for invalid memory access to Constant (GPU backend) (#2162)" (#2172) · 592c375e
  Robert Kimball authored Dec 05, 2018
```
This reverts commit 1c4aa225.
```
  592c375e
- Bug fix for invalid memory access to Constant (GPU backend) (#2162) · 1c4aa225
  Robert Kimball authored Dec 05, 2018
```
Fix the incorrect way to query the size of Constant tensor, which lead invalid memory access
```
  1c4aa225
04 Dec, 2018 3 commits
- Merge branch 'master' into master · 0f05495c
  Scott Cyphers authored Dec 04, 2018
  
  0f05495c
- IntelGPU backend: Use clDNN matrix operations for nGraph::Dot (#2105) · 244c9fcf
  Sergey Shalnov authored Dec 04, 2018
```
* IntelGPU backend: Use clDNN matrix operations for nGraph::Dot

* Update unit_test.manifest
```
  244c9fcf
- Merge branch 'master' into master · 1b5340c4
  Scott Cyphers authored Dec 04, 2018
  
  1b5340c4