Commits · c32512fa07debc86f4b6948104d4dcfc769c031f · submodule / ngraph

27 Feb, 2019 7 commits

Add info about lib versions in an easy to find place (#2508) · c32512fa
Scott Cyphers authored Feb 27, 2019
```
* Add info about lib versions in an easy to find place

* Review comments
```
c32512fa
IntelGPU backend: Concat operation fix (#2506) · 6d9bc696
Sergey Shalnov authored Feb 27, 2019

6d9bc696

Function call working (#2472) · 84167659

Robert Kimball authored Feb 27, 2019

* function call working

* fix compile error

* fix compile error

* add attribute support to plot_graph

* fix build error

* fix merge error

* better colors for FunctionCall op

84167659

Update namespace table and fix broken links (#2503) · 1575e2d1

Leona C authored Feb 27, 2019

* Cleaner API doc reference for compile call

* Add a useful table for nGraph namespaces

* Remove layout namespace

* Show exploding kernel problem on illustration like IEEE preso

* WIP branch for new documentation restructuring that is a huge pain

* Fix the doc reorg mess

* Fix underline

* List of passes disclaimer note

* Update disclaimers on README

* More cleanup of doc reorg

* Update core docs

* Update overview on core

* Add PR feedback

* Get rid of all the gazillion of doc build errors from rearranging stuff

* Add section on tutorials

* Update branch

* Cleanup intro

* Add better detail to overview

* Revise buildlb instructions and add better title for contributing to doc

* Note about unit tests

* Editing

* Update core overview namespace table and fix more broken links due to ToC changes

* Update normalized boolean build defaults

* Update for PR 2507

* Incorporate new PR feedback review

1575e2d1

Unit tests for relevant resnet50 integer ops (#2456) · 86394f10

Ayan Moitra authored Feb 27, 2019

* Int unit tests that fail with bfloat

* move tests out of single file

* style

* Incorporate Bob's comments

* edits

* Incorporate comments

* style

* edits

* Add failing test to intel gpu manifest

* comments incoprorated

86394f10

IntelGPU backend: ConvolutionBackpropData stability fix (#2505) · 77fb38bd
Sergey Shalnov authored Feb 27, 2019

77fb38bd

[ONNX] numpy broadcasting refactoring (#2496) · 50334cbf

tsocha authored Feb 27, 2019

* Remove get_numpy_broadcast_shape helper function

* Remove numpy_style_broadcast_for_binary_operation helper function

* Remove TODO

* Review fix pt. 1

* Remove parameters as shape containers

* Fix LSTM

* Review fix pt. 1

* Style apply

* Use old comment

50334cbf

26 Feb, 2019 11 commits

More quantized fusion patterns (#2480) · b8106133

Jayaram Bobba authored Feb 26, 2019

* Add QuantizedConcat

* Remove unused variables and add check for size of mins and maxes vector

* Resolve conflicts

* Merged with master and addressed some PR feedback

* Maxpool and Avgpool fusions. Exclude Q from conv+relu fusion

* Remove single-user check from fusions

* Quantized concat fusion

* workaround: do reshape sinking by default

* style fix

* check scales for QuantizedConcat

* use compare_constants

* remove stale comment

* Handle all concat cases from arg size 2 to 6

* addressed feedback

b8106133

IntelGPU backend: Relu and Sigmoid datatypes support (#2500) · 3863180d

Sergey Shalnov authored Feb 26, 2019

* IntelGPU backend: Relu and Sigmoid datatypes support

* fix for OpenCL constants

* add const to variables

* PR2500. Style fix

3863180d

[ONNX] GlobalLpPool operator (#2476) · d357cb92

Adam Rogowiec authored Feb 26, 2019

* Utility functions for calculating Lp norm.

* Use functor object as a reduction operation.

* Use new api of make_ng_reduction_op.

* Use utility norm functions for reduction operations.

* Onnx GlobalLpPool operator.

* Ensure correct shapes after lp_norm reduction.

* Remove unused function overload.

* Fix shapes and tensor types.

* Unit tests.

* Update comments.

* Update supported ops status table.

* Fix: take absolute value of input tensor elements.

* UT: with odd value p-norm.

* Fix: move taking abs value into respective lp-norm functions.

* Fix clang -Wdocumentation-unknown-command error.

* Update supported op status table with new Jira ticket for Erf op.

* Update supported_ops status table.

* Update interface of make_ng_reduction_op - accept std::function object.

* Update to use new make_ng_reduction_op api.

* Remove unused header.

* Fix errors on CentOS.

d357cb92

Move CodeWriter out of codegen to ngraph root. (#2473) · c2974ac2

Robert Kimball authored Feb 26, 2019

* Move codewriter out of codegen to ngraph root. It is useful for more than writing code.

* remove codewriter.* from intel gpu backend and use ngraph version

* fix merge issues

c2974ac2

Convert PlaidML Tile op to generic ngraph passthrough op (#2361) · cf33669b

Rob Earhart authored Feb 26, 2019

* Add a direct-to-Tile op

* Disable dequantize_dynamic_offset

* Add missing Py op defn

* Generic passthrough op; serialization

* Appease Linux builds

* Add gpu handlers

* Disable floor_int32 for now

cf33669b

Rollback accidental change to CGO.RelocationModel (#2499) · fb4db5f6
Sang Ik Lee authored Feb 26, 2019

fb4db5f6

fix a bug on finalize when uninitialized bool (#2498) · ee5567c4

Sandeep authored Feb 26, 2019

* fix a bug on finalize when uninitialized bool

* change this_init_comm -> m_init_comm

move init to header

ee5567c4

Upgrades MKLDNN to V0.18-rc (#2486) · 278632dd

Pruthvi authored Feb 26, 2019

* - MKLDNN would choose the algorithm which will potentially give best performance based on
- convolution dimensions number of logical processors available.

- (For auto-dispatching to work as intended,
- use the same thread affinity settings when creating the convolution as when executing the convolution.)
- The relationship between convolution sizes and the best performing algorithm is empirically based on performance observations

* bump mkldnn version to V0.18-rc

* Revert "- MKLDNN would choose the algorithm which will potentially give best performance based on"

This reverts commit 904beb8ad8d4e829fbae5f38a803ea80a72b3ffd.

* Update mkl-dnn patch for soversion removal.

278632dd

[ONNX] Enhance LSTM support. (#2408) · 6e6c8af4
Adam Rogowiec authored Feb 26, 2019

6e6c8af4
use friendly name for serialization and DOT files (#2493) · 25c9152f
Robert Kimball authored Feb 26, 2019

25c9152f
[ONNX] Fix the global alias shadowing warning (#2497) · e8538ba0
Tomasz Dołbniak authored Feb 26, 2019

e8538ba0

25 Feb, 2019 5 commits

[Py] Change author name in Python package (#2489) · 521e31fd
Michał Karzyński authored Feb 25, 2019

521e31fd
Update of MLSL git tag (#2474) · d3453447
Aleksey Marchuk authored Feb 25, 2019
```
* Update of MLSL git tag

* Use last MLSL commit

* Use last valid MLSL commit
```
d3453447

Update mkl-dnn build script. (#2487) · 65ac0e68

Sang Ik Lee authored Feb 25, 2019

Update TBB build script for Windows.

Fix typo.

Fix incorrect omp lib name on Windows.

Fix incorrect tbb.dll path on Windows.

Make LIBRARY and ARCHIVE output directory consistent.

Function missing on Windows.

Update test::util::all_close() to fix compilation issue on Windows

Export CPU_Executable on Windows.

Change nbench path for unit-test on Windows.

Change copy to copy_if_different.

Install CPU backend on Windows.

Disable tools test on Windows.

Disable two failing unit test on Windows CPU.

Fix incorrect CPU backend install path on Windows.

65ac0e68

[Standalone] Introduce CPURuntimeContextCG for standalone codegen generation. (#2421) · e9162eb5

Diego Caballero authored Feb 25, 2019

* [CPUCodegen] Remove unnecessary forward declaration.

* [CPUCodegen] Introduce CPURuntimeContextCG for standalone codegen generation.

This patch introduces CPURuntimeContextCG. This class is aimed at
removing the dependency between nGraph and the generated code in
codegen mode. It will be used to hold the runtime context in
codegen mode and it will be emitted in the generated code. For now,
CPURuntimeContextCG only contains TBB's graph and global context.
Follow-up patches will migrate more members in CPURuntimeContext to
CPURuntimeContextCG for codegen mode.

Testing results:
  - Before: NGRAPH_CODEGEN=1 test/unit-test
    [----------] Global test environment tear-down
    [==========] 2503 tests from 54 test cases ran. (290406 ms total)
    [  PASSED  ] 2490 tests.

  - After: NGRAPH_CODEGEN=1 test/unit-test
    [----------] Global test environment tear-down
    [==========] 2503 tests from 54 test cases ran. (412616 ms total)
    [  PASSED  ] 2490 tests.

* [CPUCodegen] Refactor function parameters string

* Fix bug in CPU_CallFrame destructor impacting DEX

* [Standalone] Replace assert with NGRAPH_ASSERT

e9162eb5

Pruthvi/bi rnn (#2232) · a444f7a9

Pruthvi authored Feb 25, 2019

* - Added reorder support for rnn weights_layer/iter

* i) fixed compilation issues ii) working but still observing precision error

* i) fixed failing rnn unit test for DEX ii) refactored workspace in RNN mkldnn emitter

* i) added support for src reorder to TNC from NTC

* reorder support for rnn output fron NTC to TNC

* - added support for rnn weight reorder ldgoi -> ldigo
- code refactor for lstm/rnn kernel in mkldnn emitter

* - refactor rnn mkldnnn kernel, change variable names

* fix RNN codegen kernel

* disbale layer rnn fusion pass, to test CI

* method to validate recurrent rnn inputs

* add correlated macthes for Recurrent RNN PM

* - simplify reorder logic for rnn_weights
- fix graph pattern for fusing rnn cell across time steps

* do weights reorders in rnn timesteps fusion

* refactored LSTM graph pass

* - Bug fix for finding the lstm inputs determenstically
- Refactored LSTM graph pass to single pass
- made changes to LSTM RNN time step fusion graph pass

* - use replace_node instead of replace_output in Lstm_step_wise fusion graph pass

* fix compilation error

* Fix GNMT rnn fusion

* check if the node is in use before replacing in RNN graph passes

*  i) fix style ii) fix topo sort issue in RNN graph pass

* style fix

* fix bug in simplify_concat pass

* replaces Lstm1 -> {GOE1, GOE2} -> {Slice1, Slice2} -> Concat -> Lstm2 with Lstm1 -> Lstm2

* cse for convert layout

* addressed PR comments

* - optimization pass to remove  Lstm1 -> {GOE1, GOE2} -> {Slice1, Slice2} -> Lstm2
- conditional fusing of LSTM cells only for the decoder

* made changes to multi layer RNN fusion callback

* fix asserts in RNN op

* - added support to fuse layers when slc=dlc for RNN cells
- bug fix on the sanity checks for RNN Op

* - support RNN layer fusion till slc = dlc
- bug fixes in multi layer rnn fusion call back

* capture reshape in the RNN weights

* Addressed PR comments

* - added comments in multi layer PM call back
- fuse only if slc == DLC across layers

* restore deleted 3_lstm_cell_forward.json file

* fix typo

* fix failing unit tets

* When processing in place slice, do not change the offset of the slice node if the argument pointer comes from function input.

* Address PR feedback: process in place slice after propagating in place input.

* Set INTERMEDIATE role before propagating in place input.

* Do not add temporaries to the variable name map before propagating in place input in codegen.

* Fix a bug in codegen.

* Fix a bug in codegen slice.

* reenable disabled rnn unit test

* fix compiler error

* - bug fix in the slicing logic for the layer fused rnn cell
- fix failing rnn unit test

* - Addressed PR comments
- removed redundant checks from the rnn graph pass
- simplified rnn call back replace node logic

* - added new multilayer rnn *.json file
- fix test case

* [PRIVATE BRANCH] Style fixes (#2080)

* Style fixes

* change order of lstm gates

* WIP bi rnn

* [PRIVATE BRANCH] Jbobba/rnn fusion review (#2113)

* Style fixes for single-layer RNN fusion

* Style fixes to multi-layer RNN

* added callback routine for bi-directional rnn

* fix rnn op ctor, rnn mkldnn emitter to accomodate bi directional rnn

* style fix

* added helper function for rnn's to query direction and cell_type

* fix clang error

* - unit test case for bi rnn fusion
- style fix

* - updated bi-rnn graph pass to handle reverse and reverse_seq ops in the predicate
- added bi-rnn inter v/s cpu unit test case
- add support to in mkldnn_utils to create_md with tnc/ntc format

* - added enum type to deduce rnn_type

* Addressed PR comments
    - handle reshapes from {t, n, c} to {n, t, c} in the graph pass

* fix style

* fix clang error

* fix style

* i) move enum specific to rnn to seperate header

a444f7a9

23 Feb, 2019 4 commits

IntelGPU backend: Max and Avg pool fix (#2482) · f8632ea0
Sergey Shalnov authored Feb 23, 2019

f8632ea0

Reorganize doc folders for core-related doc on fusion, graph rewrite, and compiler passes (#2466) · fd0ed37c

Leona C authored Feb 23, 2019

* Cleaner API doc reference for compile call

* Add a useful table for nGraph namespaces

* Remove layout namespace

* Show exploding kernel problem on illustration like IEEE preso

* WIP branch for new documentation restructuring that is a huge pain

* Fix the doc reorg mess

* Fix underline

* List of passes disclaimer note

* Update disclaimers on README

* More cleanup of doc reorg

* Update core docs

* Update overview on core

* Add PR feedback

* Get rid of all the gazillion of doc build errors from rearranging stuff

* Add section on tutorials

* Update branch

* Cleanup intro

* Add better detail to overview

fd0ed37c

[ONNX] Handle trimmed optional outputs. (#2434) · 12b5f085

Adam Rogowiec authored Feb 23, 2019

* Function for retrieving number of node outputs.

* Handle optional trimmed outputs.

* Fix compilation err on clang.

* Fix error for number of outputs.

- Iterate over the minimum of number of outputs we return and the number
  of outputs of respective node in the graph. Some outputs may be
  optional and trimmed, as well as for some op implementations we may
  return not all outputs (ie. Dropout - where we do not return additional
  optional output).

* Update graph.cpp

* Add dropout ONNX op.

* Revert to iterate over node outputs in graph.

* Use more apropriate word in comment.

12b5f085

Do not allow builder to access tensor_data map directly. (#2494) · 718e2ef1
Amy Zhuang authored Feb 23, 2019

718e2ef1

22 Feb, 2019 6 commits

Don't use git shallow clone. (#2492) · f1c72364
Sang Ik Lee authored Feb 22, 2019

f1c72364

Add QuantizedConcat (#2060) · b9ff5d1f

Nishant Patel authored Feb 22, 2019

* Add QuantizedConcat

* Remove unused variables and add check for size of mins and maxes vector

* Resolve conflicts

* Merged with master and addressed some PR feedback

* Avoid float comparison

* make min/max vector, add dequant/quanti

* fix dequant/quant scales

* fix CI build issue

b9ff5d1f

use calls for new backend API in unit tests (#2427) · 26bba737
Robert Kimball authored Feb 22, 2019
```
* use calls for new backend API in unit tests

* fix compile error

* fix compile error
```
26bba737
IntelGPU backend: Convolution support for double and code minor clean up (#2479) · 578f7d8f
Sergey Shalnov authored Feb 22, 2019
```
* IntelGPU backend: Comvolution support for double and code minor clean up

* PR2479. custom kernel selection fix
```
578f7d8f
[ONNX] Fix in overriding ops (#2477) · f64b0e0c
tsocha authored Feb 22, 2019
```
* [ONNX] Overriding custom ops

* Add UT

* Style Check

* Review & style fix
```
f64b0e0c
Update external_cldnn.cmake (#2488) · 2b54d810
aslepko authored Feb 22, 2019
```
Changing clDNN to latest commit.
```
2b54d810

21 Feb, 2019 3 commits
- IntelGPU backend: Quantize operations (#2465) · a0ab82d8
  Sergey Shalnov authored Feb 21, 2019
```
* IntelGPU backend: Quantize operations

* Update intelgpu_op_custom_kernels.cpp
```
  a0ab82d8
- [ONNX] Update status of OneHot operator (#2484) · 25d23a8d
  tsocha authored Feb 21, 2019
  
  25d23a8d
- [ONNX] Import functions input validation (#2475) · ae738690
  Tomasz Dołbniak authored Feb 21, 2019
  
  ae738690
20 Feb, 2019 2 commits
- [ONNX] User friendly assert message when op is not supported (#2478) · 8458b7f4
  Michał Karzyński authored Feb 20, 2019
```
* User friendly assert message

* User friendly assert message

* Update UT
```
  8458b7f4
- Utility function for reading binary file content. (#2423) · 8ea8ea3c
  Adam Rogowiec authored Feb 20, 2019
```
* Utility function for reading binary file content.

* Style apply.

* Review. Add sanity check on file size.

* Style-apply.:
```
  8ea8ea3c
19 Feb, 2019 2 commits
- Auto detect OSX_SYSROOT for macos >= 10.14 (#2470) · f1a6f064
  Sang Ik Lee authored Feb 19, 2019
```
* Auto detect OSX_SYSROOT for macos >= 10.14

* Fix potential regex issue.
```
  f1a6f064
- Use git shallow clone if CMake >= 3.6. Remove BUILD_BYPRODUCTS. (#2469) · b7dc7493
  Sang Ik Lee authored Feb 19, 2019
```
* Use git shallow clone if CMake >= 3.6. Remove BUILD_BYPRODUCTS.

* Fix ONNX, protobuf integration.
```
  b7dc7493