Commits · fe676f72014dfb946c7190b9cbbea77fb74909ec · submodule / ngraph

13 Sep, 2018 6 commits

Handle unsupported op in nbench (#1531) · fe676f72

Robert Kimball authored Sep 13, 2018

* add unsupported_op exception

* unsupported_op test

* add printout of unsupported op in model

* fix GPU dispatcher check

* fix test designation

* catch exceptions on single file runs too

* add unsupported_op exception where needed

* remove unsupported_op class

* add unassigned op exception

* add unit test

* catch unsupported op in nbench

* add cpu test back

* update all latest merges

* mode change

fe676f72

Turn off optimizations on emitted external function. (#1592) · b0e4d8cb

Chris Sullivan authored Sep 13, 2018

Clang applies a __vectorcall optimization in which
address pointers are vector loaded in gpu::invoke_primitive.
This causes a segfault when the stack is not aligned.
Since the GPU transformer does not rely on the CPU for compute,
we disable optimizations on the emitted function.

b0e4d8cb

Modify DEX OneHot op: use generator. (#1446) · 73bff556
Amy Zhuang authored Sep 13, 2018
```
* Modify DEX OneHot op: use generator.

* Cast index to int.
```
73bff556

Control dependencies (#1445) · 58f9af01

Nick Korovaiko authored Sep 13, 2018

* topological sort with cdeps

* add control deps API, fix unit tests

* rollback adjoints changes

* fix test failures, add more tests

* remove dead code

* address scott's feedback

58f9af01
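
The "topological sort with cdeps" step above can be illustrated with a minimal sketch. It assumes a control dependency is simply an extra ordering edge that carries no data. `SketchNode` and `topological_sort` are hypothetical names for illustration, not the ngraph API, and the algorithm shown (Kahn's algorithm) is a stand-in rather than the commit's actual implementation.

```cpp
#include <cstddef>
#include <queue>
#include <stdexcept>
#include <vector>

// Hypothetical node record: indices of the nodes this node depends on.
struct SketchNode
{
    std::vector<std::size_t> data_inputs;  // producers of this node's arguments
    std::vector<std::size_t> control_deps; // nodes that only have to run first
};

// Kahn's algorithm over the union of data edges and control edges.
// Returns node indices ordered so that every dependency precedes its user.
std::vector<std::size_t> topological_sort(const std::vector<SketchNode>& nodes)
{
    std::vector<std::size_t> pending(nodes.size(), 0);
    std::vector<std::vector<std::size_t>> users(nodes.size());

    // Register each dependency edge, whatever its kind.
    auto add_edges = [&](const std::vector<std::size_t>& deps, std::size_t user) {
        for (std::size_t dep : deps)
        {
            users.at(dep).push_back(user);
            ++pending[user];
        }
    };
    for (std::size_t i = 0; i < nodes.size(); ++i)
    {
        add_edges(nodes[i].data_inputs, i);
        add_edges(nodes[i].control_deps, i);
    }

    // Start from nodes with no unmet dependencies.
    std::queue<std::size_t> ready;
    for (std::size_t i = 0; i < nodes.size(); ++i)
    {
        if (pending[i] == 0)
        {
            ready.push(i);
        }
    }

    std::vector<std::size_t> order;
    while (!ready.empty())
    {
        const std::size_t current = ready.front();
        ready.pop();
        order.push_back(current);
        for (std::size_t user : users[current])
        {
            if (--pending[user] == 0)
            {
                ready.push(user);
            }
        }
    }

    if (order.size() != nodes.size())
    {
        throw std::runtime_error("dependency cycle detected");
    }
    return order;
}
```

In this sketch a node with no data inputs can still be pushed later in the order purely by a control dependency, which is the behavior the sort has to respect.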

pass args instead of pointer to array (#1591) · 68eb2e7d
Fenglei authored Sep 13, 2018

68eb2e7d
Bug fix for assert (#1598) · 309bfdf0
gaurides authored Sep 13, 2018

309bfdf0

12 Sep, 2018 7 commits

Use validated call in the quantization unit test and the correct shape (#1590) · 6ad5b97d
Jayaram Bobba authored Sep 12, 2018

6ad5b97d

Add in_place support for ReplaceSlice (#1559) · bb6de284

gaurides authored Sep 12, 2018

* Add in_place support for ReplaceSlice

* Add emit_replace_slice_inplace kernel

* changed file permissions to original

* Formatted code using maint/apply-code-format.sh

* Removed data type check and removed dead code

* Removed setting mkldnn_op(true). ReplaceSlice is not mkldnn op

bb6de284

[ONNX] Tests for reduction ops. (#1589) · ba59b80b

Adam Rogowiec authored Sep 12, 2018

* Add missing header.

* Test for ReduceSum

* Simple tests for reductions

- L1/L2/LogSum/LogSumExp/Max/Mean/Min/Prod/SumSquare.

* Add floating point literal suffix

* Fix typo

ba59b80b

Update fusion doc and add ONNX build flag to buildlb doc (#1585) · 2d0721d5

L.S. Cook authored Sep 12, 2018

* Update fusion doc and add ONNX build flag to buildlb doc

* Fix PR comments

* Final PR review comments addressed

* Fix link on reformatted doc README

* Delete index.rst.save

2d0721d5

nbench buffer copy each iteration (#1578) · c631b50b
Robert Kimball authored Sep 12, 2018
```
* add option to copy input/output data for each iteration

* add support for stale buffers
```
c631b50b

Add support for Quantized Pooling(Max + Avg) op via mkldnn for IA backend (codegen + DEX) (#1571) · 20c2325c

Nishant Patel authored Sep 12, 2018

* Add support for Quantized Pooling(Max + Avg) op via mkldnn for IA backend (codegen + DEX)

* Add checks for min and max

* Extracting out the common code from codegen and DEX

* Use call_with_validate

20c2325c

[ONNX] Shape operator (#1586) · 1cdae06e
tsocha authored Sep 12, 2018
```
* [ONNX] Shape operator

* Review fix pt. 1

* Style check
```
1cdae06e

11 Sep, 2018 4 commits

Interpreter use switch() for main loop (#1538) · d81d0c93

Robert Kimball authored Sep 11, 2018

* wip

* interpreter use switch instead of if/else

* more cleanup

* make nop elimination run on all backends

* revert

* use single include file to define all ops so there is only one instance

* move op.tbl to ngraph/op dir as it is useful. Added usage example.

* add some comments where needed

* revert some changes to reduce delta

* add const

* add more const

* simplify using NodeWrapper

* update per review comments

* update per review comments

* update per review comments

* remove switch warning as it is not supported in older gcc

d81d0c93
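
The combination of a single op table and a switch-based main loop described above is commonly done with the X-macro pattern. The sketch below is a guess at the general shape, not the contents of op.tbl: the macro `SKETCH_OP_LIST`, the enum `OpType`, and both functions are hypothetical, and the table is inlined as a macro here only so the example fits in one file.

```cpp
#include <stdexcept>
#include <string>

// Hypothetical stand-in for an op table: one X(name) entry per op.
// In a real build this list would live in its own include file.
#define SKETCH_OP_LIST(X) \
    X(Add)                \
    X(Multiply)           \
    X(Negative)

// Expand the table into an enum.
enum class OpType
{
#define X(op) op,
    SKETCH_OP_LIST(X)
#undef X
    Unknown
};

// Expand the same table into a name lookup, so the enum and the
// lookup cannot drift apart when an op is added.
OpType op_type_from_name(const std::string& name)
{
#define X(op) \
    if (name == #op) return OpType::op;
    SKETCH_OP_LIST(X)
#undef X
    return OpType::Unknown;
}

// Dispatch with a switch rather than a chain of if/else string compares.
double evaluate(OpType type, double a, double b)
{
    switch (type)
    {
    case OpType::Add: return a + b;
    case OpType::Multiply: return a * b;
    case OpType::Negative: return -a;
    case OpType::Unknown: break;
    }
    throw std::invalid_argument("unsupported op");
}
```

Adding an op then means adding one line to the table; the enum and the lookup pick it up automatically, and only the switch needs a new case.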

set correct perms on source files (#1564) · 5032f343
Nick Korovaiko authored Sep 11, 2018

5032f343
[ONNX] Refactor exceptions to asserts (#1573) · 189cf3b7
Michał Karzyński authored Sep 11, 2018

189cf3b7

Add conv add fusion (#1526) · 37174c90

gaurides authored Sep 11, 2018

* Add conv add fusion

* Updated file permissions and cpu_fusion order

* Formatted code using maint/apply-code-format.sh

* Fixed minor review comments

* Use NODE_VALIDATION_ASSERT instead of throw ngraph_error; upgrade baseline and fix issues

* Some more fixes

37174c90

10 Sep, 2018 1 commit
- IntelGPU backend: BatchNorm operation optimization (#1579) · 36e1de51
  shssf authored Sep 10, 2018
```
* IntelGPU backend: BatchNorm operation optimization

* PR1579. Function moved by request
```
  36e1de51
08 Sep, 2018 1 commit

[ONNX] Reduce* operations (#1562) · 4341c6ac

Adam Rogowiec authored Sep 08, 2018

* ReduceSum and ReduceSumSquare ONNX operations.

* Add new reduction ops.

- ReduceLogSum,
- ReduceLogSumExp,
- ReduceMax,
- ReduceMin,
- ReduceMean,
- ReduceProd.

* Add ReduceL1 and ReduceL2

* Utility generic functions generating monotonic sequences of values.

* Review comments: return AxisSet not std::vector

* Use common functions for generating monotonic sequence.

* Review comments.

4341c6ac
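
One plausible use for the "monotonic sequences" utility mentioned above: ONNX reduction operators reduce over all axes when no `axes` attribute is given, so an importer needs the sequence 0, 1, ..., rank-1 as a default. The sketch below assumes that use case. `monotonic_range` is a hypothetical name and signature, not necessarily what the commit added.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical helper: `count` values starting at `start`, each `step` apart.
template <typename T>
std::vector<T> monotonic_range(std::size_t count, T start = T{0}, T step = T{1})
{
    std::vector<T> values;
    values.reserve(count);
    T current = start;
    for (std::size_t i = 0; i < count; ++i)
    {
        values.push_back(current);
        current += step;
    }
    return values;
}
```

For a rank-3 input, `monotonic_range<std::size_t>(3)` yields the axes 0, 1, 2, which could then be wrapped in whatever axis container the reduction expects.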

07 Sep, 2018 5 commits
- IntelGPU backend: Reshape operation optimization (#1566) · 3609cc74
  shssf authored Sep 07, 2018
  
  3609cc74
- Add support for Dequantize op via mkldnn for IA backend (codegen + DEX) (#1565) · e6267708
  Nishant Patel authored Sep 07, 2018
```
* Add support for Dequantize op via mkldnn for IA backend (codegen + DEX)

* Remove unused variable

* Static cast target range
```
  e6267708
- Constant Folding : Constant + Pad (#1528) · 00a76f3b
  Nick Korovaiko authored Sep 07, 2018
```
* constant + pad

* adding broadcast test back
```
  00a76f3b
- IntelGPU backend: Workaround for unsupported data types (#1572) · 446cf07b
  shssf authored Sep 07, 2018
  
  446cf07b
- [ONNX] Logical ops (#1567) · b21ff63d
  tsocha authored Sep 07, 2018
  
  b21ff63d
06 Sep, 2018 4 commits

TopK (w/ArgMax, ArgMin python wrapper) (#1560) · 3548772b

Sang Ik Lee authored Sep 06, 2018

* Implement TopK.

* Update python wrappers for TopK, ArgMin and ArgMax.

* Address some reviewer comments.

* Add type property check tests for TopK.
Set correct TopK behavior for K==0.

* TopK: Add 1d and 3d unit tests.

* Address more reviewer comments.

* Apply code style.

3548772b

Update doc build v and fix doc on captioning (#1568) · d309e96f

L.S. Cook authored Sep 06, 2018

* Update doc build v and fix doc on captioning

* Clarify to build the library

* update link on README

d309e96f

Double curly-brace initialization (required by clang for non-templated… · 9ea38d22

Chris Sullivan authored Sep 06, 2018

Double curly-brace initialization (required by clang for non-templated functions) causes a compiler error in centos. (#1561)

Since the warning is not enforced in clang for templated functions, we can get around the centos compiler error with only a single set of curly braces here.

9ea38d22

[ONNXIFI] implement onnxGetBackendIDs() interface function (#1546) · 836ee508

Artur Wojcik authored Sep 06, 2018

* onnx: add missing header files
Signed-off-by: Artur Wojcik <artur.wojcik@intel.com>

* onnxifi: implementation of onnxGetBackendIDs
Signed-off-by: Artur Wojcik <artur.wojcik@intel.com>

* onnxifi: add unit tests for onnxGetBackendIDs
Signed-off-by: Artur Wojcik <artur.wojcik@intel.com>

* onnxifi: change std::out_of_range to std::length_error
Signed-off-by: Artur Wojcik <artur.wojcik@intel.com>

* onnxifi: after review changes
Signed-off-by: Artur Wojcik <artur.wojcik@intel.com>

836ee508

05 Sep, 2018 2 commits
- Simplify result copy elimination (#1345) · 626536ad
  Nick Korovaiko authored Sep 05, 2018
```
* simplify result copy elimination

* gpu fix

* remove include header

* circumvent gpu issue

* add a whitespace
```
  626536ad
- Use get_default_axis_vector utility function for Reshape op. (#1558) · 09242c31
  Adam Rogowiec authored Sep 05, 2018
  
  09242c31
04 Sep, 2018 10 commits

add more order to Node, at least a consistent sort order... (#1551) · 42cc4b82
Robert Kimball authored Sep 04, 2018

42cc4b82

nvgpu reduce to scalar optimization (#1491) · 5f40d957

Fenglei authored Sep 04, 2018

* add cuda reduce

* clang format

* fix bugs

* fix bug

* add 1d reduce

* clang format

* fix bugs

* unroll loop

* remove debug info

* revert tests

* unroll 1D reduce op

* add comments

* using cudnn for nd to scalar reduction

* remove cuda 1d reduction since cudnn version is faster

* remove 1D kernel

* fix bugs

* 1d multi block size

* remove debug

* change kernel name

* add reduce to scalar optimization, add test

* fix bugs and tune parameters

* clang format

* update comments

* update comments

* update comments

* clang format

* update comments

* remove wrong comments, apply clang format

* resolve Bob's comment

* clang format

* pass shared mem size from cuLaunchKernel, set unroll loop size through host code

* remove unused code. clang format

* change reduce to thread with shfl for each warp first

* add seed

* unroll size

5f40d957

[ONNX] Pow operator (#1557) · 8fdefa52
tsocha authored Sep 04, 2018

8fdefa52
Merge descriptor::TensorView into descriptor::Tensor (#1536) · 8bab36fb
Scott Cyphers authored Sep 04, 2018
```
* Merge descriptor::TensorView into descriptor::Tensor

* fix GPU build
```
8bab36fb

Cmake flags update (#1539) · 62e470b2

Avijit authored Sep 04, 2018

* Added cmake flags to specify D_GLIBCXX_USE_CXX11_ABI and disable building of doc

* Renamed the NGRAPH_DOC_BUILD_ENABLE flag based on PR feedback

62e470b2

IntelGPU backend: Sum operation optimization (#1545) · ed22bf6c

shssf authored Sep 04, 2018

* IntelGPU backend: Sum operation optimization

* PR1545. Comments addressed. Test added. Helper function refactored.

ed22bf6c

Fix for Conv op (#1556) · 75a18827
Michał Karzyński authored Sep 04, 2018

75a18827
Fix for GEMM op (#1555) · ae6a2903
Michał Karzyński authored Sep 04, 2018

ae6a2903
[ONNX] Numpy style binary broadcasting (#1549) · a2521cf9
tsocha authored Sep 04, 2018

a2521cf9
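
For readers unfamiliar with NumPy-style broadcasting, the shape rule is: align the two shapes from the right, treat missing leading dimensions as 1, and require each pair of dimensions to be equal or have one side equal to 1. The sketch below computes only the resulting shape. `Shape` and `broadcast_shape` are hypothetical names for illustration and do not reflect how the commit implements it.

```cpp
#include <algorithm>
#include <cstddef>
#include <stdexcept>
#include <vector>

using Shape = std::vector<std::size_t>;

// Compute the output shape of a NumPy-style binary broadcast.
Shape broadcast_shape(const Shape& lhs, const Shape& rhs)
{
    const std::size_t rank = std::max(lhs.size(), rhs.size());
    Shape result(rank, 1);
    for (std::size_t i = 0; i < rank; ++i)
    {
        // Read dimensions from the right; missing leading dimensions count as 1.
        const std::size_t l = i < lhs.size() ? lhs[lhs.size() - 1 - i] : 1;
        const std::size_t r = i < rhs.size() ? rhs[rhs.size() - 1 - i] : 1;
        if (l != r && l != 1 && r != 1)
        {
            throw std::invalid_argument("shapes are not broadcast-compatible");
        }
        result[rank - 1 - i] = (l == 1) ? r : l;
    }
    return result;
}
```

For example, shapes {2, 3, 4} and {3, 1} broadcast to {2, 3, 4}, while {2, 3} and {4} are rejected because 3 and 4 differ and neither is 1.
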
[ONNX] Tensor: add support for raw_data (#1552) · cc989301
Artur Wojcik authored Sep 04, 2018

cc989301