Commits · b5446d87bcf8f7bc7a4d82f07c1d0267903d3957 · submodule / ngraph

14 Jan, 2019 1 commit
- Fix xmmintrin guard (#2309) · b5446d87
  Rob Earhart authored Jan 14, 2019
  
  b5446d87
13 Jan, 2019 1 commit
- add GetOutputElementElimination (#2302) · 1ad6c7f0
  Fenglei authored Jan 13, 2019
  
  1ad6c7f0
12 Jan, 2019 4 commits

IntelGPU backend: minor fixes in statistic (#2300) · 4680678d
Sergey Shalnov authored Jan 12, 2019

4680678d

Minor PlaidML updates (#2283) · f79b40a7

Rob Earhart authored Jan 12, 2019

* Use static cast where possible

* Tensor API update

* Move prefix reshape elision to be a general pass

* Use pass config to select Winograd optimization

* Use get_is_transpose() to detect transposes

* Use get_default_order to build AxisVectors

f79b40a7

IntelGPU backend: Use new clDNN version 12.1 (#2280) · d74ea190
Sergey Shalnov authored Jan 12, 2019
```
* IntelGPU backend: Use new clDNN version 12.1

* PR2280. Comments addressed
```
d74ea190
Initial backend for non-IA CPUs (#2268) · 956f66ad
Robert Kimball authored Jan 12, 2019
```
* first cut at raspberry pi backend

* rename rpi to generic cpu

* disable cursed test
```
956f66ad

09 Jan, 2019 1 commit
- Replace relu implementation with element_wise (#2295) · 20bd8bbc
  Sang Ik Lee authored Jan 09, 2019
  
  20bd8bbc
08 Jan, 2019 8 commits

Allow external mklml outside of prebuilt mkldnn install directory. (#2291) · f21eeb8d
Sang Ik Lee authored Jan 08, 2019
```
* Allow external mklml outside of prebuilt mkldnn install directory.

* Limit prebuilt mkl-dnn support to Linux.
```
f21eeb8d
IntelGPU backend: fix typo in BatchNorm handling (#2294) · 5ecab1ad
Sergey Shalnov authored Jan 08, 2019

5ecab1ad
[NGCPU-339] UT for ArgMin ArgMax with int32 input data type. (#2256) · 25ab8a28
Adam Rogowiec authored Jan 08, 2019

25ab8a28

Fix signed conv op (#2287) · 259c0a48

Nishant Patel authored Jan 08, 2019

* Fix signedconv op

* Add assert and change the dynamic scale test case

* Change assert

* Update quantized_conv_bias.cpp

* Update quantized_conv_bias.cpp

259c0a48

fix bug in rnn matrix fusion call back (#2279) · 8e1922be

Pruthvi authored Jan 08, 2019

* - made changes to slicing logic in the rnn input matrix fusion call back
- this fixes bug in the GNMT

* - fix unit test seg fault
- add sorting slices logic make the replace_node easier

* i) add check for overlapping slices
ii) addressed PR comments

* remove ambiguity check

8e1922be

any/all stop-gap CPU implementation (#2250) · ea6a5b85
Nick Korovaiko authored Jan 08, 2019
```
* any/all stop-gap CPU implementation

* remove pass
```
ea6a5b85

Use plaidml cmake config (#2290) · fa3200f1

Rob Earhart authored Jan 08, 2019

* Use plaidml cmake config

* Require PlaidML if requested

* Don't install libplaidml

We can assume that it's correctly installed on the target, especially
since it needs to be correctly installed in order to find its
configuration files.

fa3200f1

add int32 support to argmin/argmax (#2288) · a2d8a9fd
Nick Korovaiko authored Jan 08, 2019

a2d8a9fd

07 Jan, 2019 4 commits

[onnx] fix building while NGRAPH_ONNXIFI_ENABLE is off (#2286) · 1df4cc36
Artur Wojcik authored Jan 07, 2019
```
Signed-off-by: Artur Wojcik <artur.wojcik@intel.com>
```
1df4cc36

[ONNX] ConvTranspose with groups. (#2289) · dee4a8b8

Adam Rogowiec authored Jan 07, 2019

* Enable support for group attribute.

* UT for ConvTranspose with groups.

* Validate group attribute value.

* Move helper function to unnamed namespace.

* Access values with bounds checking.

* Fix spelling.

dee4a8b8

Simplified all_close_f interface and tightened default criteria (#2285) · 0eaa960c

gcwenger authored Jan 07, 2019

* Simplified & tightened all_close_f parameters

Removed specification of mantissa bits for all_close_f in favor
of just specifying tolerance bits. Tightened up all_close_f default.
Fixed LRN unit test which had insufficient result precision to pass
tighter all_close_f tolerance.

* Addressed PR comments.

Reworked mantissa bit and tolerance constants.
Clarified and improved graph comparison tolerance calculation flexibility.
Clarified unit test tolerance testing.

0eaa960c

[NGCORE-270] UT for Softplus ONNX operator testing edge cases. (#2254) · 15a0bf19

Adam Rogowiec authored Jan 07, 2019

* UT for Softplus ONNX operator testing edge cases.

* Rename UT model name.

* Handle overflows.

* Add UT for ininite values and check them correctly.

* Update values in comment

15a0bf19

05 Jan, 2019 1 commit

nvgpu backend without clang (#2115) · 757621be

Chris Sullivan authored Jan 05, 2019

* Separate out external function base class.

* pt1 first step to removing m_writer from GPU_Emitter.

* pt2 add gpu_internal function skeleton

* pt3 temporarily add to gpu_backend for prototyping.

* pt4 add call frame (partial) and runtime constructor

* pt 5 implement resolution for function memory reservations. build new tensor wrapper for use with call frame.

* pt 6 resolve compilation errors.

* pt 7 Add host emitter for emitting host primtives and implement in gpu emitter.

* pt 8 add compile time manifest.

* pt 9 add simple runtime tracer.

* pt 10 seperate runtimes for different functions. index by function name, should switch to using function instance_id for look up performance.

* pt 11 add function call interface and support nested call frames

* pt 12 Reshape elimination check in emitter needs to include offset.

* pt 13 Add default indentation to all op emissions in gpu external functions.

* pt 14 fix constant mem reservation (should not depend on the tmeporary buffers existence check.

* pt 15 backward pooling for avg pool requires only one param. rather than passing this param
three times, this commit changes the runtime to detect if its avgpooling and pass the appropriate pointers.
This is a hold over until max and avgpool are refactored into separate cudnn emitters.

* pt 16 update cmake compatibility. gpu backend can now be built without clang via NGRAPH_DEX_ONLY.
if this cmake variable is not define, then both clang codegen (via gpu external function) and interpreter (via gpu internal function) modes will be built.
for now codegen is the default backend but can be explicitly disabled by setting the env. variable to NGRAPH_CODEGEN=0/FALSE/NO/etc.

additional note: made codegen::CodeWriter header-only so that it can be used independently of whether the clang codegen library is compiled.

* pt 17 fix issues with merge from master

* pt 18 factor compile function into a few virtual calls so that common passes can be added in a single location for both backends.

* pt 19 formatting

* Remove code_writer.cpp from cmake and disable (temporarily) some reduce tests that require changes to gpu_emitter.cpp

* Move call frame and runtime constructor implementations to source files.

* Use member m_common_function_string.

* Applying analogous bug fix as found in #2145

* Remove underscore from GPU_CompiledFunction, GPU_ExternalFunction, and GPU_InternalFunction.

* Made static members of GPUCompiledFunction static methods.

* Remove 'No' codegen options, use std::toupper and applied format

* review comments

* Remove vector overload for resolve inputs/outputs in GPUCallFrame.

* Remove diagnostic pragmas

757621be

03 Jan, 2019 6 commits

Fix a throw in slice cpu_memory_optimization.cpp due to unsupported i64 in MKLDNN (#2278) · 84d6ae08
Nick Korovaiko authored Jan 03, 2019
```
* fix throw in cpu_memory_optimization

* add input_tensor back

* more descriptive bail-out msg
```
84d6ae08
add version.hpp to ngraph install files for external backends (#2277) · 27f972fc
Robert Kimball authored Jan 03, 2019
```
* add version.hpp to ngraph install files for external backends

* update date
```
27f972fc
update licenses for 2019 (#2275) · ba299b93
Robert Kimball authored Jan 03, 2019
```
* update licenses for 2019

* style
```
ba299b93
[ONNX] Variadic operators in opset 8. (#2261) · a8ce39d6
Adam Rogowiec authored Jan 03, 2019
```
* Add broadcasting to variadic OP in opset 8.

* Apply style format.

* Update onnx_import.cpp
```
a8ce39d6

API cleanup & performance passes (#2242) · f5b2d581

Rob Earhart authored Jan 03, 2019

* nGraph compat updates

* Simplify op structure

* Add zero-dim elimination pass

* Add logical data conversion pass

* Add graphviz support

* Add implicit broadcast pass

* Elide unnecessary reshape ops

* Merge reshapes into convolutions

* Add winograd convolution support

* Allow three-input maxpool backprop

* Add concat elision

* Combine replication operations

* Elide unneeded replicates

* Style update

f5b2d581

Fix numeric instability in batchnorm bprop (#2246) · c11644ec
Scott Cyphers authored Jan 03, 2019
```
* Fix numeric instability in batchnorm bprop

* Another instability
```
c11644ec

02 Jan, 2019 2 commits

print llvm build choice and allow cmake arg to override env var (#2266) · b7f097ec
Robert Kimball authored Jan 02, 2019
```
* print llvm build choice and allow cmake arg to override env var

* fix cmake error

* use existing function
```
b7f097ec

[ONNX] automatically detect value types of TensorProto and AttributeProto (#2262) · 5dc708a1

Artur Wojcik authored Jan 02, 2019

* [ONNX] detect automatically value types of TensorProto and AttributeProto
Signed-off-by: Artur Wojcik <artur.wojcik@intel.com>

* onnx: style apply
Signed-off-by: Artur Wojcik <artur.wojcik@intel.com>

5dc708a1

31 Dec, 2018 1 commit
- Change ngraph_test_util to a static library (#2274) · c5bf6812
  Robert Kimball authored Dec 31, 2018
  
  c5bf6812
29 Dec, 2018 1 commit
- upgrade json to v3.5.0 (#2271) · 6a4454bc
  Robert Kimball authored Dec 29, 2018
  
  6a4454bc
28 Dec, 2018 2 commits

fix static ctor for non-trivial type (#2269) · da60db75
Robert Kimball authored Dec 28, 2018

da60db75

Forward nGraph's C/C++ compiler, build type and generator information. (#2267) · 8763557d

Sang Ik Lee authored Dec 28, 2018

* Forward nGraph's C/C++ compiler, build type and generator information to cmake based external projects.

* Fix typo.

* Pass generator related info properly.

* Googletest is using DEBUG_POSTFIX

* TBB uses postfix for debug lib.

* Fix typo.

8763557d

23 Dec, 2018 2 commits

Hybrid GPU Backend (#2240) · 90503652

Robert Kimball authored Dec 23, 2018

* Add GPUH hybrid backend

* update manifests

* update node operator<<

* fix GOE

* remove debug

* remove debug

* more cleanup

* add parent support to cpu and intel gpu backend tensors

* cleanup

* fix odd failure when printing node during construction

* fix node output

* address review comments

* style

90503652

Remove code designed to support the Ninja cmake generator (#2241) · 42f16035

Robert Kimball authored Dec 23, 2018

* update build byproducts to support ninja

* remove unused cmake code

* more cmake cleanup

* display error message if Ninja generator is requested

* fix mkldnn ext project

* revert onnx cmake file

* revert protobuf cmake file

* revert mlsl cmake file

* more fixing

42f16035

22 Dec, 2018 4 commits
- Add missing #include to reference/avg_pool.hpp (#2263) · 179fcdef
  Adam Procter authored Dec 22, 2018
  
  179fcdef
- fix windows build (#2259) · ad315a1a
  Robert Kimball authored Dec 22, 2018
  
  ad315a1a
- Remove old cudnn test. (#2257) · 7ecc1d12
  Chris Sullivan authored Dec 22, 2018
  
  7ecc1d12
- add unit test for nbench functionality (#2253) · 6d984a5a
  Robert Kimball authored Dec 22, 2018
  
  6d984a5a
21 Dec, 2018 2 commits

Support dynamic scales for Qconv's and Dequantize (#2171) · 7e310e20

Nishant Patel authored Dec 21, 2018

* Support dynamic scales for Qconv's and Dequantize

* Remove constant folding

* add additional dynamic_quantize unittest

* add another mxnet quantize unittest

* add additional dynamic_dequantize tests

* fix shape error

* add dynamic signed_quantize unittest

* Pass correct scale

* Refactoring

* Added dynamic scale support for QCBA and QCBSA

* Refactor to create MKLDNN primitives on the first iteration

* remove stray code

* unused variables

* remove extraneous line

7e310e20

Graph comparison testing quiet unless problem is detected (#2258) · c153ea8a
gcwenger authored Dec 21, 2018
```
* Graph comparison testing quiet unless problem is detected.

* Fixed file formatting

* Renamed ss => msg
```
c153ea8a