Commits · 40bcfdf76bad03913616836eb35e6206d7bdec7d · submodule / ngraph

10 Nov, 2018 3 commits

Heterogeneous serialized graph testing across backends (#2020) · 40bcfdf7

gcwenger authored Nov 10, 2018

* Heterogeneous sub-graph comparison testing

* Print index for float differences

* Disabled compare_backends_with_graphs on most backends for now. Moved to new file. Added testing of unsigned values.

* Fixed element::boolean range. Added missing include.

* Switched use of shared_ptr as param to raw *. Moved to using namespace std in cpp. Fixed comment marker in unit_test.manifest files. Switched some EXPECT_EQ to ASSERT_EQ. Fixed parameterized test disabling.

* Frozen naming -> serialized. Removed extraneous comments.

* Graph comparison unit test relies on CPU for reference, so only build when CPU is built.

* Reworked per backend disabling of compare_backends_with_graphs

catch exceptions for convert_layout for visualize_tree (#2037) · 31e2765a
Nick Korovaiko authored Nov 10, 2018

Update l2_norm and std_dev builders to use op::Sqrt (#2040) · 3f561f5e

Adam Procter authored Nov 10, 2018

* Update l2_norm and std_dev builders to use op::Sqrt instead of op::Power(...,0.5)

* Removed unneeded power.hpp include

09 Nov, 2018 11 commits

optimization for about 2x speedup (#2036) · 2fc73b43
Robert Kimball authored Nov 09, 2018
```
* optimization for about 2x speedup

* more optimizations
```
swim a special case of broadcast (#2034) · 0ac2a8b6
Nick Korovaiko authored Nov 09, 2018

Don't collapse unit size dimensions for dot ops (#2031) · 1daac094
Jayaram Bobba authored Nov 09, 2018

more passes to static (#2027) · e54156cf
Nick Korovaiko authored Nov 09, 2018

Add experimental ShapeOf op (#2023) · 3a47eafc

Adam Procter authored Nov 09, 2018

* Add ShapeOf op

* Helps to check in the source files

* Add shape_of_scalar to unit test manifests

* Add missing include to gpu_emitter.cpp

* Change 'this op is experimental' wording per @indie's suggestion

* New idea: let's try not mallocing 300 terabytes

* Update interpreter implementation

Many updates to latest nGraph Architecture and feature docs (#1953) · 3c830a69

L.S. Cook authored Nov 09, 2018

* editing docs

* more doc updates

* Cleanup theme, update backends for PlaidML, remove stale font

* Add PlaidML description and doc update that should have been added with PR 1888

* Latest release doc updates

* Add PlaidML description and doc update for PR 1888
* Update glossary with tensor description and quantization def
* Refactor landpage with QuickStart guides
* Add better details about nGraph features and roadmap

* Placeholder detail for comparison section

* Add section link

* order sections alphabetically for now

* update compiler illustration

* Address feedback from doc review

* Update illustration wording

* Formatting and final edits

* keep tables consistent

* Clarify doc on bridge and compiler docs

* yay for more feedback and improvements

* edit with built doc

* Fix typo

* Another phase of PR review editing

* Final review comment resolved

DropOut for INT (#2029) · 0bb9368a
Nick Korovaiko authored Nov 09, 2018

Fix gtest and LLVM builds with ABI flag setting (#2019) · ed14b94f

Robert Kimball authored Nov 09, 2018

* fix gtest abi build

* fix llvm build with abi flag

* remove debug

* add check for conflicting flags in cmake

Interpreter rework (#2030) · 6f511762

Robert Kimball authored Nov 09, 2018

* all tests passing

* rename a few vars to be consistent with new tensor names

onnxifi: add exceptions for statuses (#1994) · b52a7798
Artur Wojcik authored Nov 09, 2018
```
Signed-off-by: Artur Wojcik <artur.wojcik@intel.com>
```
Add in-place-slice optimization for CPU backend. (#1967) · 65355a17

Amy Zhuang authored Nov 09, 2018

* Add in-place-slice optimization for CPU backend.

* Modify slice emitter function for in place slice.

* Allow arg node to have multiple outputs for in place slice.

* Remove unused variable.

* Add CPUExecutionContext argument to slice builder.

* Address PR feedback: move computation out of the functor.

* Move size computation out of the functor for in place concat.

08 Nov, 2018 6 commits
- Address potential bug in cudnnGetReductionWorkspaceSize (#1990) · dfc20454
  Chris Sullivan authored Nov 08, 2018
```
* When CUDNN_DATA_TYPE == CUDNN_DATA_DOUBLE, it appears that the cudnn calculated workspace size is incorrect.
Adding a temporary fix here until the underlying issue is found.

* Add softmax test illustrating bug in cudnn impl.

* disable new unit test in intel GPU
```
- nvgpu cuda reduce (#1988) · 32398641
  Fenglei authored Nov 08, 2018
```
* change reduce using cuda, add support for AND, OR

* fix bug and format

* remove unused code

* style

* change reduce_op to reduce_func to avoid shadow, thanks Ayan.

* using dynamic_pointer_cast
```
- [ONNX CI] ONNX CI fixes (#2024) · 77899668
  mchrusci authored Nov 08, 2018
  
- [ONNXIFI] Change variable names (#1993) · 29f23128
  Artur Wojcik authored Nov 08, 2018
  
- Changed timeout to 15 minutes without activity (#2016) · f7adcbf4
  mchrusci authored Nov 08, 2018
  
- Isolate mutable pass managers (#2015) · 5720e319
  Rob Earhart authored Nov 08, 2018
  
07 Nov, 2018 9 commits

IAT: Collapse dims for Dot ops (#1991) · e5d9b540

Jayaram Bobba authored Nov 07, 2018

* Collapse dimensions for inputs to Dot

* Remove eigen kernels for higher dimension dots since they will collapse to cblas_gemm kernels

* Moved collapse dims pass after the fusion passes to prevent interference with fusion patterns. Use cblas_gemm for 2D dot

add dtype-generic load definitions and clean up nvrtc helpers (#1975) · f33317cc

Chris Sullivan authored Nov 07, 2018

* Refactor include_helpers into an nvrtc specific helper file. Add templated define functions for coherent and noncoherent memory loads.

* Format

* const refs.

* Remove cast of zero.

Update pattern ops to propagate partial shapes (#1986) · af889535
Adam Procter authored Nov 07, 2018

NOP backend (#1979) · 4918449c

Robert Kimball authored Nov 07, 2018

* add nop backend

* nop backend

* fix flag name

* add new switch to cmake output of switch settings

* add new unit test to igpu manifest

* remove redundant test

address issues which surface with clang 6.0 (#1980) · 79802dcf

Robert Kimball authored Nov 07, 2018

* address issues which surface with clang 6.0

* revert changes due to new clang warning and disable new warning

graph builders for quantize scale (#1976) · 8bd3846f

Adam Straw authored Nov 07, 2018

* quantize scale passing unit tests

* epsilon bump

* finished with quantization scale

* unit tests passing with convolution scale as builder

* broadcasted constants and cleanup

* api consistency for quant builders

* code style

* cleanup

* newline at EOF

* use requantization_scale

* drop TF license as we are no longer using TF code directly

Do not fuse nodes if one node is predecessor of another node in horiz… (#1928) · 2a26558a

Amy Zhuang authored Nov 07, 2018

* Do not fuse nodes if one node is predecessor of another node in horizontal fusion.

* Add dead node check and remove predecessor check in horizontal fusion.

Add a real HybridBackend (#1998) · 45fba7b1

Robert Kimball authored Nov 07, 2018

* wip

* wip

* wip

* move hybrid wrapper to hybrid backend dir

* move hybrid wrapper to correct namespace

* wip

* sorta working

* remove debug from sorta working homogeneous hybrid backend

* is_supported is supported for GPU

* cleanup debug

* more progress

* remove debug

* cleanup

* turn off hybrid by default

* revert change

* revert

* rename wrapper to backend

* revert

* address review comments

* style

Jbobba/halide (#1971) · ba73e2b8

Jayaram Bobba authored Nov 07, 2018

* Add missing halide dependency

* Bug fix in halide op creation

* Localize halide/llvm to cpu backend

* Added comments

* Pass NGRAPH_HALIDE to tests

* Resolve merge conflicts

06 Nov, 2018 7 commits
- remove debug statement (#2012) · f85b1b83
  Robert Kimball authored Nov 06, 2018
  
- fix regex for version splitting (#1981) · 782ecd2f
  Robert Kimball authored Nov 06, 2018
  
- Update CODEOWNERS for license files and CI/Docker files (#1978) · a344bc4b
  Adam Procter authored Nov 06, 2018
```
* Update CODEOWNERS for /licenses and /LICENSE

* Review comments

* Minor formatting
```
- Missing header (#1992) · 08254b19
  Scott Cyphers authored Nov 06, 2018
  
- [ONNX] Support for external weights to enable Caffe2 models (#1941) · a5d3c78d
  Artur Wojcik authored Nov 06, 2018
```
* onnx: enable external weights to enable Caffe2 support
Signed-off-by: Artur Wojcik <artur.wojcik@intel.com>

* onnx: update ONNX importer interface documentation
Signed-off-by: Artur Wojcik <artur.wojcik@intel.com>

* onnx: after review updates
Signed-off-by: Artur Wojcik <artur.wojcik@intel.com>
```
- Added variable to extend the CMAKE_INSTALL_RPATH settings (#1974) · d37cafd9
  Ransford Hyman Jr authored Nov 06, 2018
  
- Misc fixes for partial shapes (#1987) · ba640dbb
  Adam Procter authored Nov 06, 2018
  
05 Nov, 2018 4 commits

TopK additional tests for nvGPU backend (#1946) · 37dc586c

Ayan Moitra authored Nov 05, 2018

* added tests for malloc mode and graph transform

* Comment incorporation

* changed comparing backend to INTERPRETER

* Comments resolved + clang

* Addressed all comments

* IntelGPU does not support topk

enable hybrid test with graph splits (#1960) · d9f615b7

Sandeep authored Nov 05, 2018

* size_t for placement in node

* enable hybrid backend test

* style

* cp placement functions

* placement size_t based functions

* placement based on backends

* add placement based on size_t

* backend size_t based placement

* call

* update

* resolve bug

* format

* revert cmake changes

* address PR comments

* ci error

* pr comments

extend cse to handle backend ops (#1972) · 6b3f3a0a
Nick Korovaiko authored Nov 05, 2018
```
* extend cse to handle backend ops

* revert back to static casts
```
Make debug logging threadsafe (#1977) · ee6444ed
Rob Earhart authored Nov 05, 2018
```
* Make debug logging threadsafe

* Add nil stream comments
```