Commits · 76b8b4d45add3fc124a18680f245b3d2ffac91ac · submodule / ngraph

13 Nov, 2018 3 commits

[ONNX] Fix MatMul op for vec @ tensor multiplication (#1969) · 76b8b4d4

Adam Rogowiec authored Nov 13, 2018

* Add static keyword for helper function.

* Fix MatMul for cases where left hand side is 1D vector.

- Add unit-test for this case.

* Add new line at the end of file.

* Log warning when dealing with scalars

* Apply clang-format

* Review: fix spelling, rename test model.

76b8b4d4
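
ONNX MatMul is specified to behave like `numpy.matmul`, where a 1-D left operand is treated as a row matrix and the temporary leading axis is dropped from the result. A minimal sketch of that rule for the vector-times-2-D-matrix case follows; `vec_matmul` is a hypothetical helper written for illustration, not the importer code from this commit, and it does not cover higher-rank right operands.

```python
from typing import List


def vec_matmul(vec: List[float], mat: List[List[float]]) -> List[float]:
    """Multiply a 1-D vector by a 2-D matrix, numpy.matmul-style.

    The vector of length K is treated as a 1 x K matrix, multiplied by the
    K x N matrix, and the temporary leading axis is dropped again, so the
    result is a 1-D list of length N. (Hypothetical helper, not nGraph code.)
    """
    if len(vec) != len(mat):
        raise ValueError("inner dimensions do not match")
    n_cols = len(mat[0]) if mat else 0
    # Each output element is the dot product of the vector with one column.
    return [
        sum(vec[k] * mat[k][j] for k in range(len(vec)))
        for j in range(n_cols)
    ]
```

For example, `vec_matmul([1, 2], [[1, 2, 3], [4, 5, 6]])` yields a length-3 result rather than a 1 x 3 matrix.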

Ayzhuang/propagate cacheability (#1982) · 6e234d65

Amy Zhuang authored Nov 13, 2018

* Add cacheability propagation pass.

* Use a functor to create op annotations.

* Address PR feedback.

* Address PR feedback.

* Address PR feedback.

6e234d65

[ONNX CI] ONNX CI Improvements (#2026) · f0c17477

mchrusci authored Nov 13, 2018

* Kill previous builds on PR update

* Remove Jenkinsfile.groovy

* Lower case method names

* Fix method notify()

* Added comment

The previous-build deletion workaround is to be removed as soon as a better, less security-vulnerable solution is found.

* Fix inconsistent stage names

f0c17477

12 Nov, 2018 5 commits

Moved mkldnn conv availability checks to utils and use it across passes (#1984) · b04f3c36

Jayaram Bobba authored Nov 12, 2018

* Moved mkldnn conv availability checks to utils and use it across passes

* Style fix

b04f3c36

cse for convert layout (#1983) · 28002287

Pruthvi authored Nov 12, 2018

* cse for convert layout

* addressed PR comments

* Addressed PR comments

28002287

Quantize(reorder) bias to int32 (#1933) · 296ee2cf

Nishant Patel authored Nov 12, 2018

* Quantize the bias to int32

* Bias scale fix

* mnist works

* Quantize Bias

* Introduce Quantize op in the graph to quantize bias & feedback

* Comments and some refactoring

* Add test case with float bias and enable int32 as quantized type in ngraph

* Change shape of scale from Shape{} to Shape{1} in the backend

296ee2cf
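
The commit messages do not spell out the arithmetic, so the following is only a sketch of one common scheme: each float bias value is divided by a scale, rounded, and saturated to the int32 range. The name `quantize_bias_int32` and the choice to divide by the scale are assumptions made for illustration, not a description of nGraph's Quantize op.

```python
INT32_MIN = -2**31
INT32_MAX = 2**31 - 1


def quantize_bias_int32(bias, scale):
    """Quantize float bias values to int32 (hypothetical helper).

    Assumes q = round(bias / scale), saturated to the int32 range.
    Python's round() rounds halves to the nearest even integer.
    """
    if scale <= 0:
        raise ValueError("scale must be positive")
    quantized = []
    for b in bias:
        q = round(b / scale)
        # Saturate instead of wrapping around on overflow.
        quantized.append(max(INT32_MIN, min(INT32_MAX, q)))
    return quantized
```

Saturating rather than wrapping keeps an out-of-range bias from silently flipping sign.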

Tracing for CPU (#1956) · 71cc8bbf

Nick Korovaiko authored Nov 12, 2018

* tracing

* count tracepoint

* address Scott's feedback

* merge

* fix an unused var warning

71cc8bbf

Faster argmax/argmin kernels (#2032) · ff98d02a

Jayaram Bobba authored Nov 12, 2018

* Faster argmax/argmin kernels

* Use switch statement for macro

ff98d02a

11 Nov, 2018 2 commits

nvgpu softmax cuda version (#2014) · be9f031e

Fenglei authored Nov 11, 2018

* add softmax cuda support

* optimize block size

* remove debug info

* remove debug

* style

* remove unused

* remove cudnn softmax

* format

* using nullptr

* move helper, add test

* fix style

* using all_close_f

* using kahansum

* style

* remove commented out code

be9f031e
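
The "using kahansum" step above refers to compensated (Kahan) summation, which carries the rounding error of each addition forward so that small terms are not lost against a large running total. A minimal sketch of the technique in plain Python follows; `kahan_sum` is a hypothetical name and this is not the CUDA kernel from the commit.

```python
def kahan_sum(values):
    """Sum floats with Kahan compensation (illustrative sketch)."""
    total = 0.0
    compensation = 0.0  # running estimate of the low-order bits lost so far
    for v in values:
        y = v - compensation            # re-inject the previously lost part
        t = total + y                   # low-order bits of y may be lost here
        compensation = (t - total) - y  # recover what was just lost
        total = t
    return total
```

This matters for softmax because the denominator sums many exponentials of very different magnitudes.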

add isfinite check for all_close (#2028) · 702d465a

Fenglei authored Nov 11, 2018

* add isfinite check

* style

* output 5 diff and total diff

* output limit of diff for all_close_f

* fix bug

* disable tests

* remove failing unit test that does not make sense.

702d465a
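
A tolerance test of the form `abs(x - y) > tol` is false whenever either value is NaN, so without an explicit finiteness check a NaN result can pass as "close". The sketch below shows one way to combine the check with a capped list of reported differences and a total count; the function name, the tolerance defaults, and the return shape are assumptions for illustration, not the signature of nGraph's `all_close`.

```python
import math


def all_close_checked(a, b, rtol=1e-5, atol=1e-8, max_report=5):
    """Compare two float sequences element-wise (hypothetical helper).

    Returns (ok, reported, total), where reported holds at most
    max_report tuples of (index, a[i], b[i]) and total is the full
    mismatch count. Non-finite values always count as mismatches.
    """
    if len(a) != len(b):
        raise ValueError("length mismatch")
    mismatches = []
    for i, (x, y) in enumerate(zip(a, b)):
        if not (math.isfinite(x) and math.isfinite(y)):
            mismatches.append((i, x, y))
        elif abs(x - y) > atol + rtol * abs(y):
            mismatches.append((i, x, y))
    return len(mismatches) == 0, mismatches[:max_report], len(mismatches)
```

Capping the report keeps test logs readable while the total still shows how widespread the failure is.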

10 Nov, 2018 4 commits

Update some incorrect comments in Dot::generate_adjoints (#2045) · 804e381a

Adam Procter authored Nov 10, 2018

804e381a

Heterogeneous serialized graph testing across backends (#2020) · 40bcfdf7

gcwenger authored Nov 10, 2018

* Heterogeneous sub-graph comparison testing

* Print index for float differences

* Disabled compare_backends_with_graphs on most backends for now. Moved to new file. Added testing of unsigned values.

* Fixed element::boolean range. Added missing include.

* Switched use of shared_ptr as param to raw *. Moved to using namespace std in cpp. Fixed comment marker in unit_test.manifest files. Switched some EXPECT_EQ to ASSERT_EQ. Fixed parameterized test disabling.

* Frozen naming -> serialized. Removed extraneous comments.

* Graph comparison unit test relies on CPU for reference, so only build when CPU is built.

* Reworked per backend disabling of compare_backends_with_graphs

40bcfdf7

catch exceptions for convert_layout for visualize_tree (#2037) · 31e2765a

Nick Korovaiko authored Nov 10, 2018

31e2765a

Update l2_norm and std_dev builders to use op::Sqrt (#2040) · 3f561f5e

Adam Procter authored Nov 10, 2018

* Update l2_norm and std_dev builders to use op::Sqrt instead of op::Power(...,0.5)

* Removed unneeded power.hpp include

3f561f5e
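
Both builders end in a square root, and a dedicated square-root operation is typically cheaper and at least as accurate as a general power function with exponent 0.5. The sketch below mirrors that idea in plain Python; `l2_norm`, `std_dev`, and the `bessel_correction` flag are illustrative stand-ins here, not the nGraph builder signatures.

```python
import math


def l2_norm(values):
    """Euclidean norm using a dedicated square root (illustrative)."""
    return math.sqrt(sum(v * v for v in values))


def std_dev(values, bessel_correction=False):
    """Standard deviation; divides by n - 1 when bessel_correction is set."""
    n = len(values)
    if n == 0 or (bessel_correction and n < 2):
        raise ValueError("not enough values")
    mean = sum(values) / n
    sum_sq = sum((v - mean) ** 2 for v in values)
    divisor = n - 1 if bessel_correction else n
    return math.sqrt(sum_sq / divisor)
```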

09 Nov, 2018 11 commits

optimization for about 2x speedup (#2036) · 2fc73b43

Robert Kimball authored Nov 09, 2018

* optimization for about 2x speedup

* more optimizations

2fc73b43

swim a special case of broadcast (#2034) · 0ac2a8b6

Nick Korovaiko authored Nov 09, 2018

0ac2a8b6

Don't collapse unit size dimensions for dot ops (#2031) · 1daac094

Jayaram Bobba authored Nov 09, 2018

1daac094

more passes to static (#2027) · e54156cf

Nick Korovaiko authored Nov 09, 2018

e54156cf

Add experimental ShapeOf op (#2023) · 3a47eafc

Adam Procter authored Nov 09, 2018

* Add ShapeOf op

* Helps to check in the source files

* Add shape_of_scalar to unit test manifests

* Add missing include to gpu_emitter.cpp

* Change 'this op is experimental' wording per @indie's suggestion

* New idea: let's try not mallocing 300 terabytes

* Update interpreter implementation

3a47eafc

Many updates to latest nGraph Architecture and feature docs (#1953) · 3c830a69

L.S. Cook authored Nov 09, 2018

* editing docs

* more doc updates

* Cleanup theme, update backends for PlaidML, remove stale font

* Add PlaidML description and doc update that should have been added with PR 1888

* Add PlaidML description and doc update that should have been added with PR 1888

* Latest release doc updates

* Add PlaidML description and doc update for PR 1888
* Update glossary with tensor description and quantization def
* Refactor landpage with QuickStart guides
* Add better details about nGraph features and roadmap

* Placeholder detail for comparison section

* Add section link

* order sections alphabetically for now

* update compiler illustration

* Address feedback from doc review

* Update illustration wording

* Formatting and final edits

* keep tables consistent

* Clarify doc on bridge and compiler docs

* Clarify doc on bridge and compiler docs

* yay for more feedback and improvements

* edit with built doc

* Fix typo

* Another phase of PR review editing

* Final review comment resolved

3c830a69

DropOut for INT (#2029) · 0bb9368a

Nick Korovaiko authored Nov 09, 2018

0bb9368a

Fix gtest and LLVM builds with ABI flag setting (#2019) · ed14b94f

Robert Kimball authored Nov 09, 2018

* fix gtest abi build

* fix llvm build with abi flag

* remove debug

* add check for conflicting flags in cmake

ed14b94f

Interpreter rework (#2030) · 6f511762

Robert Kimball authored Nov 09, 2018

* all tests passing

* rename a few vars to be consistent with new tensor names

6f511762

onnxifi: add exceptions for statuses (#1994) · b52a7798

Artur Wojcik authored Nov 09, 2018

Signed-off-by: Artur Wojcik <artur.wojcik@intel.com>

b52a7798

Add in-place-slice optimization for CPU backend. (#1967) · 65355a17

Amy Zhuang authored Nov 09, 2018

* Add in-place-slice optimization for CPU backend.

* Modify slice emitter function for in place slice.

* Allow arg node to have multiple outputs for in place slice.

* Remove unused variable.

* Add CPUExecutionContext argument to slice builder.

* Address PR feedback: move computation out of the functor.

* Move size computation out of the functor for in place concat.

65355a17

08 Nov, 2018 6 commits

Address potential bug in cudnnGetReductionWorkspaceSize (#1990) · dfc20454

Chris Sullivan authored Nov 08, 2018

* When CUDNN_DATA_TYPE == CUDNN_DATA_DOUBLE, it appears that the cudnn calculated workspace size is incorrect. Adding a temporary fix here until the underlying issue is found.

* Add softmax test illustrating bug in cudnn impl.

* disable new unit test in intel GPU

dfc20454

nvgpu cuda reduce (#1988) · 32398641

Fenglei authored Nov 08, 2018

* change reduce using cuda, add support for AND, OR

* fix bug and format

* remove unused code

* style

* change reduce_op to reduce_func to avoid shadow, thanks Ayan.

* using dynamic_pointer_cast

32398641

[ONNX CI] ONNX CI fixes (#2024) · 77899668

mchrusci authored Nov 08, 2018

77899668

[ONNXIFI] Change variable names (#1993) · 29f23128

Artur Wojcik authored Nov 08, 2018

29f23128

Changed timeout to 15 minutes without activity (#2016) · f7adcbf4

mchrusci authored Nov 08, 2018

f7adcbf4

Isolate mutable pass managers (#2015) · 5720e319

Rob Earhart authored Nov 08, 2018

5720e319

07 Nov, 2018 9 commits

IAT: Collapse dims for Dot ops (#1991) · e5d9b540

Jayaram Bobba authored Nov 07, 2018

* Collapse dimensions for inputs to Dot

* Remove eigen kernels for higher dimension dots since they will collapse to cblas_gemm kernels

* Moved collapse dims pass after the fusion passes to prevent interference with fusion patterns. Use cblas_gemm for 2D dot

e5d9b540
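
Collapsing dimensions lets a higher-rank dot be handed to a 2-D GEMM: the non-reduced axes of each input are flattened into one axis and the reduced axes into another. The sketch below computes the collapsed shapes only, assuming the reduction runs over the trailing axes of the left input and the leading axes of the right input; `collapse_dot_shapes` is a hypothetical helper, not the pass added in this commit.

```python
from math import prod


def collapse_dot_shapes(shape_a, shape_b, reduction_axes_count=1):
    """Return 2-D shapes (m, k) and (k, n) for a GEMM-style dot.

    Assumes the last reduction_axes_count axes of shape_a are reduced
    against the first reduction_axes_count axes of shape_b.
    """
    r = reduction_axes_count
    if r < 0 or r > len(shape_a) or r > len(shape_b):
        raise ValueError("invalid reduction axes count")
    split = len(shape_a) - r
    if tuple(shape_a[split:]) != tuple(shape_b[:r]):
        raise ValueError("reduced axes do not match")
    m = prod(shape_a[:split])  # flattened non-reduced axes of A
    k = prod(shape_a[split:])  # flattened reduced axes
    n = prod(shape_b[r:])      # flattened non-reduced axes of B
    return (m, k), (k, n)
```

For instance, a `(2, 3, 4)` tensor dotted with a `(4, 5)` matrix becomes a `(6, 4)` by `(4, 5)` matrix product, whose result can then be reshaped back to `(2, 3, 5)`.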

add dtype-generic load definitions and clean up nvrtc helpers (#1975) · f33317cc

Chris Sullivan authored Nov 07, 2018

* Refactor include_helpers into an nvrtc specific helper file. Add templated define functions for coherent and noncoherent memory loads.

* Format

* const refs.

* Remove cast of zero.

f33317cc

Update pattern ops to propagate partial shapes (#1986) · af889535

Adam Procter authored Nov 07, 2018

af889535

NOP backend (#1979) · 4918449c

Robert Kimball authored Nov 07, 2018

* add nop backend

* nop backend

* fix flag name

* add new switch to cmake output of switch settings

* add new unit test to igpu manifest

* remove redundant test

4918449c

address issues which surface with clang 6.0 (#1980) · 79802dcf

Robert Kimball authored Nov 07, 2018

* address issues which surface with clang 6.0

* revert changes due to new clang warning and disable new warning

79802dcf

graph builders for quantize scale (#1976) · 8bd3846f

Adam Straw authored Nov 07, 2018

* quantize scale passing unit tests

* epsilon bump

* finished with quantization scale

* unit tests passing with convolution scale as builder

* broadcasted constants and cleanup

* api consistency for quant builders

* code style

* cleanup

* newline at EOF

* use requantization_scale

* drop TF license as we are no longer using TF code directly

8bd3846f

Do not fuse nodes if one node is predecessor of another node in horiz… (#1928) · 2a26558a

Amy Zhuang authored Nov 07, 2018

* Do not fuse nodes if one node is predecessor of another node in horizontal fusion.

* Add dead node check and remove predecessor check in horizontal fusion.

2a26558a

Add a real HybridBackend (#1998) · 45fba7b1

Robert Kimball authored Nov 07, 2018

* wip

* wip

* wip

* move hybrid wrapper to hybrid backend dir

* move hybrid wrapper to correct namespace

* wip

* sorta working

* remove debug from sorta working homogeneous hybrid backend

* is_supported is supported for GPU

* cleanup debug

* more progress

* remove debug

* cleanup

* turn off hybrid by default

* revert change

* revert

* rename wrapper to backend

* revert

* address review comments

* style

45fba7b1

Jbobba/halide (#1971) · ba73e2b8

Jayaram Bobba authored Nov 07, 2018

* Add missing halide dependency

* Bug fix in halide op creation

* Localize halide/llvm to cpu backend

* Added comments

* Pass NGRAPH_HALIDE to tests

* Resolve merge conflicts

ba73e2b8