Commits · b19fa875e2113d2de13419ac1b109741e67afdb7 · submodule / ngraph

02 Jun, 2019 8 commits

[MLIR] Move MLIR to src/contrib/mlir (#16) · b19fa875
Nagy Mostafa authored 5 years ago
```
* Move MLIR to src/contrib/mlir
```
b19fa875
[MLIR] Enable LLVM optimizations in execution engine. (#10) · ac8df2bb
Diego Caballero authored 5 years ago
```
This patch enables LLVM optimizations at -O3 level.
```
ac8df2bb
[MLIR] Rename LoopKernel->ComputedKernel. Move it to experimental core ops (#12) · 978691b4
Diego Caballero authored 5 years ago
```
We want to use ComputedKernel for any target to delimit sub-graphs to be
compiled and executed with MLIR.
```
978691b4

[MLIR] Move mlir code under runtime/mlir · d9dd03ce

Nagy Mostafa authored 5 years ago

* Create MLIR as cmake external project. Clone and build via ngraph cmake

* Moved code and enabled compilation. Need to clone and build MLIR/LLVM during cmake step, so find_package can work

* clone and build llvm/mlir during configuration. Compiles now. Needs more testing

* Force DEX only if MLIR is ON

* Remove extra cmake file. Style-apply

* Remove redundant files in cpu/mlir

* Update CODEOWNERS. Check for ninja and fail if not found

* Fixing post merge issues

d9dd03ce

[MLIR] Mem manager (#9) · e941412e

Nagy Mostafa authored 5 years ago

* Implements a simple memory manager that just does malloc for now. Pointers are freed during cleanup.
* Enable JIT call-back to memory manager to allocate temps.
* Memory manager pointer is passed to the JIT'ed code upon invocation. That makes the code re-entrant from different threads in case the code is shared among identical sub-graphs that are executed in parallel.

e941412e

[MLIR] Add MatmulBias op with basic support for simple matmuls (#8) · ba735a80
Diego Caballero authored 5 years ago
```
The following test should work now:
NGRAPH_MLIR_DUMP_ALL=1 NGRAPH_MLIR=1 test/unit-test '--gtest_filter=CPU.dot2d'
```
ba735a80
[MLIR] Add JIT compilation and execution of mlir code · dd5c6fb6
Diego Caballero authored 5 years ago

dd5c6fb6

[MLIR] Initial PoC: NG dialect, dialect code-gen, dialect lowering to affine, no JIT yet · a5c99754

Nagy Mostafa authored 5 years ago

* Link MLIR static libs to cpu backend

* Use LLVMConfig.cmake

* Initial commit. Link fails with undefined reference to typeinfo for mlir::Dialect

* Added AddOp

* initial compiler class

* Initialize module/function, and map tensors to arguments

* Code compiles. Moved MLIR building to correct DEX handler

* NGDialect code-gen working

* Use vector instead of sets for i/o tensors. Use functor in executor

* Misc fixes

* style-apply

* WIP: Adding support for dialect lowering.

* WIP: Lowered to affine. Crash on constant ops have side effects in Constant Folding

* Fixed missing whole package linkage.

* Removed fake instruction and update func type

*  Enable lowering to LLVM dialect and IR

* Made loop nest builder handle any rank

* Fixes per PR feedback. Major ones:
- Removed ngdialect namespace
- renamed dialect classes to start with NG prefixwq:w

* Add unreachable assert

* Add reading of LLVM options from an env var MLIR_LLVM_OPTIONS (#5)

a5c99754

31 May, 2019 6 commits

[MLIR] Link MLIR static libs to cpu backend · 021399a1
Nagy Mostafa authored 5 years ago
```
* Link MLIR static libs to cpu backend

* Use LLVMConfig.cmake
```
021399a1

Bob/hybrid multi (#3005) · e49dd589

Robert Kimball authored 5 years ago

* handle case where a node's output is connected multiple inputs of another node

* fix creation of the FunctionCall to have the correct outputs

* fix per review comment

Unverified

e49dd589

Cleanup how compile flags are set and used by nGraph and external projects. (#2942) · 08dcd01b

Sang Ik Lee authored 5 years ago

* Cleanup how compile flags set and used by nGraph and external projects.
Set C++11 through CMake and pass it down to external projects.
Prefer CMake variables such as CMAKE_POSITION_INDEPENDENT_CODE and
CMAKE_CXX_STANDARD instead of explicitly setting compiler dependent
flags.
Create json compilation database for external projects.
CMAKE_CXX_FLAGS is used as common global options for nGraph and external
projects.
add_compile_options() is used for local options for current and sub
directories.
add_definitions() is used for setting definitions for current and sub
directories.
Note: Global options are not passed down to some external projects.
Note: mkl-dnn resets CMAKE_CXX_FLAGS internally.
Note: TBB and MLSL are not CMake based.
Noet: Eigen and json is header only library.

* Fix error.

* Fix error. (second attempt)

* Cleanup code.

* Allow check for undefined macro.

* Try to fix cldnn issue.

* Set type for CMake arguments.

* Pass C++ standard to protobuf.

* Pass C++ standard down to TBB.

* Change how Clang specific flags are handled.

* Fix error.

* Workaround for compile error on Baidu's PDPD docker.

* Fix windows build error.

Unverified

08dcd01b

Add check to hybridexecutable::get_as. (#2998) · b520e839
Chris Sullivan authored 5 years ago

b520e839
Tweak backend constructor for gcc 4.8.5 (#3001) · e3330b47
Rob Earhart authored 5 years ago

e3330b47
Remove unused .gitmodule (#2997) · e4c5aa8f
Sang Ik Lee authored 5 years ago

e4c5aa8f

30 May, 2019 2 commits

Initial implementation of implicit broadcasting for eltwise ops (#2936) · 0caefe7d

Jayaram Bobba authored 5 years ago

* Initial implementation of implicit broadcasting for eltwise ops. Only Add supported

* Addressed PR feedback

* cleanup

* Rename Bcast to Broadcast

* Autobroadcasting support for rest of elementwise ops

* Serializer support for autobroadcast

* Added missing autob serialization for Minimum

* Added execution unit tests and more op types to implicit broadcast elimination

* Addressed PR feedback

* Fixes windows build issue

* RVO optimization per PR feedback

0caefe7d

Much faster serialize/deserialize of broadcast constants (#2993) · 4971bdf1
Robert Kimball authored 5 years ago
```
* serialize constant faster

* more speedup
```
Unverified

4971bdf1

29 May, 2019 7 commits

[Fused] FakeQuantize operation. (#2928) · 36422810

Adam Rogowiec authored 5 years ago

* Draft of FakeQuantize operation along with UTs.

* Add FakeQuantize to implemented operators on IGPU.

* Get back FakeQuantize op case to switch.

* Fix compilation errors.

* Skip test for INTERPRETER backend and disable type_prop tests.

* Initial implementation covering the most basic case

* Cleanup of fake_quantize_with_clip UT

* Reformat the cpu unit tests manifest and unlock anothe fake quant UT

* Handle the clipping case by subtracting input_low from quantization input

* Clip the input data before quantization to avoid Selects

* UT manifest fix

* Obsolete comment removed

* Code formatting

* Broadcast input data for non-scalar in/out params

* Code formatting

* Enable the type prop tests for FakeQuantize

* Dequant the data without using the Dequantize op (fixes an edge case)

36422810

Added option in order to build nGraph core static library (#2989) · 8707fba8
Ilya Churaev authored 5 years ago

8707fba8
Move reshape functions from utils to builder. (#2984) · db34286c
Adam Rogowiec authored 5 years ago
```
* Move reshape from utils to builder.

* Add aliases to functions in old place and describe changes.
```
db34286c
Removed unnecessary write from autodiff::get_autodiff (#2988) · c06bf6e1
gcwenger authored 5 years ago

c06bf6e1
fix broken doc strings (#2981) · 445c8158
Robert Kimball authored 5 years ago

445c8158

[FusedOps] ShuffleChannels (#2927) · 1fdf14ae

Tomasz Dołbniak authored 5 years ago

* ShuffleChannels implementation

* Validation of ShuffleChannels params

* Implementation of ShuffleChannels decompose_op()

* Formatting adjustments

* Corrected implementation and validation of op params

* Basic test of ShuffleChannels

* Negative axis value test

* Default params for the ShuffleChannels op

* ShuffleChannels test with floats

* ShuffleChannels validation unit tests

* PR comments

* Compilation error fix

* PR feedback and cleanup

* Code formatting adjustment

* Negative axis value documentation

* Docs update (PR feedback)

* PR feedback: shape and axis validation

* Modify axis semantics on shuffle op

* Revert "PR feedback: shape and axis validation"

This reverts commit 21b708e710b91da2a7e37a69c0da1f31c7743b47.

1fdf14ae

Switch to clDNN version with conformance fix for 3 ONNX models (DenseNet-121,… · 7d4bdab7
Dmitry Yershov authored 5 years ago
```
Switch to clDNN version with conformance fix for 3 ONNX models (DenseNet-121, Inception-v2, ResNet-50) (#2982)
```
7d4bdab7

28 May, 2019 2 commits
- Switch off the failing unit tests for iGPU (#2980) · 67e23441
  Tomasz Dołbniak authored 5 years ago
  
  67e23441
- Leona/doc v0.20 (#2971) · 14f16bc1
  Leona C authored 5 years ago
```
* Cleanup section

* Add updated illustrations for pattern_matcher and tensor_descriptor

* Add subsection link to be consistent
```
  14f16bc1
25 May, 2019 1 commit
- update a few files to build on windows (#2974) · 39cdee0e
  Robert Kimball authored 5 years ago
```
* update a few files to build on windows

* more fixes
```
  39cdee0e
24 May, 2019 10 commits

Switch some get_inputs uses to use the newer inputs (#2968) · 0c813cf2
Scott Cyphers authored 5 years ago
```
* Switch some get_inputs uses to use the newer inputs

* Review comments
```
Unverified

0c813cf2
CTCGreedyDecoder layer op (#2965) · 513f8de6
Jayaram Bobba authored 5 years ago
```
* Added CTCGreedyDecoder layer op

* Added comment on seq_len validation checks
```
513f8de6
Backport fix from #2973 (#2976) · cf5e3623
Adam Procter authored 5 years ago

cf5e3623

Add save/load API to runtime (#2955) · 7ad4c0ab

Robert Kimball authored 5 years ago

* API defined

* add unit test for save/load with INTERPRETER

* Update per review comments

* fix compiler error

Unverified

7ad4c0ab

Added accessor methods for layer op attributes (#2964) · 4ec94acc
Jayaram Bobba authored 5 years ago
```
* Added accessor methods for layer op attributes

* style fixes and addressed PR feedback
```
4ec94acc
IntelGPU backend: Switch to clDNN which is compatible with gcc4.8 (#2961) · 9ad52bfa
Dmitry Yershov authored 5 years ago

9ad52bfa

[ONNX] Unit tests for QLinearMatMul (#2706) · 9560c1fa

Michał Karzyński authored 5 years ago

* [ONNX] Unit test models for QLinearMatMul

* [ONNX] Extended types support for NgraphTestCase

* [ONNX] Move the value comparators to the NgraphTestCase class

* Add test cases

* Add shape checking

* disable GPU tests

9560c1fa

Make private members protected in hybrid classes (#2975) · d169f929
Robert Kimball authored 5 years ago
```
* make private members protected in hybrid classes

* allow overriding the passes
```
d169f929

[Fused] LeakyRelu op (#2919) · 5650e913

Michał Karzyński authored 5 years ago

* [Fused] LeakyRelu op

* Add LeakyRelu to serializer

* Add unit tests

* Fix merge branch 'master' into mkarzyns/fused_leaky_relu

* Change broadcasting rules to NumPy style

* Remove std:: and ngraph:: prefixes

* Rename CPU Runtime LeakyRelu to CPULeakyRelu

* Style apply

* Fix cpu_fusion.fuse_leaky_relu test

* Use eigen's tanh in the fused sigmoid multiply kernel (#2946)

* Merge branch 'master' into mkarzyns/fused_leaky_relu

* Add LeakyRelu to Intel GPU backend op list

* Add LeakyRelu to Intel GPU backend op list

5650e913

Create tensor for the primary backend (#2970) · 04be484a
Robert Kimball authored 5 years ago
```
* create tensor for the primary backend

* move private objects to protected
```
04be484a

23 May, 2019 4 commits
- Fix Convert for boolean output type in CODEGEN. (#2958) · fde204b0
  Amy Zhuang authored 5 years ago
  
  fde204b0
- Move zero padded conv fusions from CPUFusion to CoreFusion. (#2969) · f7b13dc4
  Amy Zhuang authored 5 years ago
```
* Move zero padded conv fusions from CPUFusion to CoreFusion.

* Address PR feedback: move unit tests to core_fusion.
```
  f7b13dc4
- Remove functions from cpu which were moved to core (#2962) · 49baa903
  gaurides authored 5 years ago
```
* Remove functions from cpu which were moved to core

* Fix a typo

* Remove unused function
```
  49baa903
- Allow NGRAPH_VISUALIZE_TREE_OUTPUT_SHAPES to output partial shapes (#2959) · df2a27ad
  Adam Procter authored 5 years ago
  
  df2a27ad