- 02 Jun, 2019 19 commits
-
-
Diego Caballero authored
This extends the Dot op definition to handle operands that are not square tensors. It also fixes a related bug in the lowerer.
-
Diego Caballero authored
* [MLIR] Replace MatmulBiasOp with DotOp. We disable CPUFusion if MLIR is enabled, to avoid introducing CPU-specific ops for now.
-
Nagy Mostafa authored
* Upgrade MLIR. Several code fixes based on API changes * Fixes due to DialectConv API changes * style-apply * PR fixes
-
Diego Caballero authored
MLIR is now initialized only once in the CPU backend.
-
Nagy Mostafa authored
* Move dialect and types to mlir namespace * PR fixes and some cleanup * Merge fix
-
Nagy Mostafa authored
-
Nagy Mostafa authored
* Enable module verification. Fix FakeInput result type to memref * style-apply
-
Nagy Mostafa authored
* Initial td file. Cmake changes * Move all ops to .td file. * Added a few more opcodes to showcase. Fixed PR feedback * Remove NG_ prefix of opcode records. Some fixes * Added some doc * Adding back NG prefix * Bug fix in MLIR gen
-
Nagy Mostafa authored
-
Diego Caballero authored
This patch leverages CompiledKernel to delimit sub-graphs to be compiled with MLIR. It introduces a pass that creates a CompiledKernel for the whole function (for now) and changes MLIRCompiler to align with this new approach.
-
Nagy Mostafa authored
* Use NGRAPH export macros instead of CPU * Move code to ngmlir namespace
-
Nagy Mostafa authored
* Move MLIR to src/contrib/mlir
-
Diego Caballero authored
This patch enables LLVM optimizations at -O3 level.
-
Diego Caballero authored
We want to use CompiledKernel for any target to delimit sub-graphs to be compiled and executed with MLIR.
-
Nagy Mostafa authored
* Create MLIR as cmake external project. Clone and build via ngraph cmake * Moved code and enabled compilation. Need to clone and build MLIR/LLVM during cmake step, so find_package can work * clone and build llvm/mlir during configuration. Compiles now. Needs more testing * Force DEX only if MLIR is ON * Remove extra cmake file. Style-apply * Remove redundant files in cpu/mlir * Update CODEOWNERS. Check for ninja and fail if not found * Fixing post merge issues
-
Nagy Mostafa authored
* Implements a simple memory manager that just does malloc for now. Pointers are freed during cleanup. * Enable JIT call-back to memory manager to allocate temps. * Memory manager pointer is passed to the JIT'ed code upon invocation. That makes the code re-entrant from different threads in case the code is shared among identical sub-graphs that are executed in parallel.
-
Diego Caballero authored
The following test should work now: NGRAPH_MLIR_DUMP_ALL=1 NGRAPH_MLIR=1 test/unit-test '--gtest_filter=CPU.dot2d'
-
Diego Caballero authored
-
Nagy Mostafa authored
* Link MLIR static libs to cpu backend * Use LLVMConfig.cmake * Initial commit. Link fails with undefined reference to typeinfo for mlir::Dialect * Added AddOp * Initial compiler class * Initialize module/function, and map tensors to arguments * Code compiles. Moved MLIR building to correct DEX handler * NGDialect code-gen working * Use vector instead of sets for i/o tensors. Use functor in executor * Misc fixes * style-apply * WIP: Adding support for dialect lowering. * WIP: Lowered to affine. Crash because constant ops have side effects in Constant Folding * Fixed missing whole-package linkage. * Removed fake instruction and updated func type * Enable lowering to LLVM dialect and IR * Made loop nest builder handle any rank * Fixes per PR feedback. Major ones: - Removed ngdialect namespace - Renamed dialect classes to start with NG prefix * Add unreachable assert * Add reading of LLVM options from an env var MLIR_LLVM_OPTIONS (#5)
-
- 31 May, 2019 6 commits
-
-
Nagy Mostafa authored
* Link MLIR static libs to cpu backend * Use LLVMConfig.cmake
-
Robert Kimball authored
* handle case where a node's output is connected to multiple inputs of another node * fix creation of the FunctionCall to have the correct outputs * fix per review comment
-
Sang Ik Lee authored
* Cleanup how compile flags are set and used by nGraph and external projects. Set C++11 through CMake and pass it down to external projects. Prefer CMake variables such as CMAKE_POSITION_INDEPENDENT_CODE and CMAKE_CXX_STANDARD instead of explicitly setting compiler-dependent flags. Create a json compilation database for external projects. CMAKE_CXX_FLAGS is used for common global options for nGraph and external projects. add_compile_options() is used for local options for the current and sub directories. add_definitions() is used for setting definitions for the current and sub directories. Note: Global options are not passed down to some external projects. Note: mkl-dnn resets CMAKE_CXX_FLAGS internally. Note: TBB and MLSL are not CMake based. Note: Eigen and json are header-only libraries. * Fix error. * Fix error. (second attempt) * Cleanup code. * Allow check for undefined macro. * Try to fix cldnn issue. * Set type for CMake arguments. * Pass C++ standard to protobuf. * Pass C++ standard down to TBB. * Change how Clang specific flags are handled. * Fix error. * Workaround for compile error on Baidu's PDPD docker. * Fix windows build error.
-
Chris Sullivan authored
-
Rob Earhart authored
-
Sang Ik Lee authored
-
- 30 May, 2019 2 commits
-
-
Jayaram Bobba authored
* Initial implementation of implicit broadcasting for eltwise ops. Only Add supported * Addressed PR feedback * cleanup * Rename Bcast to Broadcast * Autobroadcasting support for rest of elementwise ops * Serializer support for autobroadcast * Added missing autob serialization for Minimum * Added execution unit tests and more op types to implicit broadcast elimination * Addressed PR feedback * Fixes windows build issue * RVO optimization per PR feedback
-
Robert Kimball authored
* serialize constant faster * more speedup
-
- 29 May, 2019 7 commits
-
-
Adam Rogowiec authored
* Draft of FakeQuantize operation along with UTs. * Add FakeQuantize to implemented operators on IGPU. * Get back FakeQuantize op case to switch. * Fix compilation errors. * Skip test for INTERPRETER backend and disable type_prop tests. * Initial implementation covering the most basic case * Cleanup of fake_quantize_with_clip UT * Reformat the cpu unit tests manifest and unlock another fake quant UT * Handle the clipping case by subtracting input_low from quantization input * Clip the input data before quantization to avoid Selects * UT manifest fix * Obsolete comment removed * Code formatting * Broadcast input data for non-scalar in/out params * Code formatting * Enable the type prop tests for FakeQuantize * Dequant the data without using the Dequantize op (fixes an edge case)
-
Ilya Churaev authored
-
Adam Rogowiec authored
* Move reshape from utils to builder. * Add aliases to functions in old place and describe changes.
-
gcwenger authored
-
Robert Kimball authored
-
Tomasz Dołbniak authored
* ShuffleChannels implementation * Validation of ShuffleChannels params * Implementation of ShuffleChannels decompose_op() * Formatting adjustments * Corrected implementation and validation of op params * Basic test of ShuffleChannels * Negative axis value test * Default params for the ShuffleChannels op * ShuffleChannels test with floats * ShuffleChannels validation unit tests * PR comments * Compilation error fix * PR feedback and cleanup * Code formatting adjustment * Negative axis value documentation * Docs update (PR feedback) * PR feedback: shape and axis validation * Modify axis semantics on shuffle op * Revert "PR feedback: shape and axis validation" This reverts commit 21b708e710b91da2a7e37a69c0da1f31c7743b47.
-
Dmitry Yershov authored
Switch to clDNN version with conformance fix for 3 ONNX models (DenseNet-121, Inception-v2, ResNet-50) (#2982)
-
- 28 May, 2019 2 commits
-
-
Tomasz Dołbniak authored
-
Leona C authored
* Cleanup section * Add updated illustrations for pattern_matcher and tensor_descriptor * Add subsection link to be consistent
-
- 25 May, 2019 1 commit
-
-
Robert Kimball authored
* update a few files to build on windows * more fixes
-
- 24 May, 2019 3 commits
-
-
Scott Cyphers authored
* Switch some get_inputs uses to use the newer inputs * Review comments
-
Jayaram Bobba authored
* Added CTCGreedyDecoder layer op * Added comment on seq_len validation checks
-
Adam Procter authored
-