Commits · 8eed208b94b1035f815d5529e4c7d60eca8b18d2 · submodule / ngraph

31 Jul, 2019 10 commits
- Unify folders for arithmetic reduction ops; add Max and Min · 8eed208b
  Adam Procter authored Jul 31, 2019
  
  8eed208b
- Convert CPU ops to new form (#3345) · d8d940d0
  Scott Cyphers authored Jul 31, 2019
```
* Convert CPU ops to new form

Remove obsolete sigmoid

* Fix export

* fix typo

* typo
```
  d8d940d0
- Merge pull request #3294 from NervanaSystems/etusien/clamp · 30603191
  Robert Kimball authored Jul 31, 2019
```
[Py] Added clamp operator to Python API
```
  30603191
- Merge branch 'master' into etusien/clamp · 138b6260
  Michał Karzyński authored Jul 31, 2019
  
  138b6260
- Convert PlaidML ops to new form (#3344) · 5b59c095
  Scott Cyphers authored Jul 31, 2019
```
* Convert PlaidML ops to new form

* style
```
  5b59c095
- Add ConstantFolding for Gather (#3342) · a6c2f23b
  Adam Procter authored Jul 31, 2019
```
* Add CF for Gather

* Style
```
  a6c2f23b
- Convert ONNX ops to new form (#3343) · 6b90c1bd
  Scott Cyphers authored Jul 31, 2019
  
  6b90c1bd
- CF updates: Slice, DynSlice (#3340) · 55209b7a
  Adam Procter authored Jul 31, 2019
```
* Move SlicePlan out of DynElimination, for reuse in ConstantFolding

* Add CF support for Slice

* Add CF for DynSlice

* Add <algorithm> to slice_plan.cpp, to make Windows happy
```
  55209b7a
- remove broken link (#3341) · 5366be98
  Leona C authored Jul 31, 2019
  
  5366be98
- CF updates: Reshape, DynReshape, Transpose (#3338) · 75379523
  Adam Procter authored Jul 31, 2019
```
* Update Reshape CF to support all ETs

* Add CF for DynReshape

* Add CF for Transpose

* Add #include <numeric>, for std::iota

* style, oops
```
  75379523
30 Jul, 2019 7 commits

Include missing header file. (#3339) · 30c7028f
Sang Ik Lee authored Jul 30, 2019

30c7028f

Add Output support for a number of builders (#3297) · 9ebedbbf

Scott Cyphers authored Jul 30, 2019

* Add Output support for a number of builders

* ../src/ngraph/builder/autobroadcast.cpp

* Remove some _values

* Simplify

* Simplify, add some more ops/builders

* ONNX failures

* Some GetOutputElement changes to help with Output<Node>

* Convert some ops to use Output<Node> inputs

* Review comments

* Some more changes of nodes to outputs

* Chane names for OutputVector helpers to avoid API change

* Cleanup

9ebedbbf

Some more changes of nodes to outputs (#3315) · 8b0a2d19

Scott Cyphers authored Jul 30, 2019

* Some more changes of nodes to outputs

* Chane names for OutputVector helpers to avoid API change

8b0a2d19

[MLIR] Bump MLIR repo to commit 26c683c, 07/29/2019. (#3310) · 81597f3a

Diego Caballero authored Jul 30, 2019

* [MLIR] Bump MLIR repo to commit 59167c2, 07/25/2019.

MLIR commit:
Author: River Riddle <riverriddle@google.com>
Date:   Wed Jul 24 16:41:11 2019 -0700

    NFC: Use ValueOfRange instead of T in Diagnostic::appendRange.

        For iterator_range, T is often the name of another iterator type
        and not the the value of the range.

LLVM commit:
Author: Marshall Clow <mclow.lists@gmail.com>
Date:   Thu Jul 25 03:26:05 2019 +0000

    Implement change #4 of P1466: Change weekday to accept both 0 and 7
    as Sunday. Add accessors 'c_encoding' and 'iso_encoding' to provide
    different interpretations of the weekday. Remove 'operator unsigned'

* style

* Move MLIR/LLVM repos a bit more forward

81597f3a

ConstantFolding for Not (#3326) · 5ece6de2

Adam Procter authored Jul 30, 2019

* CF for Sign, and extend element type capabilities for unary arithop CF

* CF for Ceiling and Floor

* Update CPU CF builders

* Update CPU CF builders

* CF for Not

* Add tests for new CPU CF folders

* Add tests for recently added CPU CF functors

* Add tests for non-CPU ceiling/floor CF

* Unit tests

* Add test for CPU folder

5ece6de2

ConstantFolding for Ceiling and Floor (#3320) · 09952c0b

Adam Procter authored Jul 30, 2019

* CF for Sign, and extend element type capabilities for unary arithop CF

* CF for Ceiling and Floor

* Update CPU CF builders

* Update CPU CF builders

* Add tests for new CPU CF folders

* Add tests for recently added CPU CF functors

* Add tests for non-CPU ceiling/floor CF

* Unit tests

09952c0b

Add tests for recently added CPU CF functors (#3328) · 831df41d
Adam Procter authored Jul 30, 2019
```
* Add tests for new CPU CF folders

* Add tests for recently added CPU CF functors
```
831df41d

29 Jul, 2019 12 commits

ConstantFolding for Equal, Greater, GreaterEq, Less, LessEq, NotEqual (#3322) · bcaf32c4

Adam Procter authored Jul 29, 2019

* CF for And and Or

* CF support for comparison ops

* Fix predicate for binary elementwise; add unit tests for non-arithmetic binops

* Update CPU CF builders

bcaf32c4

Make validation a pass and add it after every pass by default (#3296) · c693cb7e

Robert Kimball authored Jul 29, 2019

* Make validation a pass and add it after every pass by default

* cleanup

* update per review comments

* Switch plaid to new API for disabling  pass validation

* address review comment

c693cb7e

[MLIR] Enable affine dialect loop fusion (#3290) · aedd8c2e

Diego Caballero authored Jul 29, 2019

* [MLIR] Enable affine dialect loop fusion

Enable affine dialect loop fusion in nGraph pipeline. It also adds an
opt flag to enable/disable it when ngraph-opt is in place. Fusion seems
to work for simple cases. It wasn't able to fuse dot + add, though, at
least in my test case. One example that worked:

Input:
  %6 = alloc() : memref<2500x2500xf32>
  affine.for %i3 = 0 to 2500 {
    affine.for %i4 = 0 to 2500 {
      %7 = load %arg0[%i3, %i4] : memref<2500x2500xf32>
      %8 = load %0[%i3, %i4] : memref<2500x2500xf32>
      %9 = addf %8, %7 : f32
      store %9, %6[%i3, %i4] : memref<2500x2500xf32>
    }
  }
  %10 = alloc() : memref<2500x2500xf32>
  affine.for %i5 = 0 to 2500 {
    affine.for %i6 = 0 to 2500 {
      %11 = load %arg2[%i5, %i6] : memref<2500x2500xf32>
      %12 = load %0[%i5, %i6] : memref<2500x2500xf32>
      %13 = addf %12, %11 : f32
      store %13, %10[%i5, %i6] : memref<2500x2500xf32>
    }
  }
  %14 = alloc() : memref<2500x2500xf32>
  affine.for %i7 = 0 to 2500 {
    affine.for %i8 = 0 to 2500 {
      %15 = load %10[%i7, %i8] : memref<2500x2500xf32>
      %16 = load %6[%i7, %i8] : memref<2500x2500xf32>
      %17 = addf %16, %15 : f32
      store %17, %14[%i7, %i8] : memref<2500x2500xf32>
    }
  }

Output:
  %8 = alloc() : memref<2500x2500xf32>
  affine.for %i3 = 0 to 2500 {
    affine.for %i4 = 0 to 2500 {
      %9 = load %arg2[%i3, %i4] : memref<2500x2500xf32>
      %10 = load %2[%i3, %i4] : memref<2500x2500xf32>
      %11 = addf %10, %9 : f32
      %12 = affine.apply #map2(%i3, %i4, %i3, %i4)
      %13 = affine.apply #map3(%i3, %i4, %i3, %i4)
      store %11, %0[%12, %13] : memref<1x1xf32>
      %14 = load %arg0[%i3, %i4] : memref<2500x2500xf32>
      %15 = load %2[%i3, %i4] : memref<2500x2500xf32>
      %16 = addf %15, %14 : f32
      %17 = affine.apply #map2(%i3, %i4, %i3, %i4)
      %18 = affine.apply #map3(%i3, %i4, %i3, %i4)
      store %16, %1[%17, %18] : memref<1x1xf32>
      %19 = affine.apply #map2(%i3, %i4, %i3, %i4)
      %20 = affine.apply #map3(%i3, %i4, %i3, %i4)
      %21 = load %0[%19, %20] : memref<1x1xf32>
      %22 = affine.apply #map2(%i3, %i4, %i3, %i4)
      %23 = affine.apply #map3(%i3, %i4, %i3, %i4)
      %24 = load %1[%22, %23] : memref<1x1xf32>
      %25 = addf %24, %21 : f32
      store %25, %8[%i3, %i4] : memref<2500x2500xf32>
    }
  }

* Rename MLIR_LLVM_OPTIONS to NGRAPH_MLIR_OPTIONS

Something like this works now:
NGRAPH_MLIR_OPTIONS="--enable-affine-loop-fusion=false"

* Disable loop fusion by default and fix typo

aedd8c2e

Merge pull request #3325 from NervanaSystems/nmostafa/mergefix · 862aa5fe
Robert Kimball authored Jul 29, 2019
```
[MLIR] Fix bad merge on 2 MLIR changes
```
862aa5fe
Fix bad merge on 2 MLIR changes · c4dfca3b
nmostafa authored Jul 29, 2019

c4dfca3b
Merge pull request #3298 from NervanaSystems/nmostafa/recompile · fc9a7dea
Robert Kimball authored Jul 29, 2019
```
[MLIR] Re-compile sub-graph once on first invocation
```
fc9a7dea
Merge branch 'master' into nmostafa/recompile · 956e8b3a
Robert Kimball authored Jul 29, 2019

956e8b3a
Merge pull request #3324 from NervanaSystems/bob/unit-test · 5d3456e4
Robert Kimball authored Jul 29, 2019
```
Changes to get allow NNP to pass tests
```
5d3456e4
style · 9733630b
Robert Kimball authored Jul 29, 2019

9733630b
fix error · 6a0945ee
Robert Kimball authored Jul 29, 2019

6a0945ee
fix tolerance · 6292fa4d
Robert Kimball authored Jul 29, 2019

6292fa4d
[Py] Changed docstring. · e5aa1995
Ewa21 authored Jul 29, 2019

e5aa1995

28 Jul, 2019 4 commits
- Merge branch 'master' into bob/unit-test · 17bcff88
  Scott Cyphers authored Jul 28, 2019
  
  17bcff88
- ConstantFolding for Sign (#3319) · f4d44bbc
  Adam Procter authored Jul 28, 2019
```
* CF for Sign, and extend element type capabilities for unary arithop CF

* Update CPU CF builders
```
  f4d44bbc
- Change test to not use the create_tensor call which takes a memory buffer · c5a7e690
  Robert Kimball authored Jul 28, 2019
  
  c5a7e690
- unit test to use all_close_f · 5273c0f4
  Robert Kimball authored Jul 28, 2019
  
  5273c0f4
27 Jul, 2019 3 commits
- ConstantFolding for Sum (#3318) · 098c9118
  Adam Procter authored Jul 27, 2019
```
* CF for Sum

* style
```
  098c9118
- ConstantFolding for Concat (#3317) · 4e0e0f56
  Adam Procter authored Jul 27, 2019
```
* CF for Concat

* Switch from Nodes to Inputs/Outputs
```
  4e0e0f56
- Quantization conversion from nodes to outputs (#3316) · 34499001
  Scott Cyphers authored Jul 27, 2019
  
  34499001
26 Jul, 2019 4 commits
- Reshape sinking: fix issue with handling rank changing reshape. (#3314) · 8eb63379
  Sang Ik Lee authored Jul 26, 2019
  
  8eb63379
- Refactor optimize() back · a2b9c6b8
  nmostafa authored Jul 26, 2019
  
  a2b9c6b8
- Fixed double-buffering timing (#3309) · c04b5588
  gcwenger authored Jul 26, 2019
```
API is synchronous per thread and threads are coordinated so that
we know when we hit the last iteration everything is done.
Using join() to gate end of iterations was introducing too much
overhead to timing as verified via checking traces.
```
  c04b5588
- [MLIR] Add missing visitor for Relu in compiler.cpp (#3308) · f54e9159
  Diego Caballero authored Jul 26, 2019
  
  f54e9159