Commits · a574bdafcd59d4f33184822d59b5057d99e401b4 · submodule / ngraph

24 Feb, 2018 1 commit

fix maxpool copy_with_new_args (#541) · 2feefb92

* fix maxpool copy_with_new_args

* fix a free(nullptr) error

* update output shape from a maxpool

* remove free guard

2feefb92

23 Feb, 2018 2 commits
- overpadded maxpool in INTERPRETER now matches mkldnn behavior (#537) · e1818685
  Matthew Brookhart authored 7 years ago
  
  e1818685
- Fixes NGMX-338: Adds option to AvgPool padding. · d2d0196b
  Christian Convey authored 7 years ago
  
  d2d0196b
22 Feb, 2018 3 commits

fix numeric stability bug in autodiff of divide (#532) · 8ad86ab9
Matthew Brookhart authored 7 years ago
```
* fix numeric stability bug in autodiff of divide

* add a test for divide autodiff stability
```
8ad86ab9

Jbobba/layout query (#502) · 8a8c0446

Jayaram Bobba authored 7 years ago

* Basic assignment pass for CPU backend

* Change CPU convolution emitter to check op annotations

* Queries MKLDNN for optimal layout on convolutions

* Added layout conversions through ConvertLayout ops and explicit layout conversion on CPU tensor view objects

* Added layout conversions for non-MKLDNN ops

* - Style fixes
- Removed unused variables to avoid clang errors
- Added more mkldnn format types to utility functions

* Move ConvertLayout back to runtime::cpu::op namespace

* Added more mkldnn memory formats

* Moved op annotations to Op class

* Style changes

* Minor fix

* Minor fix to keep clang happy

* Use ngraph element type instead of c_type_string in MKLDNN utility functions

* Addressed PR(#502) comments

8a8c0446

Fix a segfault due to an unhandled op (#530) · 3b474dfb
Nick Korovaiko authored 7 years ago
```
* fix a segfault due to an unhandled op

* fix a missing new line
```
3b474dfb

21 Feb, 2018 4 commits
- update test and cmake · 790dcd6c
  fenglei.tian authored 7 years ago
  
  790dcd6c
- add skip gpu test macro for new tests · e40f9c50
  fenglei.tian authored 7 years ago
  
  e40f9c50
- moving Relu op from Argon backend with CoreFusion (#489) · 4e29c153
  Sandeep authored 7 years ago
```
* relu for interpreter

* relu in serializer

* core fusion

* relu backprop

* relu backprop and test interpreter

* core fusion for CPU

* COREFusion -> CoreFusion

* relu MKL dnn
```
  4e29c153
- clang format · 763d448a
  fenglei.tian authored 7 years ago
  
  763d448a
20 Feb, 2018 8 commits
- Some more reduction ops (forward prop) (#510) · 96cabff0
  Adam Procter authored 7 years ago
```
* Add product op

* Add Max (max reduce) and Min (min reduce) ops

* Refactor arithmetic reduction ops to a common base class

* Fix PREFER_EIGEN codepaths in cpu_emitter
```
  96cabff0
- Fixed bug in initializing weights for bn fprop mkldnn emitted code (#519) · 292ab46f
  Pruthvi authored 7 years ago
```
* fixed bn weights initialization to correct size

* style fix
```
  292ab46f
- clang format · ba9a2a25
  fenglei.tian authored 7 years ago
  
  ba9a2a25
- add benchmark util · 7d160dad
  Ashok Emani authored 7 years ago
  
  7d160dad
- refactor benchmark util · 348dd27f
  Ashok Emani authored 7 years ago
  
  348dd27f
- add mxnet sockeye Seq2Seq model (#508) · 607bcbc4
  Ashok Emani authored 7 years ago
```
* add mxnet sockeye Seq2Seq model

* update test with sockeye model
```
  607bcbc4
- Addressed PR review comments · 67fb65b8
  pthoreho authored 7 years ago
  
  67fb65b8
- enable abs test for gpu · d23e5a70
  fenglei.tian authored 7 years ago
  
  d23e5a70
16 Feb, 2018 2 commits
- style fix · c6672b3d
  pthoreho authored 7 years ago
  
  c6672b3d
- - Added test for max pooling to verify mkldnn maxpool implementation · 03f6c0ab
  pthoreho authored 7 years ago
```
- added workaround to attach the maxpool workspace for bprop delta propagation
```
  03f6c0ab
15 Feb, 2018 2 commits
- address review comments · 2e11e95d
  Ashok Emani authored 7 years ago
  
  2e11e95d
- add cmake for nbench, address review comments · a1962e76
  Ashok Emani authored 7 years ago
  
  a1962e76
14 Feb, 2018 4 commits

pattern matcher for BatchnormFprop + mkldnn integration in the CPU emitter (#468) · 34b1322d

Pruthvi authored 7 years ago

* fuse dot(a,b) + c

cblas_gemm working on mlp

rebase & small fixes

enable debug output

support replacing function's outputs

* WIP pattern matching for variance

* - Added pattern matcher graph to look up variance(sub graph) in bn
- Added test case to verify the variance graph pattern

* added batch norm mean pattern matcher.

* remove reshapes

(cherry picked from commit ecad321fb1b1bc3f7facda229beb940118ca0701)

* fixed mean test to use Matcher.

* resolve merge conflict in test/pattern.cpp

* WIP bn fprop pattern

* fprop bn fusion working

* - Added unit test case to read the bn serialized *.json file and run bn fprop fusion pass
- Added batchnorm header file and defined the bn class to emit the mkldnn kernel
- Added pattern matcher for fprop bn in CPU graph_rewrite pass

* WIP MKLDNN fprop bn emitter code

* completed fprop batchnorm kernel in CPU emitter

* fixed bug in the emitter code for fprop bn

* - Fixed compilation issues
- unit tests are passing for bn emitter fprop code

* Added support to compute fprop bn with mean and variance as input

* resolved compilation issues

* refactored bn fprop code

* - added batchnorm src file to the CMakeFilelist
- moved bn fusion under CPU runtime/pass/cpu_fusion
- fixed compilation issue

* Resolved compilation issues in bn emitted code

* Added debug statements in fprop bn emitted code

* added batchnorm.cpp src file

* - Added test case to test fprop batchnorm with known tensor values
- fixed bug related to defining weights in fprop bn

* - Added test case for fprop batchnorm Op
- Added test case for mean and variance pattern matcher
- Added fprop bn *.json file with input having 4 dims mb2c3h2w2
- refactored fprop bn op class

* Style fix

* - Removed Debug symbols

* - Fixed header template with correct year
- appended mkldnn.hpp in the CPU generated code

* Addressed PR review comments
- added support for batchnorm op in serializer and de-serializer
- added more sanity checks in bn constructor
- renamed "BatchnormFprop" -> BatchNorm

* - Addressed PR review comments
- replaced auto with specific mkldnn::type in emitted bn kernel
- modified function signature to take 'eps' as double instead of <Node> type

* added missing header files, resolved compilation issue

* style fix

* Addressed PR comments
1. initialized member variables for bn in the same order as they are defined
2. renamed bn member variables to start with m_* as per coding convention
3. moved bn fusion test to test/cpu_fusion.cpp
4. style fix
5. added more checks to evaluate type and shape of inputs to bn

* Added support for EMITDECL macro for batchnorm

* - made correction to batchnorm src file name batchnorm -> batch_norm as per coding guidelines
- corrected bn copy_with_new_args() method

* Removed redundant SqrtOp support in serializer

34b1322d

skip tests for GPU · 2a89f8b4
fenglei.tian authored 7 years ago

2a89f8b4

Allow caching of external dependencies (everything but TBB, which I can't figure out yet) (#473) · 2fe7f0f3
Adam Procter authored 7 years ago

2fe7f0f3

add AllReduce op and MPI support (#425) · b9c5b9d3

Sevin F. Varoglu authored 7 years ago

- enable distributed ngraph (MPI)
- add AllReduce op to ngraph core, interpreter and CPU backend
- add AllReduce unit test

b9c5b9d3

13 Feb, 2018 4 commits
- disable gpu tests for now, since most will fail · a6d78dd7
  fenglei.tian authored 7 years ago
  
  a6d78dd7
- add gpu mnist_mlp_forward test · 3da186d7
  fenglei.tian authored 7 years ago
  
  3da186d7
- cleanup code · 2db7022e
  fenglei.tian authored 7 years ago
  
  2db7022e
- cleanup code · f7d97aa1
  fenglei.tian authored 7 years ago
  
  f7d97aa1
12 Feb, 2018 4 commits
- fix Shape declarations (#488) · 00fb503f
  Robert Kimball authored 7 years ago
```
* fix Shape declarations
```
  00fb503f
- remove unit test that was both redundant and marked as disabled (#487) · 5e773f81
  Robert Kimball authored 7 years ago
  
  5e773f81
- Merge fixes · 2ca1528e
  Jaikrishnan Menon authored 7 years ago
  
  2ca1528e
- Bob/zero size tests (#484) · e931b2b6
  Robert Kimball authored 7 years ago
```
* unit tests faster

* speed up binary zero size tests

* fix style error

* remove some of the redundant code
```
  e931b2b6
09 Feb, 2018 6 commits
- fixed isnan issue on centos 7.2 · 9918436d
  Louis Feng authored 7 years ago
  
  9918436d
- GPU kernels for reshape, GEMM, EW ADD/Mult, Maximum (#440) · da50410b
  Tristan Webb authored 7 years ago
```
* GPU kernels for reshape, GEMM, EW ADD/Mult, Maximum

(A + B) * C test now with cuBLAS
Additional gemm and gemv calls
cmake updates for cuDNN calls
memcpy wrappers in gpu_util

Additional passing tests:
aliased outputs, parameter, constant tensor memcopy
```
  da50410b
- check derivatives from bprop against derivatives from fprop cache bprop (#469) · 27fee946
  adstraw authored 7 years ago
```
* compare derivatives from bprop and bprop with fprop cache

* code format
```
  27fee946
- Remove execute permissions from non-executable files (#474) · e054366e
  Adam Procter authored 7 years ago
  
  e054366e
- Fix pep8 warning in copyright · c7a3a76b
  Jennifer Myers authored 7 years ago
  
  c7a3a76b
- add runtime cuda kernel compile · e63322d9
  fenglei.tian authored 7 years ago
  
  e63322d9