- 24 Feb, 2018 1 commit
-
-
Matthew Brookhart authored
* fix maxpool copy_with_new_args * fix a free(nullptr) error * update output shape from a maxpool * remove free gaurd
-
- 23 Feb, 2018 2 commits
-
-
Matthew Brookhart authored
-
Christian Convey authored
-
- 22 Feb, 2018 3 commits
-
-
Matthew Brookhart authored
* fix numeric stability bug in autodiff of divide * add a test for divide autodiff stability
-
Jayaram Bobba authored
* Basic assignment pass for CPU backend * Change CPU convolution emitter to check op annotations * Queries MKLDNN for optimal layout on convolutions * Added layout conversions through ConverLayout ops and explicit layout conversion on CPU tensor view objects * Added layout conversions for non-MKLDNN ops * - Style fixes - Removed unused variables to avoid clang errors - Added more mkldnn format types to utility functions * Move ConvertLayout back to runtime::cpu::op namespace * Added more mkldnn memory formats * Moved op annotations to Op class * Style changes * Minor fix * Minor fix to keep clang happy * Use ngraph element type instead of c_type_string in MKLDNN utility functions * Addressed PR(#502) comments
-
Nick Korovaiko authored
* fix a segfault due to an unhandled op * fix a missing new line
-
- 21 Feb, 2018 4 commits
-
-
fenglei.tian authored
-
fenglei.tian authored
-
Sandeep authored
* relu for interpreter * relu in serializer * core fusion * relu backprop * relu backprop and test interpreter * core fusion for CPU * COREFusion -> CoreFusion * relu MKL dnn
-
fenglei.tian authored
-
- 20 Feb, 2018 8 commits
-
-
Adam Procter authored
* Add product op * Add Max (max reduce) and Min (min reduce) ops * Refactor arithmetic reduction ops to a common base class * Fix PREFER_EIGEN codepaths in cpu_emitter
-
Pruthvi authored
* fixed bn weights initialization to correct size * style fix
-
fenglei.tian authored
-
Ashok Emani authored
-
Ashok Emani authored
-
Ashok Emani authored
* add mxnet sockeye Seq2Seq model * update test with sockeye model
-
pthoreho authored
-
fenglei.tian authored
-
- 16 Feb, 2018 2 commits
- 15 Feb, 2018 2 commits
-
-
Ashok Emani authored
-
Ashok Emani authored
-
- 14 Feb, 2018 4 commits
-
-
Pruthvi authored
* fuse dot(a,b) + c cblas_gemm working on mlp rebase & small fixes enable debug output support replacing function's outputs * WIP pattern matching for variance * - Added pattern matcher graph to look up variance(sub graph) in bn - Added test case to verify the variance graph pattern * added batch norm mean pattern matcher. * remove reshapes (cherry picked from commit ecad321fb1b1bc3f7facda229beb940118ca0701) * fixed mean test to use Matcher. * resolve merge conflict in test/pattern.cpp * WIP bn fprop pattern * fprop bn fusion working * - Added unit test case to read the bn serializeed *.json file and run bn fprop fusion pass - Added batchnorm header file and defined the bn class to emit the mkldnn kernel - Added pattern matcher for fprop bn in CPU graph_rewrite pass * WIP MKLDNN fprop bn emitter code * completed fprop batchnorm kernel in CPU emitter * fixed bug in the emitter code for fprop bn * - Fixed copilation issues - unit tests are passing for bn emitter fprop code * Added support to compute fprop bn with mean annd variance as input * resolved compilation issues * refactored bn fprop code * - added batchnorm src file to the CMakeFilelist - moved bn fusion under CPU runtime/pass/cpu_fusion - fixed compilation issue * Resolved compilation issues in bn emitted code * Addded debug statements in fprop bn emitted code * added batchnorm.cpp src file * - Added test case to test fprop batchnorm with known tensor values - fixed bug related to defining weights in fprop bn * - Added test case for fprop batchnorm Op - Added test case for mean and variance pattern matcher - Added fprop bn *.json file with input having 4dmis mb2c3h2w2 - refactored fprop bn op class * Style fix * - Removed Debug symbols * - Fixed header template with correct year - appended mkldnn.hpp in the CPU generated code * Addressed PR review comments - added support for batchnorm op in serializer and de-serializer - added more sanity in bn constructor - renamed "BatchnormFprop" -> BatchNorm * - Addressed PR review comments - replaced auto with speicfic mkldnn::type in emitted bn kernel - modified function signature to take 'eps' as double instead of <Node> type * added missing header files, resolved compilation issue * style fix * Addressed PR comments 1. initilized member variables for bn in the same order as they are defined 2. renamed bn member variables to start with m_* as per coding convention 3. moved bn fusion test to test/cpu_fusion.cpp 4. style fix 5. added more checks to evaluate type and shape of inputs to bn * Added support for EMITDECL macro for batchnorm * - made correction to batchnorm src file name batchnorm -> batch_norm as per coding guidelines - corrected bn copy_with_new_args() method * Removed redundant SqrtOp support in serializer
-
fenglei.tian authored
-
Adam Procter authored
-
Sevin F. Varoglu authored
- enable distributed ngraph (MPI) - add AllReduce op to ngraph core, interpreter and CPU backend - add AllReduce unit test
-
- 13 Feb, 2018 4 commits
-
-
fenglei.tian authored
-
fenglei.tian authored
-
fenglei.tian authored
-
fenglei.tian authored
-
- 12 Feb, 2018 4 commits
-
-
Robert Kimball authored
* fix Shape declarations
-
Robert Kimball authored
-
Jaikrishnan Menon authored
-
Robert Kimball authored
* unit tests faster * speed up binary zero size tests * fix style error * remove some of the redundant code
-
- 09 Feb, 2018 6 commits
-
-
Louis Feng authored
-
Tristan Webb authored
* GPU kernels for reshape, GEMM, EW ADD/Mult, Maximum (A + B) * C test now with cuBLAS Additional gemm and gemv calls cmake updates for cuDNN calls memcpy wrappers in gpu_util Additional passing tests: aliased outputs, parameter, constant tensor memcopy
-
adstraw authored
* compare derivatives from bprop and bprop with fprop cache * code format
-
Adam Procter authored
-
Jennifer Myers authored
-
fenglei.tian authored
-