- 18 Jul, 2018 3 commits
-
-
Robert Kimball authored
* make pool test check backends other than CPU * more unit test cleanup
-
Jaikrishnan Menon authored
-
Nick Korovaiko authored
* cpu loop kernel fusion pass * remove extra code * bounded relu test * address scotts feedback
-
- 17 Jul, 2018 1 commit
-
-
Jayaram Bobba authored
* CPU Direct Execution: Implement ConvertLayout and refactor * CPU Direct Execution: Implement Convolution * 1) Adds computation reuse to direct execution 2) Add avg_pool, broadcast and convolution_bias to direct execution 3) Moved some computation reuse utility functions to graph_utils * Use lists instead of vectors to avoid reallocation overheads * - Added convolution variants to direct execution - Removed ConvolutionBiasRelu, use ConvolutionBias instead - Reduced code duplication by moving functionality to mkldnn_emitter from cpu_emitter * Style fix * Moved mkldnn build_convolution to a templated method * Style fix * refactored mkldnn conv bprop builders * Style fix
-
- 14 Jul, 2018 1 commit
-
-
Robert Kimball authored
move long building tests to the be the first tests built with the hope of reducing build time. (#1229)
-
- 13 Jul, 2018 1 commit
-
-
Nick Korovaiko authored
* get_subgraph_outputs * simplify the condition
-
- 12 Jul, 2018 2 commits
-
-
Louis Feng authored
* reshape inplace without copy data if possible. * added reshape and broadcast to CSE. * Fixed debug messages.
-
Robert Kimball authored
* open only the unversioned library but check that it is built against the correct version of ngraph * review comments
-
- 11 Jul, 2018 1 commit
-
-
Pruthvi authored
-
- 09 Jul, 2018 2 commits
-
-
Robert Kimball authored
* Faster liveness. Memory manager optimized for non-sharing of tensors. Add pass manager profiler. * Move pass profiler to a separate PR * Move Memory Layout optimizations to a separate PR * use find instead of count
-
Robert Kimball authored
* Cache some generated functions in backwards tests to speed performance * more caching
-
- 07 Jul, 2018 1 commit
-
-
Robert Kimball authored
* complete the new backend construction/destruction API * close each dlopen * don't close libraries for now as it causes python to segfault
-
- 06 Jul, 2018 2 commits
-
-
Nishant Patel authored
* Usage of mkldnn reshape updated * update reshape condition for mkldnn * Add a test case and order in which conditions are checked
-
Nick Korovaiko authored
* collect matched nodes * clear m_matched_list * tests * address feedback
-
- 03 Jul, 2018 2 commits
-
-
Louis Feng authored
* hacking to support dot of 3 by 2 inputs with gemm_batch. * clean up.
-
Robert Kimball authored
* nbench cleanup * update style
-
- 02 Jul, 2018 3 commits
-
-
Sandeep authored
* declare sigmoid for core fusion * add simple test for sigmoid * info fusion status * cp op as main op * builds as expected * move sigmoid fusion code * add reference kernel * sigmoid bprop reference kernel and clang-format * add delta to bprop * fprop called * compiles bprop * move tests * serializer support * address comments in code * add doc * naming similar to core ops * fix failing test * fix failing test * address clang issue * more changes * change test macro
-
Pruthvi authored
* 1. Added MKLDNNN BoundedRelu op support for Relu6 2. CpuLayout && CPU assignment pass for BoundedRelu Op 3. Unit test inter v/s CPU for BoundedReluOp 4. MKLDNN and default emitter code for BoundedReluOp * Removed Debug prints * 1. Added support for boundedrelu to work on any constant literal 2. unit test case for rank2, rank3, rank4 for bounded relu without serialized graph * Removed is_six() method
-
Louis Feng authored
* Reshape bias to 1D for conv + bias bprop fusion * Reshape goe2 back to 2D before replacing * added shape checks to validate conv+bias op. * removed conv+bias backprop merge for separate PR review. * fixed conv_bias_bprop test. * minor changes to error messages.
-
- 30 Jun, 2018 2 commits
-
-
Pruthvi authored
* - Fixed replace output for the multi layer recurrent cell state tensor output - Modified rnn add_output to consider direction and n_layer while calculating the output size for mkldnn dst_layer and dst_iter * fix unit test failure
-
Nick Korovaiko authored
* collector * keeping track of inputs; simplifying a merging stratey; adding LKGraph * LoopKernel Collector * address feedback * address feedback 2 * address feedback 3
-
- 28 Jun, 2018 2 commits
-
-
Nishant Patel authored
* Reshape 4d * Support dimshuffles/transpose with MKLDNN * Addressing PR Feedback * Use Eigen for 3D dimshuffles
-
Adam Straw authored
* constant broadcast folding * code review feedback
-
- 26 Jun, 2018 3 commits
-
-
Robert Kimball authored
-
Jayaram Bobba authored
* inplace compute * fix warnings * Initial support for convolution sum fusion * Added in-place support for conv sum fusion and test cases * reverting spurious changes * Bug fix to account for inplace input in conv sum fusion * fix compilation error * Addressed PR feedback
-
Igor Kaplounenko authored
* updated to work with llvm 8.1 that tensorflow is built with * sane extensions on the mac * not doing rpath on apple * apply style
-
- 25 Jun, 2018 2 commits
-
-
Nick Korovaiko authored
* inplace compute * fix warnings * address bob's feedback * bob's feedback 2 * bobs feedback 3 * address bob's feedback 4
-
Robert Kimball authored
* remove reference to ngraph core code from codegen. add stand-alone implementations of needed funcions * fixed potential pointer leak * clean up file_util * more file util cleanup, removing unused functions * interpreter works on mac * CPU and INTERPRETER build and pass unmit tests on macos * move get_directory to file_util * cleanup
-
- 22 Jun, 2018 1 commit
-
-
Matthew Brookhart authored
-
- 21 Jun, 2018 1 commit
-
-
Adam Straw authored
* adding constant propagation pass * adding test/constant_propagation.cpp * template make_constant_reshape function * code review feedback * add missing files
-
- 20 Jun, 2018 1 commit
-
-
Adam Procter authored
* Fix bug with concat for 0-size tensors * Simplify test for zero-length axes, per PR comments
-
- 19 Jun, 2018 2 commits
-
-
Robert Kimball authored
* fix mkldnn rpath * fix compile warning * close backends when exiting * set backend output directory of backends to the ngraph output directory * Aprocter/patch patch (#1119) * Move more rpath stuff inside if(NOT APPLE) * fix repatch problem with mkldnn library * add updated patch command for older versions of cmake
-
Nick Korovaiko authored
* loop kernel + tests * remove commented out code * remove commented code; add comments * copy_with_new_args +test * add comment * fix comp errors
-
- 16 Jun, 2018 2 commits
-
-
Nick Korovaiko authored
* optimized strided convolutions * clean up debug messages * format fixes * more tests * even more tests * adapt to resnet-50.v1 * fix format errors; remove changes from diff PRs
-
Nick Korovaiko authored
* enable cse for reduction ops * reduction tests
-
- 15 Jun, 2018 2 commits
-
-
Robert Kimball authored
-
Pruthvi authored
* - Added graph pass for fusing RNN op across layer - Added test case for inter v/s cpu for verifying layer fused RNN - more sanity checks in the RNN fusion graph pass - added support to replace the recurrent cell state correctly in the fused RNN op * Fixed multi layer rnn fusion unit test failure * Addressed PR comments
-
- 13 Jun, 2018 3 commits
-
-
Robert Kimball authored
* backend libraries now found in tree dynamically read header search paths fix running from install
-
Nick Korovaiko authored
* group conv init * add GroupConvolution op; refine checks in fusion logic * add an emitter, cpu assigment * cpu_layout * add checks to algebraic simplification * updating emitter logic for groupconvolution * working before refactoring * moving primitive creation logic to mkldnn_emitter * group convolution graph test * rename an opt * address jbobba's feedback
-
Fenglei authored
* add pad_dilation function * add dilation to gpu_emitter * add CoordinateDiff constructor to GPUShape * remove unecessary cast * working version for forward * forward working * forward test all pass * deconvolution forward * backward data dilation * forward test passed * initial to 0 * fix bug for get_padded_shape and clang format * code style, change variable names * refactor convolution conditions * fix bug padding_below_diff * change pad_dilation to pad_dynamic, compare to pad * remove passed convolution test from skip list, clang format * change pad to use GPUShape
-