- 26 May, 2018 2 commits
-
-
Nick Korovaiko authored
* serializer pass
-
Jayaram Bobba authored
* Bug fix to graph control logic to always compute output tensors
* Remove stale comments
-
- 25 May, 2018 7 commits
-
-
Fenglei authored
-
Chris Sullivan authored
* cuDNN softmax impl. for all axis activation.
* Added catch for per-axis activations.
-
Nick Korovaiko authored
* add any op
-
Robert Kimball authored
* fix the op list generator script
-
Fenglei authored
* enable more gpu test
* enable more
* more test
* more tests
-
Fenglei authored
* add gpu product
* enable test, change initial value for product
-
Fenglei authored
-
- 23 May, 2018 1 commit
-
-
Pruthvi authored
* - Added pattern matcher for LSTM cell
* WIP added support to replace lstm cell instead of subgraph
* WIP LSTM pattern matcher, fuses recurrent cells
* WIP added RNN CPU op
* WIP mkldnn emitter code for fprop RNN
* WIP RNN mkldnn integration
  - Added mkldnn kernel for unidirectional LSTM in the CPU emitter
* add a getter for root node
* recurrent graph rewrite
* fix perms, rename match_root -> get_match_root
* fix comp errors
* make match_root return the topmost match; fix tests
* - WIP GetOutputElement for handling multiple LSTM o/ps
  - use RecurrentGraphRewrite for replacing node after matching LSTM cells
* WIP LSTM multi Output + debug prints
* moved LSTM fusion to cpu_fusion
* WIP added RNN superfused OP
* WIP towards RNN layer fusion
* WIP multiple output slicing RNN
* WIP RNN multiple o/ps fusion across layer
* WIP corrected input params for fused RNN OP
* concat corresponding params across different LSTM to form inputs to RNN fused op
* i) Added test case for RNN kernel ii) runs without errors
* refactored and moved LSTM class to standalone file
* Rename RNN -> Rnn, LSTM -> Lstm
* WIP replace lstm slices to the consumer op
* Slicing works on multiple RNN layers
* fixed all bugs
* - Added CPU RNN Recurrent Fusion
  - Added CPU LSTM fusion
  - removed debug code
  - style fix
* - Added support to compute src_iter and dst_iter instead of taking zero_memory_desc
  - Added unit test to compute one LSTM cell
* changed RNN op signature to accept number of states in basic unit of RNN (GRU/LSTM/vanilla RNN) cell
* added sanity checks for RNN op
* Fixed issue related to patching the graph while replacing the RNN sliced outputs
* Fixed issue to feed the input symbols in the order X0, X1, ...Xt to the RNN op
* Added unit test for multi layer RNN fusion
* Removed debug statements
* Added multilayered serialized graph ii) fixed compilation issue
* Addressed PR comments
* i) WIP MKLDNN layout for RNN Op ii) added test case for INTERPRETER v/s CPU Rnn results
* - Fixed bug w.r.t. src_layer feature size in rnn mkldnn emitter code
  - Refactored cpu_fusion rnn test case
* merge origin/master with branch pruthvi/lstm_fusion
* style fix
* Added test case for multiple RNN layers
* i) make rnn as mkldnn op if it meets the constraints ii) assert if rnn is not mkldnn op
* fix unit test failure
* - Added support to reliably identify the hidden state and input symbols from the nodes collected by Pattern matcher
  - Fixed failing unit tests
* style fix
* - removed "node type" dependency to replace the intermediate LSTM outputs
* Addressed PR comments
* Fix unit test
* - added MKLDNN emitter for LSTM op
  - graph pass to concat LSTM input recurrent state tensors
  - CPU layout assignment for LSTM Op
  - Fixed bug in rnn/lstm unit tests
  - made changes to use replace_output instead of replace_node for replacing matched graph nodes in LSTM/RNN fusion pass (cherry picked from commit d16fc709265cc0a73e60c6d5f6d2878e7b908aca)
* style fix
* Renamed passes and style fixes
-
- 22 May, 2018 1 commit
-
-
Robert Kimball authored
-
- 21 May, 2018 4 commits
-
-
L.S. Cook authored
* editing how to execute computation file for clarity and linenos
* Add placeholder for runtime docs
* Update section on backends, interpreter, and FPGA options
* add updated master to fix python_ci
* Weird autosummary issue reverted
* Clarify new section
* remove renamed file
* sentence structure
-
Jayaram Bobba authored
* Batch norm folding
* Addressed PR feedback
* Style fixes
* Style fix
-
Yixing Lao authored
-
tsocha authored
-
- 18 May, 2018 3 commits
-
-
Nick Korovaiko authored
* use reference kernel for reverse_sequence for int
* move tests
* resolve CI errors
* TEST to NGRAPH_TEST
-
tsocha authored
-
Michał Karzyński authored
-
- 17 May, 2018 2 commits
-
-
Adam Rogowiec authored
-
Sang Ik Lee authored
If the user manually provides MKLDNN_INCLUDE_DIR and MKLDNN_LIB_DIR, don't build mkl-dnn; just add a dummy external project "ext-_mkldnn" to satisfy the target dependency for the rest of the build.
-
- 16 May, 2018 1 commit
-
-
Nick Korovaiko authored
* give frontends some flexibility over fusions they would like to run
* address jbobba's feedback
-
- 15 May, 2018 7 commits
-
-
L.S. Cook authored
* Make sure that generating pyapi does not throw errors due to directory structure or linenos
* Update basic.py
-
Chris Sullivan authored
* Output warnings and compiler errors from NVRTC, updated avg_pool kernel for CUDA 9.
* Added defguard to only apply defines for deprecated function if CUDA_VERSION < 9000. Updated shfl_xor invocation.
-
Michał Karzyński authored
-
Scott Cyphers authored
-
L.S. Cook authored
* Make sure that generating pyapi does not throw errors due to directory structure or linenos
* update docs
* one blank line instead of two
* blank line error resolve
-
Nick Korovaiko authored
-
Jayaram Bobba authored
Move temporary buffer pools to CPU runtime context to avoid static variable destruction issues with clang (#990)
-
- 14 May, 2018 6 commits
-
-
Yixing Lao authored
-
Yixing Lao authored
* Enable NNP reverse build, clean up ngraph repo
* clean mkldnn cmake
-
Fenglei authored
* add reduce op
* fix bug
* fix bug and enable tests
* hack solution to get reduction function in reduce op
* hack version working on all tests
* fixed the reduction checking process
-
Nick Korovaiko authored
* clean up, rename
* remove commented code; add comments
-
Robert Kimball authored
* throw useful error when Backend library fails to initialize
* add dlerror to error message when opening backend shared lib
-
Robert Kimball authored
* Generate binary representable floats for input data
-
- 11 May, 2018 6 commits
-
-
Scott Cyphers authored
Removed the ReLU activation that sat just before softmax. (Feeding ReLU directly into softmax is not a good idea; the model's test accuracy is now much better.)
-
Nick Korovaiko authored
-
Adam Straw authored
* move nop elimination pass to nGraph and add broadcast elimination
* fix pad test bug
* remove graph visualizer and clean up includes in nop eliminate test
* code format
-
L.S. Cook authored
* Final PR review edits plus repair abc.cpp example docs that broke when code was added
* Word
-
Nick Korovaiko authored
-
Nick Korovaiko authored
* type tests for reverse_sequence
* remove commented out code
-