1. 08 May, 2018 1 commit
    • MaxPoolWithIndices (#900) · a174c8c9
      Nick Korovaiko authored
      * MaxPoolWithIndices CPU Fusion
      
      * fix test to pass checks in cpu_fusion
      
      * pass test
      
      * clean up
      
      * add a new pass, add layouts
      
      * remove the opt from cpu_fusion
      
      * refactor cpu_layout logic for maxpool, clean up comments
      
      * add comment w.r.t. indices tensor
      
      * rename to cpu_workspace_insertion
      
      * add CPUWorkspaceInsertion pass for TF
  2. 04 May, 2018 1 commit
  3. 23 Apr, 2018 1 commit
  4. 18 Apr, 2018 1 commit
    • Weight Fusion (#853) · 8cb48d37
      Nick Korovaiko authored
      * CPU weight fusion initial version
      
      * add tests for weight_fusion
      
      * address @jbobba's feedback
      
      * before cleaning up convolution_weight_optimization.cpp
      
      * clean up, rename, fix perms, fix format
  5. 16 Apr, 2018 1 commit
  6. 13 Apr, 2018 1 commit
    • Remove legacy Backend API (#848) · ec501913
      Robert Kimball authored
      * remove deprecated
      
      * remove all legacy Backend API usage
      
      remove deprecated files
      
      * pull in changes from master
      
      * fix GPU calls
      
      * disable tests in convolution generator
      
      * update per PR comments. Enable performance counter feature.
      
      * update per PR comments
      
      * fix build error
      
      * fix conditionally compiled test :(
  7. 09 Apr, 2018 1 commit
  8. 04 Apr, 2018 1 commit
    • Support multi-output ops in Adjoints (#796) · 5f0e8dc3
      Nick Korovaiko authored
      * refactor Adjoints to support multi-output ops
      
      * passing tests
      
      * switch to generate_adjoints(deltas) and backprop_node
      
      * remove debugging code
      
      * fix error msg
      
      * fix typo adjoitns
      
      * fix comp errors in mnist_mlp
  9. 03 Apr, 2018 1 commit
  10. 02 Apr, 2018 1 commit
    • Pruthvi/bn to support globalstats (#783) · 1d80cabe
      Pruthvi authored
      * WIP support bn training for global_stats
      
      (cherry picked from commit eb81a37328ea177b1d58c9eebdbb345e0fa25f0d)
      
      * - Style fix
      - Fix test case
      
      * Addressed PR comments
      - added support for bn training/inference with the same ctor
      - added more verbose comments in bn header
      
      * Fixed bn serializer and default value in bn ctor for bwd compatibility
      
      * proposed docs change
      
      * - Addressed PR comments
        - added support to compute bn inference/training using same mkldnn kernel with global stats
      
      * fix bn relu unit test
  11. 30 Mar, 2018 1 commit
  12. 29 Mar, 2018 1 commit
  13. 28 Mar, 2018 2 commits
  14. 24 Mar, 2018 1 commit
    • ngmx-372 Fuse RNN matrix operations (#724) · de760a38
      Louis Feng authored
      * test graph.
      
      * rnn matrix fusion wip.
      
      * WIP.
      
      * matrix fused.
      
      * clean up.
      
      * additional refactor.
      
      * fixed merge errors.
      
      * testing.
      
      * added test case.
      
      * more testing.
      
      * more testing.
      
      * bias wip.
      
      * debugging.
      
      * fusing bias too.
      
      * disabled debug outputs.
      
      * removed debugging.
      
      * removed testing models.
      
      * removed debug models.
      
      * reset nbench.
      
      * clean up and testing.
      
      * removed debug code.
      
      * updated comments and var names.
      
      * rename var.
      
      * removed debug code.
      
      * removed debug code.
      
      * fixed compiler warnings.
      
      * refactored test.
      
      * Added const to parameter.
      
      * address some of the review comments.
      
      * added comments and addressed reviews.
  15. 23 Mar, 2018 1 commit
  16. 22 Mar, 2018 2 commits
    • Make MatMulBias aware of addition commutativity (#713) · a855a3ad
      Nick Korovaiko authored
      * make matmulbias callback aware that addition is commutative
    • Pruthvi/bn inference (#670) · 5394ad2d
      Pruthvi authored
      * Added new ctor for bn which supports Inference
      - added mkldnn emitter code for bn inference
      * Added test case for bn inference
      - added support for layout propagation for bn inference
      * added sanity checks for gamma, beta, mean, variance shape in bn
      * added serializer support for bn inference
  17. 21 Mar, 2018 2 commits
  18. 13 Mar, 2018 1 commit
    • Pruthvi/sigmoid bprop (#630) · 490e4e63
      Pruthvi authored
      * - Added pattern matcher for bprop sigmoid
      - mkldnn emitter code for sigmoid bprop
      - Fusion pass unit test for sigmoid bprop
      - style fix
      
      * Added test case for bprop sigmoid
      
      * fixed sigmoid bprop test case failure
      
      * fixed bprop unit test values for sigmoid
      
      * style fix
      
      * fix typo
      
      * Addressed PR comments
      - added layout assignment pass to ensure delta and input have same layout for SigmoidBprop
  19. 11 Mar, 2018 1 commit
  20. 09 Mar, 2018 2 commits
    • clang format · 362bb996
      Louis Feng authored
    • Pruthvi/sigmoid (#614) · 5885c09a
      Pruthvi authored
      * - Added sigmoid fusion pass
      - added mkldnn emitter code for sigmoid
      
      * - corrected sigmoid expected values
      - add layout assignment for sigmoid op
      
      * - added asserts in cpu fusion for sigmoid
      - style fix
      
      * remove debug prints
      
      * NGMX-371 #comment addressed PR comments - i) Added sigmoid unit test case with 3D input ii) support in cpu_emitter for sigmoid to handle all input shapes
      
      * NGMX-371 #comment use shape_size() to calculate the 1d input size
  21. 08 Mar, 2018 1 commit
  22. 07 Mar, 2018 6 commits
    • bn fprop mkldnn optimized implementation (#581) · 9db548c6
      Pruthvi authored
      * - Added support for optimized bn mkldnn implementation in cpu emitter
      - modified bn unit_test to support new implementation
      - added layout assignment for bn op
      - Style Fix
      
      (cherry picked from commit 7747a40806d62c126059d5c873adcd2e61a0adb0)
      
      * modified value initialization in cpu_fusion to be explicitly float
      
      (cherry picked from commit 03499d380073d0197ab8cbd154eb03f63b042a48)
      
      * fix compilation issue
      
      * Addressed PR comments
      - added exception if gamma and beta layout is not equal to memory::format::x
      - throw exception if bn Op is not mkldnn op
      
      * fix compilation issue
      
      * added support to handle multiple o/ps in fprop bn fusion
      
      * - Removed layout pass for bn
      - fixed autodiff bug in bn
      - added "Add" for the dispatcher in cpu-layout pass
      
      * style fix
      
      * Fix bprop batchnorm test with get_output_elements
      
      * Style fix
    • clang format. · d37b30ad
      Louis Feng authored
    • clean up. · 338b9622
      Louis Feng authored
    • simplify convbias test. · 812a699a
      Louis Feng authored
    • refactor and clean up. · 8b7f042d
      Louis Feng authored
    • more tests. · 97c2ce20
      Louis Feng authored
  23. 06 Mar, 2018 3 commits
  24. 02 Mar, 2018 1 commit
    • Batchnorm Bprop v2 (#567) · e4b90a9c
      Nick Korovaiko authored
      * one output
      
      multiple outputs
      
      initial clean-up
      
      * test clean-up
      
      current version
      
      test pass
      
      * clean up
      
      * fix format
      
      * add dbeta,dgamma asserts
      
      * revert some files
      
      * 0644 on node.cpp
      
      * 0644 on mkldnn_utils.cpp
      
      * 0644 on more files
      
      * add support for serialization + test case
      
      * fix merge errors
  25. 28 Feb, 2018 1 commit
  26. 27 Feb, 2018 1 commit
  27. 22 Feb, 2018 2 commits
  28. 14 Feb, 2018 1 commit
    • pattern matcher for BatchnormFprop + mkldnn integration in the CPU emitter (#468) · 34b1322d
      Pruthvi authored
      * fuse dot(a,b) + c
      
      cblas_gemm working on mlp
      
      rebase & small fixes
      
      enable debug output
      
      support replacing function's outputs
      
      * WIP pattern matching for variance
      
      * - Added pattern matcher graph to look up variance(sub graph) in bn
      - Added test case to verify the variance graph pattern
      
      * added batch norm mean pattern matcher.
      
      * remove reshapes
      
      (cherry picked from commit ecad321fb1b1bc3f7facda229beb940118ca0701)
      
      * fixed mean test to use Matcher.
      
      * resolve merge conflict in test/pattern.cpp
      
      * WIP bn fprop pattern
      
      * fprop bn fusion working
      
      * - Added unit test case to read the bn serialized *.json file and run bn fprop fusion pass
      - Added batchnorm header file and defined the bn class to emit the mkldnn kernel
      - Added pattern matcher for fprop bn in CPU graph_rewrite pass
      
      * WIP MKLDNN fprop bn emitter code
      
      * completed fprop batchnorm kernel in CPU emitter
      
      * fixed bug in the emitter code for fprop bn
      
      * - Fixed compilation issues
      - unit tests are passing for bn emitter fprop code
      
      * Added support to compute fprop bn with mean and variance as input
      
      * resolved compilation issues
      
      * refactored bn fprop code
      
      * - added batchnorm src file to the CMakeFilelist
      - moved bn fusion under CPU runtime/pass/cpu_fusion
      - fixed compilation issue
      
      * Resolved compilation issues in bn emitted code
      
      * Added debug statements in fprop bn emitted code
      
      * added batchnorm.cpp src file
      
      * - Added test case to test fprop batchnorm with known tensor values
      - fixed bug related to defining weights in fprop bn
      
      * - Added test case for fprop batchnorm Op
      - Added test case for mean and variance pattern matcher
      - Added fprop bn *.json file with input having 4 dims mb2c3h2w2
      - refactored fprop bn op class
      
      * Style fix
      
      * - Removed Debug symbols
      
      * - Fixed header template with correct year
      - appended mkldnn.hpp in the CPU generated code
      
      * Addressed PR review comments
      - added support for batchnorm op in serializer and de-serializer
      - added more sanity checks in bn constructor
      - renamed "BatchnormFprop" -> BatchNorm
      
      * - Addressed PR review comments
      - replaced auto with specific mkldnn::type in emitted bn kernel
      - modified function signature to take 'eps' as double instead of <Node> type
      
      * added missing header files, resolved compilation issue
      
      * style fix
      
      * Addressed PR comments
      1. initialized member variables for bn in the same order as they are defined
      2. renamed bn member variables to start with m_* as per coding convention
      3. moved bn fusion test to test/cpu_fusion.cpp
      4. style fix
      5. added more checks to evaluate type and shape of inputs to bn
      
      * Added support for EMITDECL macro for batchnorm
      
      * - made correction to batchnorm src file name batchnorm -> batch_norm as per coding guidelines
      - corrected bn copy_with_new_args() method
      
      * Removed redundant SqrtOp support in serializer