1. 22 Aug, 2018 1 commit
  2. 21 Aug, 2018 3 commits
    • Robert Kimball's avatar
      Statically link cpu backend into ngraph shared library (#1444) · 5ab5a129
      Robert Kimball authored
      * static link cpu library to ngraph
      
      * remove debug
      
      * link ngraph and cpu backend into a single shared object
      
      * add -fPIC and whole-archive for CPU backend
      
      * Added conditional for --whole-archive for Mac OS.
      
      * Added more conditonal for MacOS.
      
      * fix linking problem and unit test failures caused by multiple copies of the same function in CPU backend and INTERPRETER
      
      * fix nbench build
      
      * add nbench to unit test build
      
      * add version number to libngraph
      5ab5a129
    • Nick Korovaiko's avatar
      ArgMin (#1435) · 951e77b4
      Nick Korovaiko authored
      * argmin
      
      * address feedbacka argmin
      
      * add new lines
      
      *  addnew lines
      
      * address adam's nitpicks
      
      * scott's feedback
      
      * fix unit tests
      951e77b4
    • Michał Karzyński's avatar
      [ONNX] Add Relu op (#1448) · 73942928
      Michał Karzyński authored
      * [ONNX] Add Relu op
      73942928
  3. 17 Aug, 2018 2 commits
  4. 15 Aug, 2018 1 commit
  5. 14 Aug, 2018 1 commit
  6. 13 Aug, 2018 2 commits
  7. 12 Aug, 2018 1 commit
  8. 11 Aug, 2018 1 commit
  9. 10 Aug, 2018 2 commits
  10. 08 Aug, 2018 2 commits
  11. 07 Aug, 2018 2 commits
    • Matthew Brookhart's avatar
      reduce fprop cache outputs (#1343) · efa2561e
      Matthew Brookhart authored
      * reduce fprop cache outputs
      
      * refactor traverse nodes
      
      * Slight refactor, add test, adress PR comments
      
      * fix formatting
      efa2561e
    • Jayaram Bobba's avatar
      Switch to using more expressive layout descriptors instead of numeric layout names (#1278) · 69c51c27
      Jayaram Bobba authored
      * Switch to using mkldnn memory descriptors for layout
      
      * More changes for using mkldnn descriptor instead of format
      
      * Removed mkldnn format from cpu layout descriptor. TODO - shuffle folding
      
      * Rotate mkldnn layouts on transpose
      
      * Modifications to builder reshape to skip rotated layouts
      
      * More fixes to layouts and removes axis order from cpu layout descriptor
      
      * Code cleanup
      
      * Removed shuffle folding pass since the functionality is subsumed by the layout pass
      
      * Canonicalize a few more formats to keep MKLDNN happy.
      
      * Style fixes
      
      * Style fixes
      
      * Style fixes
      
      * Addressed PR feedback and added reshape passthrough for non-transpose cases
      
      * Adjust named formats for weights tensors to keep MKLDNN happy
      
      * Style fixes
      
      * resolved merge issues
      69c51c27
  12. 03 Aug, 2018 2 commits
    • Nick Korovaiko's avatar
      11b992a7
    • Chris Sullivan's avatar
      Preallocate intermediate buffers (#1231) · 0599a628
      Chris Sullivan authored
      * Utilize GPUMemoryManager/Allocator for preallocation of intermediate tensor buffer memory.
      
      * Formatting.
      
      * Merge with master required rework of memory due to CFE pass. Moved function memory pool allocation to pass as a result.
      
      * Formatting.
      
      * Added pass source files.
      
      * Updated tests to account for new assert check. All GPUAllocators should be deconstructed before allocation is made in GPUMemoryManager.
      
      * GPUAllocator::close() can be used to close the allocator prior to destruction
      
      * Removed open allocators. Replaced check with inspection of pass::MemoryManager node list.
      
      * Formatting.
      
      * Rename m_memory_buffers -> m_tensor_memory_buffers. Use full path to static alignment variable.
      
      * FunctionMemoryReservation -> TensorMemoryReservation. Only return true in pass if reservation is made (bug fix).
      
      * Moved static compilation mutex.
      
      * Update external function with new pass name.
      
      * GPU_ExternalFunction: Add s_memory_pool_alignment, remove optimize_and_assemble method.
      0599a628
  13. 02 Aug, 2018 3 commits
  14. 27 Jul, 2018 3 commits
    • Nick Korovaiko's avatar
      is_contained (#1257) · 81c48453
      Nick Korovaiko authored
      81c48453
    • Nick Korovaiko's avatar
      CSE constant (#1271) · 953c65f8
      Nick Korovaiko authored
      953c65f8
    • Adam Procter's avatar
      Add some convenience macros/classes for error messages (#1258) · deacf29a
      Adam Procter authored
      * Testing out some ideas for better error messages on AvgPool
      
      * Add uncaught_exception() check to ConstructionAssertLogger dtor
      
      * More general assertion class, not homed inside Node
      
      * Minor formatting change
      
      * NODE_ASSERT for type prop failure
      
      * Produce lighter-weight DummyAssertionHandler when assertion succeeds
      
      * New ctor for AssertionHelper that takes a single location arg; more const&-ness for the constructors
      
      * Remove move constructor for AssertionHelper; fix broken test in assertion.cpp
      
      * Miscellaneous improvements
      
      * Templatized AssertionHelper so different exception classes can be used; implemented TYPE_CHECK_ASSERT around this
      * Changed from a "stack" of locations to a single location (the stack was too complicated)
      * Added "FAIL" classes/macros which do not take a condition
      
      * Rename a helper function
      
      * Cleanup, cruft removal
      
      * Add test to make sure the assert helper has the lifetime we expect
      
      * Missing includes
      deacf29a
  15. 26 Jul, 2018 1 commit
  16. 18 Jul, 2018 3 commits
  17. 17 Jul, 2018 1 commit
    • Jayaram Bobba's avatar
      Added more convolution variants to DEX (#1223) · 9bb0b653
      Jayaram Bobba authored
      * CPU Direct Execution: Implement ConvertLayout and refactor
      
      * CPU Direct Execution: Implement Convolution
      
      * 1) Adds computation reuse to direct execution
      2) Add avg_pool, broadcast and convolution_bias to direct execution
      3) Moved some computation reuse utility functions to graph_utils
      
      * Use lists instead of vectors to avoid reallocation overheads
      
      * - Added convolution variants to direct execution
      - Removed ConvolutionBiasRelu, use ConvolutionBias instead
      - Reduced code duplication by moving functionality to mkldnn_emitter
        from cpu_emitter
      
      * Style fix
      
      * Moved mkldnn build_convolution to a templated method
      
      * Style fix
      
      * refactored mkldnn conv bprop builders
      
      * Style fix
      9bb0b653
  18. 14 Jul, 2018 1 commit
  19. 13 Jul, 2018 1 commit
  20. 12 Jul, 2018 2 commits
  21. 11 Jul, 2018 1 commit
  22. 09 Jul, 2018 2 commits
  23. 07 Jul, 2018 1 commit
  24. 06 Jul, 2018 1 commit