1. 12 Jan, 2019 1 commit
  2. 09 Jan, 2019 1 commit
  3. 08 Jan, 2019 8 commits
  4. 07 Jan, 2019 4 commits
  5. 05 Jan, 2019 1 commit
    • Chris Sullivan's avatar
      nvgpu backend without clang (#2115) · 757621be
      Chris Sullivan authored
      * Separate out external function base class.
      
      * pt1 first step to removing m_writer from GPU_Emitter.
      
      * pt2 add gpu_internal function skeleton
      
      * pt3 temporarily add to gpu_backend for prototyping.
      
      * pt4 add call frame (partial) and runtime constructor
      
      * pt 5 implement resolution for function memory reservations. build new tensor wrapper for use with call frame.
      
      * pt 6 resolve compilation errors.
      
      * pt 7 Add host emitter for emitting host primtives and implement in gpu emitter.
      
      * pt 8 add compile time manifest.
      
      * pt 9 add simple runtime tracer.
      
      * pt 10 seperate runtimes for different functions. index by function name, should switch to using function instance_id for look up performance.
      
      * pt 11 add function call interface and support nested call frames
      
      * pt 12 Reshape elimination check in emitter needs to include offset.
      
      * pt 13 Add default indentation to all op emissions in gpu external functions.
      
      * pt 14 fix constant mem reservation (should not depend on the tmeporary buffers existence check.
      
      * pt 15 backward pooling for avg pool requires only one param. rather than passing this param
      three times, this commit changes the runtime to detect if its avgpooling and pass the appropriate pointers.
      This is a hold over until max and avgpool are refactored into separate cudnn emitters.
      
      * pt 16 update cmake compatibility. gpu backend can now be built without clang via NGRAPH_DEX_ONLY.
      if this cmake variable is not define, then both clang codegen (via gpu external function) and interpreter (via gpu internal function) modes will be built.
      for now codegen is the default backend but can be explicitly disabled by setting the env. variable to NGRAPH_CODEGEN=0/FALSE/NO/etc.
      
      additional note: made codegen::CodeWriter header-only so that it can be used independently of whether the clang codegen library is compiled.
      
      * pt 17 fix issues with merge from master
      
      * pt 18 factor compile function into a few virtual calls so that common passes can be added in a single location for both backends.
      
      * pt 19 formatting
      
      * Remove code_writer.cpp from cmake and disable (temporarily) some reduce tests that require changes to gpu_emitter.cpp
      
      * Move call frame and runtime constructor implementations to source files.
      
      * Use member m_common_function_string.
      
      * Applying analogous bug fix as found in #2145
      
      * Remove underscore from GPU_CompiledFunction, GPU_ExternalFunction, and GPU_InternalFunction.
      
      * Made static members of GPUCompiledFunction static methods.
      
      * Remove 'No' codegen options, use std::toupper and applied format
      
      * review comments
      
      * Remove vector overload for resolve inputs/outputs in GPUCallFrame.
      
      * Remove diagnostic pragmas
      757621be
  6. 03 Jan, 2019 6 commits
  7. 02 Jan, 2019 2 commits
  8. 31 Dec, 2018 1 commit
  9. 29 Dec, 2018 1 commit
  10. 28 Dec, 2018 2 commits
  11. 23 Dec, 2018 2 commits
    • Robert Kimball's avatar
      Hybrid GPU Backend (#2240) · 90503652
      Robert Kimball authored
      * Add GPUH hybrid backend
      
      * update manifests
      
      * update node operator<<
      
      * fix GOE
      
      * remove debug
      
      * remove debug
      
      * more cleanup
      
      * add parent support to cpu and intel gpu backend tensors
      
      * cleanup
      
      * fix odd failure when printing node during construction
      
      * fix node output
      
      * address review comments
      
      * style
      90503652
    • Robert Kimball's avatar
      Remove code designed to support the Ninja cmake generator (#2241) · 42f16035
      Robert Kimball authored
      * update build byproducts to support ninja
      
      * remove unused cmake code
      
      * more cmake cleanup
      
      * display error message if Ninja generator is requested
      
      * fix mkldnn ext project
      
      * revert onnx cmake file
      
      * revert protobuf cmake file
      
      * revert mlsl cmake file
      
      * more fixing
      42f16035
  12. 22 Dec, 2018 4 commits
  13. 21 Dec, 2018 2 commits
    • Nishant Patel's avatar
      Support dynamic scales for Qconv's and Dequantize (#2171) · 7e310e20
      Nishant Patel authored
      * Support dynamic scales for Qconv's and Dequantize
      
      * Remove constant folding
      
      * add additional dynamic_quantize unittest
      
      * add another mxnet quantize unittest
      
      * add additional dynamic_dequantize tests
      
      * fix shape error
      
      * add dynamic signed_quantize unittest
      
      * Pass correct scale
      
      * Refactoring
      
      * Added dynamic scale support for QCBA and QCBSA
      
      * Refactor to create MKLDNN primitives on the first iteration
      
      * remove stray code
      
      * unused variables
      
      * remove extraneous line
      7e310e20
    • gcwenger's avatar
      Graph comparison testing quiet unless problem is detected (#2258) · c153ea8a
      gcwenger authored
      * Graph comparison testing quiet unless problem is detected.
      
      * Fixed file formatting
      
      * Renamed ss => msg
      c153ea8a
  14. 20 Dec, 2018 3 commits
    • Robert Kimball's avatar
      Implement ngraph::element::Type as a wrapper around an enum (#2120) · 5d80f203
      Robert Kimball authored
      * cleanup and add enum
      
      * new type working
      
      * enum works
      
      * use type enum
      
      * cleanup
      
      * fix errant past to source file
      
      * fix type
      
      * safely construct the type map
      
      * fix get_type_info_map return type
      5d80f203
    • Adam Rogowiec's avatar
      3f2cd153
    • L.S. Cook's avatar
      Doc 0.11 (#2219) · 9940123b
      L.S. Cook authored
      * editing docs
      
      * more doc updates
      
      * Cleanup theme, update backends for PlaidML, remove stale font
      
      * Add PlaidML description and doc update that should have been added with PR 1888
      
      * Add PlaidML description and doc update that should have been added with PR 1888
      
      * Latest release doc updates
      
      * Add PlaidML description and doc update for PR 1888
      * Update glossary with tensor description and quantization def
      * Refactor landpage with QuickStart guides
      * Add better details about nGraph features and roadmap
      
      * Placeholder detail for comparison section
      
      * Add section link
      
      * order sections alphabetically for now
      
      * update compiler illustration
      
      * Address feedback from doc review
      
      * Update illustration wording
      
      * Formatting and final edits
      
      * keep tables consistent
      
      * Clarify doc on bridge and compiler docs
      
      * Clarify doc on bridge and compiler docs
      
      * yay for more feedback and improvements
      
      * edit with built doc
      
      * Fix typo
      
      * Another phase of PR review editing
      
      * Final review comment resolved
      
      * note grammatically-correct wording preferred as often as possible.
      
      * First iteration of shared subgraphs with onnx doc
      
      * Updte onnx wheel install instructions with latest ngraph-0.9.0 versioning
      
      * Updte onnx wheel install instructions with latest ngraph-0.9.0 versioning
      
      * Update section  on subgraphs and shared subgraph docs
      
      * Finalize edit of mxnet tutorial given status of our PR
      
      * Make sure latest conf py is being used
      
      * Update to latest index
      
      * Add link to design doc mentioned by Ashoke and update about for consistent headings
      
      * Update with PR feedback
      
      * Update with new pip install instructions
      
      * add more testing
      
      * Further feedback review included
      
      * Improve descriptions, given the new pip pkg install options
      
      * Add note to onnx_ssg_tutorial
      
      * Links updated to latest correct url
      
      * Improve docs for Beta
      
      * Reorganize TOC
      
      * Better org in sections
      
      * Make heading style consistent across indexes
      
      * Update intro to framework builders
      
      * Update intro to framework builders
      
      * Add feedback from reviewers
      
      * Minor fixes to ToC and editing
      
      * Add section on FMV for miscellaneous use cases
      
      * Update notice on README
      
      * Updte link to howto index
      
      * fix typo
      
      * fix note
      
      * Update glossary
      9940123b
  15. 19 Dec, 2018 2 commits