1. 07 Nov, 2018 8 commits
    • Chris Sullivan's avatar
      add dtype-generic load definitions and clean up nvrtc helpers (#1975) · f33317cc
      Chris Sullivan authored
      * Refactor include_helpers into an nvrtc specific helper file. Add templated define functions for coherent and noncoherent memory loads.
      
      * Format
      
      * const refs.
      
      * Remove cast of zero.
      f33317cc
    • Adam Procter's avatar
      af889535
    • Robert Kimball's avatar
      NOP backend (#1979) · 4918449c
      Robert Kimball authored
      * add nop backend
      
      * nop backend
      
      * fix flag name
      
      * add new switch to cmake output of switch settings
      
      * add new unit test to igpu manifest
      
      * remove redundant test
      4918449c
    • Robert Kimball's avatar
      address issues which surface with clang 6.0 (#1980) · 79802dcf
      Robert Kimball authored
      * address issues which surface with clang 6.0
      
      * revert changes due to new clang warning and disable new warning
      79802dcf
    • Adam Straw's avatar
      graph builders for quantize scale (#1976) · 8bd3846f
      Adam Straw authored
      * quantize scale passing unit tests
      
      * epsilon bump
      
      * finished with quantization scale
      
      * unit tests passing with convolution scale as builder
      
      * broadcasted constants and cleanup
      
      * api consistency for quant builders
      
      * code style
      
      * cleanup
      
      * newline at EOF
      
      * use requantization_scale
      
      * drop TF license as we are no longer using TF code directly
      8bd3846f
    • Amy Zhuang's avatar
      Do not fuse nodes if one node is predecessor of another node in horiz… (#1928) · 2a26558a
      Amy Zhuang authored
      * Do not fuse nodes if one node is predecessor of another node in horizontal fusion.
      
      * Add dead node check and remove predecessor check in horizontal fusion.
      2a26558a
    • Robert Kimball's avatar
      Add a real HybridBackend (#1998) · 45fba7b1
      Robert Kimball authored
      * wip
      
      * wip
      
      * wip
      
      * move hybrid wrapper to hybrid backend dir
      
      * move hybrid wrapper to correct namespace
      
      * wip
      
      * sorta working
      
      * remove debug from sorta working homogeneous hybrid backend
      
      * is_supported is supported for GPU
      
      * cleanup debug
      
      * more progress
      
      * remove debug
      
      * cleanup
      
      * turn off hybrid by default
      
      * revert change
      
      * revert
      
      * rename wrapper to backend
      
      * revert
      
      * address review comments
      
      * style
      45fba7b1
    • Jayaram Bobba's avatar
      Jbobba/halide (#1971) · ba73e2b8
      Jayaram Bobba authored
      * Add missing halide dependency
      
      * Bug fix in halide op creation
      
      * Localize halide/llvm to cpu backend
      
      * Added comments
      
      * Pass NGRAPH_HALIDE to tests
      
      * Resolve merge conflicts
      ba73e2b8
  2. 06 Nov, 2018 7 commits
  3. 05 Nov, 2018 4 commits
  4. 04 Nov, 2018 2 commits
  5. 03 Nov, 2018 5 commits
  6. 02 Nov, 2018 7 commits
  7. 01 Nov, 2018 4 commits
    • Adam Procter's avatar
      Partial Shapes and Types, Part 4m: BatchNorm and backprops (#1904) · 722903ac
      Adam Procter authored
      * Implement partial shape/type propagation for Convolution; fail for want of unit tests
      
      * Implement unit tests for partial shapes/types for Convolution
      
      * Implement partial shape/type validation for BatchNormInference, BatchNormTraining, BatchNormTrainingBackprop with unit tests
      
      * Formatting
      
      * Update CPU and GPU backends to deal with elimination of BatchNormBase
      
      * Update BatchNormTrainingWithStats to use templated emit_BatchNorm function
      
      * Restore five-argument BatchNormTraining ctor for now; #1901 will eliminate it
      
      * Replace enum for input indices with symbolic constants
      
      * Update intelgpu/visualize_tree.cpp to deal with removal of BatchNormBase
      722903ac
    • Sergey Shalnov's avatar
    • Nick Korovaiko's avatar
      Dropout for CPU (#1942) · 5b994011
      Nick Korovaiko authored
      * generate mask
      
      * add codegen+dex
      
      * states in context
      
      * CPU dropout
      
      * remove dead code
      
      * remove dead code
      
      * remove state.cpp
      
      * change perms, add state.cpp
      
      * address bobs feedback
      
      * restore igpu unit-test manifest after a bad merge
      
      * better error msgs
      
      * throw on GPUs to keep a compiler happy
      
      * address more feedback
      
      * fix tests
      5b994011
    • Nick Korovaiko's avatar
      GraphRewrite until fixed point (#1889) · 6c1ba614
      Nick Korovaiko authored
      * graph_rewriter until fixed point
      
      * fix a bug + comment
      
      * move comment
      
      * move rewritten=true into process_node conditional
      6c1ba614
  8. 31 Oct, 2018 3 commits
    • Robert Kimball's avatar
      Change Backend::create to return std::unique_ptr<Backend> (#1909) · 05a404a8
      Robert Kimball authored
      * create unique_ptr backend
      
      * unit test cleanup
      
      * address more code that was recently added
      
      * change from reference to pointer when passing backend to reduce the number of lines changed.
      
      * fix build error
      
      * fix python wrapper
      
      * style
      
      * more specific treatment for unique_ptr
      05a404a8
    • Sergey Shalnov's avatar
    • Adam Rogowiec's avatar
      [ONNX] Generic N-dimensional MatMul operation. (#1921) · 92c1cc19
      Adam Rogowiec authored
      * [WIP] Draft for matmul
      
      * Numpy-style broadcasting for matrix multiplication.
      
      * 3D matrix multiplication with one big Dot/slice/concat.
      
      * Generic ND matmul implementation using slice/dot/concat pattern.
      
      * Code formatting.
      
      * remove unused header
      
      * Add missing header
      
      * Utility reshape-like functions.
      
      * Use utility functions.
      
      * Review comments.
      
      * Code format
      
      * Use if/else instead of ternary operator for readability.
      
      * Remove unused function overloading
      
      * Utility function expanding tensor shape with empty axes.
      
      * Use helper functions.
      
      * Use type for auto variable initializer to fix Centos build
      
      * Fix Centos build errors.
      92c1cc19