1. 05 Jun, 2019 1 commit
  2. 14 May, 2019 1 commit
    • Remove remaining osx compiler warnings (#2925) · 1cbc1517
      Scott Cyphers authored
      * Fix clang compiler warnings
      
      * Remove unintended file.
      
      * style
      
      * Not part of PR
      
      * Another extra closure ref
      
      * More warnings from merges
      
      * Lambda arg was used
      
      * Remove remaining osx compiler warnings
      
      * style
      
      * Try to avoid compiler warning
      
      * Same for the other test
  3. 13 May, 2019 1 commit
  4. 26 Apr, 2019 1 commit
  5. 11 Apr, 2019 1 commit
    • [Dynamic Shape] Moving BatchDot to Core Op (#2691) · cc8dd452
      Louis Feng authored
      * batch dot WIP.
      
      * cpu backend refactor and unit tests pass.
      
      * WIP.
      
      * batch dot interpreter implementation.
      
      * minor clean up.
      
      * more clean up.
      
      * patching the gpu backends.
      
      * added more tests, fixes, etc.
      
      * fixed compile error.
      
      * renamed batch dot to batch matmul.
      
      * refactor WIP.
      
      * fixes some tests and formatting.
      
      * more fixes.
  6. 22 Feb, 2019 1 commit
  7. 18 Jan, 2019 1 commit
    • Adds backprop to BatchDot op, allows fusion in training. (#2297) · ef778693
      Louis Feng authored
      * batch dot bprop WIP.
      
      * WIP.
      
      * testing.
      
      * clean up debug code.
      
      * comments and var name change.
      
      * clean up.
      
      * format style, batch dot differentiable pass.
      
      * removed debug output.
      
      * added unit test to autodiff, refactored make_function -> make_function_from_file.
      
      * fixed build warning.
      
      * fixed gpu build error.
      
      * clang format fix.
      
      * allow test_tools.cpp to find SERIALIZED_ZOO
      
      * remove cmake redef.
      
      * fix unused macro.
      
      * making test cpu only.
      
      * testing build var
      
      * macro test
      
      * verbose makefile test
      
      * style fix
      
      * verbose make
      
      * test/util needs test/models.
      
      * removed debug output.
      
      * refactor fusion type.
      
      * refactor fusion type.
  8. 03 Jan, 2019 1 commit
  9. 19 Dec, 2018 1 commit
  10. 13 Dec, 2018 1 commit
  11. 07 Dec, 2018 1 commit
    • Backend API change pre-work (#2064) · e0933553
      Robert Kimball authored
      * change compile call to return Handle
      
      * make CPU require compile() before call()
      
      * fix unit tests to call compile() before call()
      
      * fix failing ops
      
      * update unit test
      
      * revert some changes
      
      * more fixups
      
      * more diff cleanup
      
      * a few more issues addressed
      
      * more fixes
      
      * update API
      
      * more updates
      
      * fix test_ops.py
      
      * fix
      
      * another attempt to fix
      
      * fix unit test
      
      * fix test error
  12. 28 Nov, 2018 1 commit
    • Cyphers/bnorm back (#2129) · 403a09ce
      Scott Cyphers authored
      * Fix batchnorm argument order, cleanup some comments, fix backprop
      
      * Merge error
      
      * Clean up training function, organize inference test
      
      * BatchNormInference tests
      
      * Training case
      
      * Training test
      
      * Fix autodiff BatchNorm test
      
      * Cleanup
      
      * Move file to doc checkout
      
      * Update disabled test name in igpu manifest
      Fix unused variable
      
      * Unit tests disables
      
      * Review comments
  13. 16 Nov, 2018 1 commit
  14. 11 Nov, 2018 1 commit
    • add isfinite check for all_close (#2028) · 702d465a
      Fenglei authored
      * add isfinite check
      
      * style
      
      * output 5 diff and total diff
      
      * output limit of diff for all_close_f
      
      * fix bug
      
      * disable tests
      
      * remove failing unit test that does not make sense.
  15. 31 Oct, 2018 1 commit
    • Change Backend::create to return std::unique_ptr<Backend> (#1909) · 05a404a8
      Robert Kimball authored
      * create unique_ptr backend
      
      * unit test cleanup
      
      * address more code that was recently added
      
      * change from reference to pointer when passing backend to reduce the number of lines changed.
      
      * fix build error
      
      * fix python wrapper
      
      * style
      
      * more specific treatment for unique_ptr
  16. 22 Oct, 2018 1 commit
    • BatchNorm splitting into ops (2nd try) (#1828) · 1beec46b
      Nick Korovaiko authored
      * split bn into bn_inference bn_training
      
      * fix warnings
      
      * Add GPU support for the new BN ops (#1569)
      
      * Add GPU support and change batchnorm_globalstats test to use BNInference.
      
      * Changed test back to using BNTraining for global stats and updated cudnn backend to account for it.
      
      * Fix issues in merge with master.
      
      * Formatting.
      
      * CPU fixes
      
      * remove 5-arg training BN for now
      
      * more fixes
      
      * python batchnorm changes
      
      * fix onnx_import
      
      * fix a call to BatchNormInference c-tor
      
      * yet another fix to BatchNormInference c-tor
      
      * AND yet another fix to batchnorm_inference c-tor
      
      * ops.py
      
      * address adam's feedback
      
      * Remove unnecessary parameter/argument.
      
      * remove batch_norm_training_relu_with_global_stats
      
      * remove bn_relu (training)
  17. 29 Sep, 2018 1 commit
  18. 05 Sep, 2018 1 commit
  19. 29 Aug, 2018 1 commit
  20. 27 Aug, 2018 1 commit
  21. 17 Aug, 2018 1 commit
  22. 13 Aug, 2018 1 commit
  23. 03 Aug, 2018 1 commit
  24. 02 Aug, 2018 1 commit
  25. 09 Jul, 2018 1 commit
  26. 02 Jul, 2018 1 commit
    • move sigmoid to core fusion (#1132) · d05b5e39
      Sandeep authored
      * declare sigmoid for core fusion
      
      * add simple test for sigmoid
      
      * info fusion status
      
      * cp op as main op
      
      * builds as expected
      
      * move sigmoid fusion code
      
      * add reference kernel
      
      * sigmoid bprop reference kernel and clang-format
      
      * add delta to bprop
      
      * fprop called
      
      * compiles bprop
      
      * move tests
      
      * serializer support
      
      * address comments in code
      
      * add doc
      
      * naming similar to core ops
      
      * fix failing test
      
      * fix failing test
      
      * address clang issue
      
      * more changes
      
      * change test macro
  27. 02 Jun, 2018 1 commit
  28. 18 May, 2018 1 commit
  29. 10 May, 2018 2 commits
  30. 09 May, 2018 1 commit
    • CUDNN and CUDA kernels for AvgPool (forward/backward) (#951) · b1b3d4d6
      Chris Sullivan authored
      * Added op::AvgPool cudnn impl. which works for 2-3 spatial dimensions and no/symmetric padding. Enabled tests.
      
      * Added cuda-c implementation of average pool which handles 1-3 spatial
      dimensions as well as asymmetric padding. This commit also introduces
      several helper functions for performing fast integer division and
      fast constant memory access.
      
      * Formatting. Removed bool that was used for testing to force the cuda impl. over cudnn.
      
      * Added CUDNN AvgPoolBackprop implementation.
      
      * Removed inline enum in preference of a helper struct. Removed instances of multiple declarations on a single line. Updated comments.
      
      * Removed _ prefix from helper functions in anonymous namespace.
  31. 01 May, 2018 1 commit
  32. 24 Apr, 2018 1 commit
  33. 21 Apr, 2018 1 commit
    • Add Inference Engine (IE) backend (#883) · 3d590dea
      Adam Straw authored
      * ie backend and manager with passing unit tests except for select/function
      
      * fix function_call and select
      
      * simplify implementation by removing support for convert and select
      
      * remove manager
  34. 16 Apr, 2018 1 commit
  35. 13 Apr, 2018 2 commits
    • Remove legacy Backend API (#848) · ec501913
      Robert Kimball authored
      * remove deprecated
      
      * remove all legacy Backend API usage
      
      remove deprecated files
      
      * pull in changes from master
      
      * fix GPU calls
      
      * disable tests in convolution generator
      
      * update per PR comments. Enable performance counter feature.
      
      * update per PR comments
      
      * fix build error
      
      * fix conditionally compiled test :(
    • Add GPURuntimeContext and GPUPrimitiveEmitter to the gpu transformer (#837) · 026bede0
      Chris Sullivan authored
      * Begin prototype of cudnn_emitter.
      
      * Added GPURuntimeContext to gpu_external_function for passing through to JIT functions.
      
      * gpu_emitters now utilize gpu runtime context.
      
      * Moved cublas and cudnn handles into GPURuntimeContext pointer and out of callframe EntryPoint.
      
      * Added CUDNNEmitter, comparable to MKLDNNEmitter,
      which allows for cudnn kernels to be defined via
      lambda primitives that are emitted and
      subsequently called during graph execution.
      An example implementation is provided for op::Sum.
      
      * Added GPURuntimeContext to gpu_external_function for passing through to JIT functions.
      
      * gpu_emitters now utilize gpu runtime context.
      
      * Moved cublas and cudnn handles into GPURuntimeContext pointer and out of callframe EntryPoint.
      
      * GPURuntimeContext should be stored as unique_ptr in external function.
      
      * GPURuntimeContext should be stored as unique_ptr in external function.
      
      * Extract raw pointer from unique for cudnn_emitter.
      
      * Removing unrelated code from PR.
      
      * GPURuntimeContext needs to be a strict C interface in case
      the native compiler and clang are utilizing different glibc ABIs.
      Updated to reflect this.
      
      * Added cudnn::primitive typedef for better readability.
      
      * Moved allocation of CudaFunctionPool to external function
      so that it is available during gpu emission.
      
      * Fixed too-late initialization of cudart.
      
      * Fixed too-late initialization of cudart.
      
      * CUDNNEmitter moved into superset class GPUPrimitiveEmitter.
      The GPUPrimitiveEmitter handles the emission of all gpu primitives,
      including cudnn, cuda, and cublas. CUBLASEmitter support not yet included.
      
      * Added unordered_map for caching primitives in the gpu_emitter.
      
      * Added dtor to GPUPrimitiveEmitter to cleanup compiled functions.
      
      * Adding back a serialized model graph that was accidentally rem

      * Added a few additional helpers to use ngraph::row_major_strides.
      
      * added whitespace per @fengleitian's comment
      
      * added whitespace per @fengleitian's comment
      
      * Remove implicit type conversions from size_t to int.
      
      * Add op::MaxPool, op::MaxPoolBackprop and op::Pad to GPU transformer (#817)
      
      * Added pooling for 1 and 2 dimensions. 1d uses a cuda kernel and 2d utilizes cudnn.
      Padding is not yet supported.
      
      * Normalized call signature on gpu emission for 1d max pool. Added a few comments.
      
      * Max pool backprop impl. in progress. Amend this commit.
      
      * Max pool backprop implemented. Note that cuDNN
      requests the output tensor for the maxpool operation but it is not required for computation.
      
      * Formatting and invocation for maxpool changed.
      
      * Fixed too-late initialization of cudart.
      
      * Added padding kernel that is used with maxpool. Need to investigate remaining tests.
      
      * Changed dimensionality check to correctly
      determine if data is 1d or not.
      
      * Added 3d MaxPooling (forward), verified by forcing 2d case to use Nd pooling routines.
      
      * Added 3d MaxPooling (backward), verified by forcing 2d case to use Nd pooling routines.
      
      * Moved cudnn prologues for maxpool into ngraph runtime and out of primitive so
      that the only execution occurring on the JIT runtime is the evaluation of the op kernel.
      
      * Refactored forward and backward pooling into single CUDNNEmitter::build_pooling interface
      with a runtime switch to determine if the op is forward or backward propagation.
      
      * Cache preconstructed cudnn kernel for maxpool if it has already been constructed.
      
      * Forgot to add padding arrays back into cudnn kernel for MaxPool in the 2d case.
      
      * Fixed namespace issues and use join(...,'_')
      
      * Refactored 4d/Nd tensor descriptor builder into single function.
      
      * Changed conditionals and comments. Now throws if MaxPool on more than 3 spatial dimensions is requested.
      
      * Fixed forward declare for GPURuntimeContext (class -> struct).
      
      * Clang complains about missing braces on brace-initializer. Fixed implicit conversions.
      
      * Fixed implicit conversions (clang).
      
      * Reverting changes on autodiff test for maxpool. @Krovatkin will update later.
  36. 04 Apr, 2018 1 commit
    • Support multi-output ops in Adjoints (#796) · 5f0e8dc3
      Nick Korovaiko authored
      * refactor Adjoints to support multi-output ops
      
      * passing tests
      
      * switch to generate_adjoints(deltas) and backprop_node
      
      * remove debugging code
      
      * fix error msg
      
      * fix typo adjoitns
      
      * fix comp errors in mnist_mlp
  37. 26 Mar, 2018 1 commit
  38. 21 Mar, 2018 1 commit