1. 18 Dec, 2018 1 commit
  2. 15 Dec, 2018 1 commit
  3. 14 Dec, 2018 5 commits
  4. 13 Dec, 2018 9 commits
  5. 12 Dec, 2018 3 commits
    • Nick Korovaiko's avatar
      Removing GOEE (#2212) · 18034315
      Nick Korovaiko authored
      * make GOE extend from util::Op
      
      * fix build breaks
      
      * refactor GOEE
      
      * redundant after jbobba's fix
      
      * fix clang warnings
      
      * add an assert
      18034315
    • gaurides's avatar
      Skip Broadcast in sigmoid fusion (#2197) · 71f13654
      gaurides authored
      * Skip Broadcast in sigmoid fusion
      
      * added test case; modified file perms
      
      * incorporate review comments
      
      * using is_one() to check the node is constant&1
      71f13654
    • Adam Procter's avatar
      "Any" and "All" ops (#2217) · fc216f39
      Adam Procter authored
      * Skip --exclude-libs linker flag on macOS
      
      * Change test to if(LINUX)
      
      * Add "Any" op and AnyAllReplacement pass
      
      * Add AnyAllReplacement to IGPU backend
      
      * Stub (error-out) handlers for GPU and INTELGPU
      
      * Add 'All' op
      
      * Add AnyAllInsertion pass, deprecate deprecable ops, add stubs for INTELGPU
      
      * Add failing unit tests to INTELGPU manifest
      
      * Reduce boilerplate
      
      * Reduce more boilerplate
      
      * Add static keywords
      fc216f39
  6. 11 Dec, 2018 13 commits
    • Nick Korovaiko's avatar
      Embedding fprop (#2053) · 16d88a7f
      Nick Korovaiko authored
      * embedding fprop
      
      * add a new line
      
      * type prop tests
      
      * rename
      
      * add a stub handler for embeddinglookup on intelgpu
      
      * rename embedding.* to embedding_lookup
      
      * rename tests in manifest files
      
      * move embeddinglookup to catchall case
      
      * fix test case breaks after merge
      
      * add a negative test, pull up an assertion
      
      * fix test failures
      16d88a7f
    • Robert Kimball's avatar
      Framework for Hybrid GPU backend (#2196) · af2c4c7d
      Robert Kimball authored
      * add empty framework for hybrid GPU, or GPUH
      
      * move placement to the runtime directory
      
      * wip
      
      * skeleton for hybrid GPU backend. most unit tests pass.
      
      * cleanup
      
      * move hybrid code into hybrid dir/namespace
      
      * move hybrid functions
      
      * move more hybrid functions to hybrid directory
      
      * fix placement after compile. All unit tests passing
      
      * fix gpu backend ctor
      af2c4c7d
    • Robert Kimball's avatar
      Windows build support (#2177) · 9234cc69
      Robert Kimball authored
      * files pulled from bob/winbuild
      
      * fix compile problems
      
      * fix a few windows build errors
      
      * add windows file to exclude from git
      
      * add comment why change was made
      
      * revert obsolete change
      
      * more cleanup
      
      * building interpreter and unit test on windows with DLLs
      
      * Add flag for windows to export all symbols. Short term fix.
      
      * enable MD build
      
      * address warnings
      
      * dump all windows build results to a single directory
      
      * fix windows backend dll open issue
      
      * remove debug
      
      * fix file iterator for windows
      
      * fix merge error
      
      * fix test failure
      
      * change header from h to hpp in hopes of making python happy
      
      * address more linux build issues
      
      * fix visibility enable
      9234cc69
    • Fenglei's avatar
      nvgpu cuda softmax optimization (#2101) · a3133482
      Fenglei authored
      * add some helper function
      
      * update with new helper function
      
      * update reduce to nd with new helper function
      
      * update float sum to stable sum
      
      * fix bug
      
      * update all reduce to stable sum for float
      
      * fix bug and pass the sum stable test
      
      * remove debug info
      
      * style
      
      * update with shape
      
      * fix bug
      
      * add host parameters to cuda_emitter
      
      * clang format
      
      * fix bugs
      
      * add element::type support
      
      * format
      
      * add a cached value with datatype name
      
      * add init_reduce_value
      
      * unroll loop
      
      * optimization
      
      * remove the need for init_value
      
      * add memset kernel
      
      * add memcpy
      
      * working version
      
      * remove debug info
      
      * add comments, clean up code.
      
      * change in_idx to input_idx
      
      * fix bug
      
      * change args name for memset in emitter
      
      * pass element::Type instead of string
      
      * the op::reduce come with init value, add support
      
      * resolve codacy-bot comment
      
      * fix bug
      
      * resove codacy-bot comment
      
      * add soft_max_block_reduce kernel
      
      * fix bugs
      
      * add softmax_block_reduce to cuda_emitter
      
      * compiing ok, result wrong
      
      * fix bug in kernel
      
      * working version
      
      * removed unused code
      
      * remove unused comments, resolve comments
      
      * cuda reduce for max, min, mul, reduce op init value, format
      
      * use type::info
      
      * use type info for numeric_limits
      
      * remove code from gpu_host_parameters
      
      * header
      
      * remvoe outdated comments
      
      * add helper to check if stable sum is needed
      
      * add stable sum test for double
      
      * remove extra line
      
      * consolidate helper functions
      
      * no need list now.
      
      * remove extra ;
      
      * clang format
      
      * style
      
      * add skip test for cpu and intelGPU side
      
      * resolve more conflict
      
      * update comment
      
      * fix a warning
      
      * Update src/ngraph/runtime/gpu/gpu_cuda_kernel_builder.cpp
      
      using load.
      Co-Authored-By: 's avatarfengleitian <35274053+fengleitian@users.noreply.github.com>
      
      * using WARPSIZE instead of 32, using lambda
      
      * more WARPSIZE instead of 32
      
      * fix block_size_x bug
      
      * using __expf
      a3133482
    • gaurides's avatar
      fix crash in ReshapeConvertLayout (#2205) · 6584306c
      gaurides authored
      * fix crash in ngraph-tf test conv_ops_test.Conv2DTest.testConv2DKernelSmallerThanStrideSame
      
      * fix file perms
      
      * correct checks
      6584306c
    • Sergey Shalnov's avatar
      24bd105f
    • Chris Sullivan's avatar
      Bind cuda context to thread prior to compilation (#2199) · 31210402
      Chris Sullivan authored
      * Bind cuda context to thread prior to compilation. Small refactoring.
      
      * bind_cuda_context_to_thread in source
      
      * bind_cuda_context_to_thread header
      31210402
    • tsocha's avatar
      [Py]Add version to ngraph python (#2193) · ec0a3f5c
      tsocha authored
      * [Py]Add version to ngraph python
      
      * FIX
      ec0a3f5c
    • Nick Korovaiko's avatar
      Reshape SoftMax Reshape (#2188) · b77fd922
      Nick Korovaiko authored
      * reshape softmax reshape
      
      * add new line
      
      * add new line
      
      * fix style errors
      b77fd922
    • Nick Korovaiko's avatar
      Matcher skip (#2169) · c8bc3edc
      Nick Korovaiko authored
      * Update cpu_external_function.cpp
      
      * fix test case failures
      
      * env var to abort matching
      
      * Update matcher.cpp
      
      * Update matcher.cpp
      
      * add a comment
      
      * give an env var a better name
      c8bc3edc
    • Adam Rogowiec's avatar
      Fix setup.py for CentOS (#2163) · f46e56ec
      Adam Rogowiec authored
      * Fix installing numpy dependency on CentOS.
      
      * Check whether nGraph library directory exists.
      f46e56ec
    • Amy Zhuang's avatar
      Fix TF test failures on Mac. (#2210) · 1640d21e
      Amy Zhuang authored
      * Bug fixes to unordered map checks
      
      * No in-place slice for non-native MKLDNN layouts
      
      * is_op
      1640d21e
    • Nick Korovaiko's avatar
      is_op (#2203) · c9eef901
      Nick Korovaiko authored
      c9eef901
  7. 10 Dec, 2018 1 commit
    • harryskim's avatar
      Harryk remove winml ref (#2204) · 90aa7336
      harryskim authored
      * Removed winml from stack diagram
      
      * Removed winml from full stack diagram
      
      * Update README.md
      
      * update the diagram without winml
      
      * Changed sentence about WinML
      
      * Removed duplication
      90aa7336
  8. 08 Dec, 2018 4 commits
  9. 07 Dec, 2018 3 commits
    • Jayaram Bobba's avatar
      Update slice kernels (#2180) · a16c4961
      Jayaram Bobba authored
      * initial commit for update slice op
      
      * Finished up update_slice fusion and added codegen support
      
      * style fixes
      
      * Added unit test for in-place update-slice strided
      
      * change pattern name
      a16c4961
    • Robert Kimball's avatar
      Backend API change pre-work (#2064) · e0933553
      Robert Kimball authored
      * change compile call to return Handle
      
      * make CPU require compile() before call()
      
      * fix unit tests to call compile() before call()
      
      * fix failing ops
      
      * update unit test
      
      * revert some changes
      
      * more fixups
      
      * more diff cleanup
      
      * a few more issues addressed
      
      * more fixes
      
      * update API
      
      * more updates
      
      * fix test_ops.py
      
      * fix
      
      * another attempt to fix
      
      * fix unit test
      
      * fix test error
      e0933553
    • Sergey Shalnov's avatar