- 13 Dec, 2018 4 commits
-
-
Jimin Ha authored
* Move CPU ReshapeSinking to Core pass * Modify clang compile error * Fix for style-apply check
-
Nick Korovaiko authored
* reshape broadcast * fix warnings
-
Jayaram Bobba authored
* Remove old Eigen code * Bug fixes to unordered map checks
-
Aleksey Marchuk authored
-
- 12 Dec, 2018 3 commits
-
-
Nick Korovaiko authored
* make GOE extend from util::Op * fix build breaks * refactor GOEE * redundant after jbobba's fix * fix clang warnings * add an assert
-
gaurides authored
* Skip Broadcast in sigmoid fusion * added test case; modified file perms * incorporate review comments * using is_one() to check the node is constant&1
-
Adam Procter authored
* Skip --exclude-libs linker flag on macOS * Change test to if(LINUX) * Add "Any" op and AnyAllReplacement pass * Add AnyAllReplacement to IGPU backend * Stub (error-out) handlers for GPU and INTELGPU * Add 'All' op * Add AnyAllInsertion pass, deprecate deprecable ops, add stubs for INTELGPU * Add failing unit tests to INTELGPU manifest * Reduce boilerplate * Reduce more boilerplate * Add static keywords
-
- 11 Dec, 2018 13 commits
-
-
Nick Korovaiko authored
* embedding fprop * add a new line * type prop tests * rename * add a stub handler for embeddinglookup on intelgpu * rename embedding.* to embedding_lookup * rename tests in manifest files * move embeddinglookup to catchall case * fix test case breaks after merge * add a negative test, pull up an assertion * fix test failures
-
Robert Kimball authored
* add empty framework for hybrid GPU, or GPUH * move placement to the runtime directory * wip * skeleton for hybrid GPU backend. most unit tests pass. * cleanup * move hybrid code into hybrid dir/namespace * move hybrid functions * move more hybrid functions to hybrid directory * fix placement after compile. All unit tests passing * fix gpu backend ctor
-
Robert Kimball authored
* files pulled from bob/winbuild * fix compile problems * fix a few windows build errors * add windows file to exclude from git * add comment why change was made * revert obsolete change * more cleanup * building interpreter and unit test on windows with DLLs * Add flag for windows to export all symbols. Short term fix. * enable MD build * address warnings * dump all windows build results to a single directory * fix windows backend dll open issue * remove debug * fix file iterator for windows * fix merge error * fix test failure * change header from h to hpp in hopes of making python happy * address more linux build issues * fix visibility enable
-
Fenglei authored
* add some helper function * update with new helper function * update reduce to nd with new helper function * update float sum to stable sum * fix bug * update all reduce to stable sum for float * fix bug and pass the sum stable test * remove debug info * style * update with shape * fix bug * add host parameters to cuda_emitter * clang format * fix bugs * add element::type support * format * add a cached value with datatype name * add init_reduce_value * unroll loop * optimization * remove the need for init_value * add memset kernel * add memcpy * working version * remove debug info * add comments, clean up code. * change in_idx to input_idx * fix bug * change args name for memset in emitter * pass element::Type instead of string * the op::reduce come with init value, add support * resolve codacy-bot comment * fix bug * resove codacy-bot comment * add soft_max_block_reduce kernel * fix bugs * add softmax_block_reduce to cuda_emitter * compiing ok, result wrong * fix bug in kernel * working version * removed unused code * remove unused comments, resolve comments * cuda reduce for max, min, mul, reduce op init value, format * use type::info * use type info for numeric_limits * remove code from gpu_host_parameters * header * remvoe outdated comments * add helper to check if stable sum is needed * add stable sum test for double * remove extra line * consolidate helper functions * no need list now. * remove extra ; * clang format * style * add skip test for cpu and intelGPU side * resolve more conflict * update comment * fix a warning * Update src/ngraph/runtime/gpu/gpu_cuda_kernel_builder.cpp using load. Co-Authored-By: fengleitian <35274053+fengleitian@users.noreply.github.com> * using WARPSIZE instead of 32, using lambda * more WARPSIZE instead of 32 * fix block_size_x bug * using __expf
-
gaurides authored
* fix crash in ngraph-tf test conv_ops_test.Conv2DTest.testConv2DKernelSmallerThanStrideSame * fix file perms * correct checks
-
Sergey Shalnov authored
-
Chris Sullivan authored
* Bind cuda context to thread prior to compilation. Small refactoring. * bind_cuda_context_to_thread in source * bind_cuda_context_to_thread header
-
tsocha authored
* [Py]Add version to ngraph python * FIX
-
Nick Korovaiko authored
* reshape softmax reshape * add new line * add new line * fix style errors
-
Nick Korovaiko authored
* Update cpu_external_function.cpp * fix test case failures * env var to abort matching * Update matcher.cpp * Update matcher.cpp * add a comment * give an env var a better name
-
Adam Rogowiec authored
* Fix installing numpy dependency on CentOS. * Check whether nGraph library directory exists.
-
Amy Zhuang authored
* Bug fixes to unordered map checks * No in-place slice for non-native MKLDNN layouts * is_op
-
Nick Korovaiko authored
-
- 10 Dec, 2018 1 commit
-
-
harryskim authored
* Removed winml from stack diagram * Removed winml from full stack diagram * Update README.md * update the diagram without winml * Changed sentence about WinML * Removed duplication
-
- 08 Dec, 2018 4 commits
-
-
Robert Kimball authored
* change all_close tests to return gtest AssertionResult instead of bool to allow for better error messages * change throw to return error * address PR comments and fix compile error
-
Jayaram Bobba authored
* reenable mkldnn convolution for large padding * specify precision tolerance to unit test * pass tolerance values to all_close
-
Robert Kimball authored
* move GPU specific test to GPU only * fix unit test invocation * fix compile error * fix compile error * style * fix runtime error
-
Nick Korovaiko authored
* make GOE extend from util::Op * fix build breaks
-
- 07 Dec, 2018 6 commits
-
-
Jayaram Bobba authored
* initial commit for update slice op * Finished up update_slice fusion and added codegen support * style fixes * Added unit test for in-place update-slice strided * change pattern name
-
Robert Kimball authored
* change compile call to return Handle * make CPU require compile() before call() * fix unit tests to call compile() before call() * fix failing ops * update unit test * revert some changes * more fixups * more diff cleanup * a few more issues addressed * more fixes * update API * more updates * fix test_ops.py * fix * another attempt to fix * fix unit test * fix test error
-
Sergey Shalnov authored
-
gcwenger authored
* Double support for all_close_f * all_close_f uses fixed number of mantissa bits now. Simplified testing code. * Initialize test data members in constructor to values which will cause test failure. Setup then sets them correctly. * Reduce info printed out during all_close_f unit tests.
-
Sang Ik Lee authored
-
Adam Straw authored
* re-enable quantize_clamp_int32 test on CPU * MLKDNN typo
-
- 06 Dec, 2018 9 commits
-
-
Nishant Patel authored
* Quantize the bias to int32 * Bias scale fix * mnist works * Quantize Bias * Introduce Quantize op in the graph to quantize bias & feedback * Add QuantizedConvBiasAdd * Comments and some refactoring * Add test case with float bias and enable int32 as quantized type in ngraph * Change shape of scale from Shape{} to Shape{1} in the backend * Add QuantizedConvBiasSignedAdd * Fix Layouts, clean up and a test case for QCBA * Test case for QCBSA * cleanup mkldnn_emitter.hpp * fix build error * Constant fold
-
Sergey Shalnov authored
-
Nick Korovaiko authored
* one output passing tests clean up fix build breaks * move generators into a separate file
-
Nick Korovaiko authored
-
Nick Korovaiko authored
* an env var to disable individual fusions * fix env var name
-
Nick Korovaiko authored
* give fusions names * fix build breaks * fix perms
-
Nick Korovaiko authored
* abort messages in matcher.cpp * style fixes
-
gcwenger authored
* Isolated per op testing when comparing graphs for better determination of source of accuracy divergence. * Improve clarity of comment
-
Michał Karzyński authored
* Update README for PyPI * Update README for PyPI * Remove redundant newlines * Fix links
-