- 13 Nov, 2018 4 commits
-
-
Ayan Moitra authored
* Add cudnn sum and test that exposes failure * Add test to interpreter manifest and move test to sum file * change test name * Fails INTELGPU and no need to run on CPU * Comment incorporation + kahan summation for interpreter * small oversight correction * resolve conflict * minor variable name edit
-
Adam Rogowiec authored
* Add static keyword for helper function. * Fix MatMul for cases where left hand side is 1D vector. - Add unit-test for this case. * Add new line at the end of file. * Log warning when dealing with scalars * Apply clang-format * Review: fix spelling, rename test model.
-
Amy Zhuang authored
* Add cacheablility propagation pass. * Use a functor to create op annotations. * Address PR feedback. * Address PR feedback. * Address PR feedback.
-
mchrusci authored
* Kill previous builds on PR update * Remove Jenkinsfile.groovy * Lower case method names * Fix method notify() * Added comment Previous build deletion workaround to be removed as soon as better, less security vulnerable solution is found. * Fix inconsistent stage names
-
- 12 Nov, 2018 5 commits
-
-
Jayaram Bobba authored
* Moved mkldnn conv availability checks to utils and use it across passes * Style fix
-
Pruthvi authored
* cse for convert layout * addressed PR comments * Addressed PR comments
-
Nishant Patel authored
* Quantize the bias to int32 * Bias scale fix * mnist works * Quantize Bias * Introduce Quantize op in the graph to quantize bias & feedback * Comments and some refactoring * Add test case with float bias and enable int32 as quantized type in ngraph * Change shape of scale from Shape{} to Shape{1} in the backend
-
Nick Korovaiko authored
* tracing * count tracepoint * address scotts feedback * merge * fix an ununsed var warning
-
Jayaram Bobba authored
* Faster argmax/argmin kernels * Use switch statement for macro
-
- 11 Nov, 2018 2 commits
-
-
Fenglei authored
* add softmax cuda support * optimize block size * remove debug info * remove debug * style * remove unused * remove cudnn softmax * format * using nullptr * move helper, add test * fix style * using all_close_f * using kahansum * style * remove commentted out code
-
Fenglei authored
* add isfinite check * style * output 5 diff and total diff * output limit of diff for all_close_f * dix bug * disable tests * remove failing unit test that does not make sense.
-
- 10 Nov, 2018 4 commits
-
-
Adam Procter authored
-
gcwenger authored
* Heterogenous sub-graph comparison testing * Print index for float differences * Disabled compare_backends_with_graphs on most backends for now. Moved to new file. Added testing of unsigned values. * Fixed element::boolean range. Added missing include. * Switched use of shared_ptr as parm to raw *. Moved to using namespace std in cpp. Fixed comment marker in unit_test.manifest files. Switched some EXPECT_EQ TO ASSERT_EQ. Fixed parameterized test disabling. * Frozen naming -> serialized. Removed extraneous comments. * Graph comparison unit test relies on CPU for reference, so only build when CPU is built. * Reworked per backend disabling of compare_backends_with_graphs
-
Nick Korovaiko authored
-
Adam Procter authored
* Update l2_norm and std_dev builders to use op::Sqrt instead of op::Power(...,0.5) * Removed unneeded power.hpp include
-
- 09 Nov, 2018 11 commits
-
-
Robert Kimball authored
* optimization for about 2x speedup * more optimizations
-
Nick Korovaiko authored
-
Jayaram Bobba authored
-
Nick Korovaiko authored
-
Adam Procter authored
* Add ShapeOf op * Helps to check in the source files * Add shape_of_scalar to unit test manifests * Add missing include to gpu_emitter.cpp * Change 'this op is experimental' wording per @indie's suggestion * New idea: let's try not mallocing 300 terabytes * Update interpreter implementation
-
L.S. Cook authored
* editing docs * more doc updates * Cleanup theme, update backends for PlaidML, remove stale font * Add PlaidML description and doc update that should have been added with PR 1888 * Add PlaidML description and doc update that should have been added with PR 1888 * Latest release doc updates * Add PlaidML description and doc update for PR 1888 * Update glossary with tensor description and quantization def * Refactor landpage with QuickStart guides * Add better details about nGraph features and roadmap * Placeholder detail for comparison section * Add section link * order sections alphabetically for now * update compiler illustration * Address feedback from doc review * Update illustration wording * Formatting and final edits * keep tables consistent * Clarify doc on bridge and compiler docs * Clarify doc on bridge and compiler docs * yay for more feedback and improvements * edit with built doc * Fix typo * Another phase of PR review editing * Final review comment resolved
-
Nick Korovaiko authored
-
Robert Kimball authored
* fix gtest abi build * fix llvm build with abi flag * remove debug * add check for conflicting flags in cmake
-
Robert Kimball authored
* all tests passing * rename a few vars to be consistent with new tensor names
-
Artur Wojcik authored
Signed-off-by: Artur Wojcik <artur.wojcik@intel.com>
-
Amy Zhuang authored
* Add in-place-slice optimization for CPU backend. * Modify slice emitter function for in place slice. * Allow arg node to have multiple outputs for in place slice. * Remove unused variable. * Add CPUExecutionContext argument to slice builder. * Address PR feedback: move computation out of the functor. * Move size computation out of the functor for in place concat.
-
- 08 Nov, 2018 6 commits
-
-
Chris Sullivan authored
* When CUDNN_DATA_TYPE == CUDNN_DATA_DOUBLE, it appears that the cudnn calculated workspace size is incorrect. Adding a temporary fix here until the underlying issue is found. * Add softmax test illustrating bug in cudnn impl. * disable new unit test in intel GPU
-
Fenglei authored
* change reduce using cuda, add support for AND, OR * fix bug and format * remove unused code * style * change reduce_op to reduce_func to avoid shadow, thansk Ayan. * using dynamic_pointer_cast
-
mchrusci authored
-
Artur Wojcik authored
-
mchrusci authored
-
Rob Earhart authored
-
- 07 Nov, 2018 8 commits
-
-
Jayaram Bobba authored
* Collapse dimensions for inputs to Dot * Remove eigen kernels for higher dimension dots since they will collapse to cblas_gemm kernels * Moved collapse dims pass after the fusion passes to prevent interference with fusion patterns. Use cblas_gemm for 2D dot
-
Chris Sullivan authored
* Refactor include_helpers into an nvrtc specific helper file. Add templated define functions for coherent and noncoherent memory loads. * Format * const refs. * Remove cast of zero.
-
Adam Procter authored
-
Robert Kimball authored
* add nop backend * nop backend * fix flag name * add new switch to cmake output of switch settings * add new unit test to igpu manifest * remove redundant test
-
Robert Kimball authored
* address issues which surface with clang 6.0 * revert changes due to new clang warning and disable new warning
-
Adam Straw authored
* quantize scale passing unit tests * epsilon bump * finished with quantization scale * unit tests passing with convolution scale as builder * broadcasted constants and cleanup * api consistency for quant builders * code style * cleanup * newline at EOF * use requantization_scale * drop TF license as we are no longer using TF code directly
-
Amy Zhuang authored
* Do not fuse nodes if one node is predecessor of another node in horizontal fusion. * Add dead node check and remove predecessor check in horizontal fusion.
-
Robert Kimball authored
* wip * wip * wip * move hybrid wrapper to hybrid backend dir * move hybrid wrapper to correct namespace * wip * sorta working * remove debug from sorta working homogeneous hybrid backend * is_supported is supported for GPU * cleanup debug * more progress * remove debug * cleanup * turn off hybrid by default * revert change * revert * rename wrapper to backend * revert * address review comments * style
-