Commits · 4514faf9d432d62ff507457cba58a6c751f77c6d · submodule / ngraph

13 Dec, 2018 4 commits
- Move CPU ReshapeSinking to Core pass (#2211) · 4514faf9
  Jimin Ha authored Dec 13, 2018
```
* Move CPU ReshapeSinking to Core pass

* Modify clang compile error

* Fix for style-apply check
```
- Reshape Broadcast (#2198) · 922aaaf8
  Nick Korovaiko authored Dec 13, 2018
```
* reshape broadcast

* fix warnings
```
- Remove old eigen code in codegen and misc bug fixes (#2189) · 556179a2
  Jayaram Bobba authored Dec 13, 2018
```
* Remove old Eigen code

* Bug fixes to unordered map checks
```
- Integration of MLSL library (#1520) · dbf3703a
  Aleksey Marchuk authored Dec 13, 2018
  
12 Dec, 2018 3 commits

Nick Korovaiko authored Dec 12, 2018

* make GOE extend from util::Op

* fix build breaks

* refactor GOEE

* redundant after jbobba's fix

* fix clang warnings

* add an assert

18034315

Skip Broadcast in sigmoid fusion (#2197) · 71f13654

gaurides authored Dec 12, 2018

* Skip Broadcast in sigmoid fusion

* added test case; modified file perms

* incorporate review comments

* using is_one() to check the node is constant&1


"Any" and "All" ops (#2217) · fc216f39

Adam Procter authored Dec 12, 2018

* Skip --exclude-libs linker flag on macOS

* Change test to if(LINUX)

* Add "Any" op and AnyAllReplacement pass

* Add AnyAllReplacement to IGPU backend

* Stub (error-out) handlers for GPU and INTELGPU

* Add 'All' op

* Add AnyAllInsertion pass, deprecate deprecable ops, add stubs for INTELGPU

* Add failing unit tests to INTELGPU manifest

* Reduce boilerplate

* Reduce more boilerplate

* Add static keywords


11 Dec, 2018 13 commits

Embedding fprop (#2053) · 16d88a7f

Nick Korovaiko authored Dec 11, 2018

* embedding fprop

* add a new line

* type prop tests

* rename

* add a stub handler for embeddinglookup on intelgpu

* rename embedding.* to embedding_lookup

* rename tests in manifest files

* move embeddinglookup to catchall case

* fix test case breaks after merge

* add a negative test, pull up an assertion

* fix test failures


Framework for Hybrid GPU backend (#2196) · af2c4c7d

Robert Kimball authored Dec 11, 2018

* add empty framework for hybrid GPU, or GPUH

* move placement to the runtime directory

* wip

* skeleton for hybrid GPU backend. most unit tests pass.

* cleanup

* move hybrid code into hybrid dir/namespace

* move hybrid functions

* move more hybrid functions to hybrid directory

* fix placement after compile. All unit tests passing

* fix gpu backend ctor


Windows build support (#2177) · 9234cc69

Robert Kimball authored Dec 11, 2018

* files pulled from bob/winbuild

* fix compile problems

* fix a few windows build errors

* add windows file to exclude from git

* add comment why change was made

* revert obsolete change

* more cleanup

* building interpreter and unit test on windows with DLLs

* Add flag for windows to export all symbols. Short term fix.

* enable MD build

* address warnings

* dump all windows build results to a single directory

* fix windows backend dll open issue

* remove debug

* fix file iterator for windows

* fix merge error

* fix test failure

* change header from h to hpp in hopes of making python happy

* address more linux build issues

* fix visibility enable


nvgpu cuda softmax optimization (#2101) · a3133482

Fenglei authored Dec 11, 2018

* add some helper function

* update with new helper function

* update reduce to nd with new helper function

* update float sum to stable sum

* fix bug

* update all reduce to stable sum for float

* fix bug and pass the sum stable test

* remove debug info

* style

* update with shape

* fix bug

* add host parameters to cuda_emitter

* clang format

* fix bugs

* add element::type support

* format

* add a cached value with datatype name

* add init_reduce_value

* unroll loop

* optimization

* remove the need for init_value

* add memset kernel

* add memcpy

* working version

* remove debug info

* add comments, clean up code.

* change in_idx to input_idx

* fix bug

* change args name for memset in emitter

* pass element::Type instead of string

* op::reduce comes with an init value; add support

* resolve codacy-bot comment

* fix bug

* resolve codacy-bot comment

* add soft_max_block_reduce kernel

* fix bugs

* add softmax_block_reduce to cuda_emitter

* compiling ok, result wrong

* fix bug in kernel

* working version

* removed unused code

* remove unused comments, resolve comments

* cuda reduce for max, min, mul, reduce op init value, format

* use type::info

* use type info for numeric_limits

* remove code from gpu_host_parameters

* header

* remove outdated comments

* add helper to check if stable sum is needed

* add stable sum test for double

* remove extra line

* consolidate helper functions

* no need list now.

* remove extra ;

* clang format

* style

* add skip test for cpu and intelGPU side

* resolve more conflict

* update comment

* fix a warning

* Update src/ngraph/runtime/gpu/gpu_cuda_kernel_builder.cpp

using load.
Co-Authored-By: fengleitian <35274053+fengleitian@users.noreply.github.com>

* using WARPSIZE instead of 32, using lambda

* more WARPSIZE instead of 32

* fix block_size_x bug

* using __expf
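
The "stable sum" items in this commit are not defined in the log itself. As a rough illustration only, and assuming the term refers to compensated (Kahan) summation, the sketch below contrasts a naive running sum with a compensated one. The function names `naive_sum` and `kahan_sum` are hypothetical and are not taken from the nGraph sources, and the example uses Python doubles rather than the CUDA kernels the commit touches.

```python
import math


def naive_sum(values):
    """Plain left-to-right accumulation; small terms can be rounded away."""
    total = 0.0
    for x in values:
        total += x
    return total


def kahan_sum(values):
    """Compensated summation: carry the rounding error forward in a second variable."""
    total = 0.0
    compensation = 0.0  # estimate of the low-order bits lost so far
    for x in values:
        y = x - compensation            # apply the correction to the next term
        t = total + y                   # low-order bits of y may be lost here
        compensation = (t - total) - y  # recover what was lost
        total = t
    return total


# One large term followed by terms too small to change it individually.
values = [1.0] + [1e-16] * 10

print("naive:", naive_sum(values))
print("kahan:", kahan_sum(values))
print("fsum :", math.fsum(values))  # correctly rounded reference from the standard library
```

The compensated version costs a few extra floating-point operations per element, which is presumably why the commit also adds a helper to check whether a stable sum is needed.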


fix crash in ReshapeConvertLayout (#2205) · 6584306c

gaurides authored Dec 11, 2018

* fix crash in ngraph-tf test conv_ops_test.Conv2DTest.testConv2DKernelSmallerThanStrideSame

* fix file perms

* correct checks


IntelGPU backend: Fix reshape operation (#2201) · 24bd105f
Sergey Shalnov authored Dec 11, 2018


Bind cuda context to thread prior to compilation (#2199) · 31210402

Chris Sullivan authored Dec 11, 2018

* Bind cuda context to thread prior to compilation. Small refactoring.

* bind_cuda_context_to_thread in source

* bind_cuda_context_to_thread header


[Py]Add version to ngraph python (#2193) · ec0a3f5c
tsocha authored Dec 11, 2018
```
* [Py]Add version to ngraph python

* FIX
```
Reshape SoftMax Reshape (#2188) · b77fd922
Nick Korovaiko authored Dec 11, 2018
```
* reshape softmax reshape

* add new line

* add new line

* fix style errors
```

Matcher skip (#2169) · c8bc3edc

Nick Korovaiko authored Dec 11, 2018

* Update cpu_external_function.cpp

* fix test case failures

* env var to abort matching

* Update matcher.cpp

* Update matcher.cpp

* add a comment

* give an env var a better name


Fix setup.py for CentOS (#2163) · f46e56ec

Adam Rogowiec authored Dec 11, 2018

* Fix installing numpy dependency on CentOS.

* Check whether nGraph library directory exists.


Fix TF test failures on Mac. (#2210) · 1640d21e

Amy Zhuang authored Dec 11, 2018

* Bug fixes to unordered map checks

* No in-place slice for non-native MKLDNN layouts

* is_op


is_op (#2203) · c9eef901
Nick Korovaiko authored Dec 11, 2018


10 Dec, 2018 1 commit

Harryk remove winml ref (#2204) · 90aa7336

harryskim authored Dec 10, 2018

* Removed winml from stack diagram

* Removed winml from full stack diagram

* Update README.md

* update the diagram without winml

* Changed sentence about WinML

* Removed duplication


08 Dec, 2018 4 commits

change all_close tests to return gtest AssertionResult instead of bool (#2195) · fcdfc4ce

Robert Kimball authored Dec 08, 2018

* change all_close tests to return gtest AssertionResult instead of bool to allow for better error messages

* change throw to return error

* address PR comments and fix compile error


reenable mkldnn convolution for large padding (#2168) · 15d9b658

Jayaram Bobba authored Dec 08, 2018

* reenable mkldnn convolution for large padding

* specify precision tolerance to unit test

* pass tolerance values to all_close


move GPU specific test to GPU only (#2191) · 40dda4eb

Robert Kimball authored Dec 08, 2018

* move GPU specific test to GPU only

* fix unit test invocation

* fix compile error

* fix compile error

* style

* fix runtime error


make GOE extend from util::Op (#2153) · 453a6a3c
Nick Korovaiko authored Dec 08, 2018
```
* make GOE extend from util::Op

* fix build breaks
```

07 Dec, 2018 6 commits

Update slice kernels (#2180) · a16c4961

Jayaram Bobba authored Dec 07, 2018

* initial commit for update slice op

* Finished up update_slice fusion and added codegen support

* style fixes

* Added unit test for in-place update-slice strided

* change pattern name


Backend API change pre-work (#2064) · e0933553

Robert Kimball authored Dec 07, 2018

* change compile call to return Handle

* make CPU require compile() before call()

* fix unit tests to call compile() before call()

* fix failing ops

* update unit test

* revert some changes

* more fixups

* more diff cleanup

* a few more issues addressed

* more fixes

* update API

* more updates

* fix test_ops.py

* fix

* another attempt to fix

* fix unit test

* fix test error


IntelGPU backend: Fix memory copy into zero tensors (#2192) · c95bdf64
Sergey Shalnov authored Dec 07, 2018


Support for all_close_f w/ doubles (#2184) · 125f7242

gcwenger authored Dec 07, 2018

* Double support for all_close_f

* all_close_f uses fixed number of mantissa bits now. Simplified testing code.

* Initialize test data members in constructor to values which will cause test failure. Setup then sets them correctly.

* Reduce info printed out during all_close_f unit tests.
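
The log does not say how `all_close_f` uses its mantissa bits. One common way to compare floats within a bit-based tolerance is to measure their distance in representable values (ULPs). The sketch below shows that idea for float32 only. The name `close_f`, the parameters `mantissa_bits` and `tolerance_bits`, and their defaults are assumptions made for illustration and may not match the nGraph implementation.

```python
import struct


def _ordered_bits(x):
    """Map a float32 to an unsigned int whose ordering matches the float ordering."""
    (bits,) = struct.unpack("<I", struct.pack("<f", x))
    if bits & 0x80000000:
        return ~bits & 0xFFFFFFFF  # negative values: flip every bit
    return bits | 0x80000000       # non-negative values: set the sign bit


def close_f(a, b, mantissa_bits=24, tolerance_bits=2):
    """Return True if a and b are within the allowed distance as float32 values.

    mantissa_bits is the number of significand bits treated as meaningful
    (float32 has 24, counting the implicit leading bit); tolerance_bits is how
    many of the lowest meaningful bits are allowed to differ.
    """
    if a != a or b != b:  # NaN never compares close
        return False
    shift = 24 - mantissa_bits + tolerance_bits
    tolerance = 1 << shift
    return abs(_ordered_bits(a) - _ordered_bits(b)) <= tolerance


print(close_f(1.0, 1.0 + 2 ** -23))              # adjacent float32 values
print(close_f(1.0, 1.0001))                      # hundreds of ULPs apart
print(close_f(1.0, 1.0001, mantissa_bits=8))     # coarser precision assumed
```

Fixing `mantissa_bits` makes the tolerance independent of the storage type, so the same threshold could in principle be applied to data that was computed at lower precision.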


Update TBB from 2019_U1 to 2019_U2. (#2154) · 91c4b553
Sang Ik Lee authored Dec 07, 2018

re-enable quantize_clamp_int32 test on CPU (#2090) · bba2b3bd
Adam Straw authored Dec 07, 2018
```
* re-enable quantize_clamp_int32 test on CPU

* MLKDNN typo
```

06 Dec, 2018 9 commits

QCBiasAdd and QCBiasSignedAdd for mkldnn (#2062) · 1f40160d

Nishant Patel authored Dec 06, 2018

* Quantize the bias to int32

* Bias scale fix

* mnist works

* Quantize Bias

* Introduce Quantize op in the graph to quantize bias & feedback

* Add QuantizedConvBiasAdd

* Comments and some refactoring

* Add test case with float bias and enable int32 as quantized type in ngraph

* Change shape of scale from Shape{} to Shape{1} in the backend

* Add QuantizedConvBiasSignedAdd

* Fix Layouts, clean up and a test case for QCBA

* Test case for QCBSA

* cleanup mkldnn_emitter.hpp

* fix build error

* Constant fold


IntelGPU backend: Allow more cases for clDNN gemm (#2187) · 4034a0c2
Sergey Shalnov authored Dec 06, 2018


DEX Loop Kernel (updated) (#2156) · 8fc481a3

Nick Korovaiko authored Dec 06, 2018

* one output

passing tests

clean up

fix build breaks

* move generators into a separate file


add a throw in lieu of a return stmt (#2183) · 56980738
Nick Korovaiko authored Dec 06, 2018

an env var to disable individual fusions (#2185) · 504e78f8
Nick Korovaiko authored Dec 06, 2018
```
* an env var to disable individual fusions

* fix env var name
```
Give Fusions Names (#2178) · a09d5f88
Nick Korovaiko authored Dec 06, 2018
```
* give fusions names

* fix build breaks

* fix perms
```
Abort messages in Matcher to better understand cases where we fail to match (#2179) · 06916cbc
Nick Korovaiko authored Dec 06, 2018
```
*  abort messages in matcher.cpp

* style fixes
```

Graph comparison - isolated per op testing (#2144) · 1feb49f1

gcwenger authored Dec 06, 2018

* Isolated per op testing when comparing graphs for better determination of source of accuracy divergence.

* Improve clarity of comment


[Py] Update README for PyPI (#2151) · 8a9cf8aa

Michał Karzyński authored Dec 06, 2018

* Update README for PyPI

* Update README for PyPI

* Remove redundant newlines

* Fix links
