Commits · 2c345798ee22b8036c2b47a96c86a0c5dabe8ee6 · submodule / ngraph

13 Jul, 2018 4 commits

Backend/API: Implementation of ADD and MUL operations in the compile() (#1200) · 2c345798

shssf authored Jul 13, 2018

* Backend/API: Implementation of ADD and MUL operations in the compile method for IntelGPU

* Branch merge conflicts resolved

* Parameters number check moved to function. RESULT operation handling added.

2c345798

reshape inplace without copy data if possible. (#1206) · 268853d0
Louis Feng authored Jul 13, 2018

268853d0

Fix incorrect hash strings for softmax and 1d maxpool. (#1195) · 4659d60d

Chris Sullivan authored Jul 13, 2018

* Bug fix in softmax cache parameters.

* Additional bug fix for maxpool1d cache parameters.

* Formatting.

* Use nthreads in primitive hash.

4659d60d

gpu reshape optimization (#1174) · b5e69eaa

Fenglei authored Jul 13, 2018

* add gpu_timer to external function

* compiled version

* working version

* using block_begin and block_end

* add the missing '
;'

* move slice to cuda emiter

* change size_t to uint32_t in kernel

* working version

* change block size from 1 to 64

* fix bugs

* nthreads need to be size_t in broadcast op

* add rank to kernel name hash

* change reshape to cuda_emitter

* fix bugs

* bug, remove rank from kernel

* clang format

* update slice in convolution

* resolve index conflict

* change align to align_to_blocksize, add overflow check

* add gird size check and fix pool merge bug

* code style, change names

* fix merge conflict

* change kernel_runner to kernel_launch

b5e69eaa

12 Jul, 2018 4 commits

Added reshape and broadcast to CSE (#1221) · cf568ef9

Louis Feng authored Jul 12, 2018

* reshape inplace without copy data if possible.

* added reshape and broadcast to CSE.

* Fixed debug messages.

cf568ef9

remove custom install path (#1164) · 41942f8b

Robert Kimball authored Jul 12, 2018

* remove custom install path

* fix travis build

* Add NGRAPH_INSTALL_PREFIX as an alias for CMAKE_INSTALL_PREFIX to make our unit tests pass.

* change install path setting

41942f8b

Bob/backend list (#1220) · 8e1954d0

Robert Kimball authored Jul 12, 2018

* open only the unversioned library but check that it is built against the correct version of ngraph

* review comments

8e1954d0

gpu safe call - add CUDA_RT_SAFE_CALL (#1222) · 97b19515

Fenglei authored Jul 12, 2018

* add CUDA_SAFE_CALL to all cuda calls

* add CUDA_RT_SAFE_CALL

* add null ptr check before free

* init pointer to nullptr

* consolidate conditions

97b19515

11 Jul, 2018 2 commits
- DEX Part 3 (#1184) · d37fa712
  Jaikrishnan Menon authored Jul 11, 2018
```
* CPU Direct Execution: Implement ConvertLayout and refactor

* CPU Direct Execution: Implement Convolution
```
  d37fa712
- Disabeled RNN fusion pass in IA transformer (#1217) · 4cd2c602
  Pruthvi authored Jul 11, 2018
  
  4cd2c602
10 Jul, 2018 1 commit
- [Py] Enable retrieve data from constant node. (#1214) · 785c1ce7
  Adam Rogowiec authored Jul 10, 2018
```
* Enable retrieving data from Constant in python.

* Test on wide value range.
```
  785c1ce7
09 Jul, 2018 4 commits
- Liveness optimizations (#1210) · 0c721561
  Robert Kimball authored Jul 09, 2018
```
* Faster liveness.

Memory manager optimized for non-sharing of tensors.
Add pass manager profiler.

* Move pass profiler to a separate PR

* Move Memory Layout optimizations to a separate PR

* use find instead of count
```
  0c721561
- Cache functions so the backend does not need to recompile (#1209) · ffe3a631
  Robert Kimball authored Jul 09, 2018
```
* Cache some generated functions in backwards tests to speed performance

* more caching
```
  ffe3a631
- [ONNX] Apply code review comments (#1213) · 9fecc560
  Michał Karzyński authored Jul 09, 2018
  
  9fecc560
- Support for multiple precompiled header files (#1208) · 198431b6
  Robert Kimball authored Jul 09, 2018
```
Better CI performance
```
  198431b6
08 Jul, 2018 2 commits
- Memory Layout pass optimizations (#1212) · 0165b27e
  Robert Kimball authored Jul 08, 2018
```
* Memory Layout pass optimizations

* rename SIMPLE memory allocator
```
  0165b27e
- add pass profiler (#1211) · e3d95453
  Robert Kimball authored Jul 08, 2018
  
  e3d95453
07 Jul, 2018 4 commits
- Backend/API: cmake module to find Intel clDNN (#1155) · 26645912
  shssf authored Jul 07, 2018
  
  26645912
- New backend construction/destruction API (#1171) · ad4dd5b0
  Robert Kimball authored Jul 07, 2018
```
* complete the new backend construction/destruction API
* close each dlopen
* don't close libraries for now as it causes python to segfault
```
  ad4dd5b0
- adding comment (#1193) · 21d22459
  Nick Korovaiko authored Jul 07, 2018
  
  21d22459
- Added predicate for alpha, in BoundedRelu (#1205) · f2b73a76
  Pruthvi authored Jul 07, 2018
  
  f2b73a76
06 Jul, 2018 4 commits

Jbobba/conv sum cleanup (#1167) · 0768a969

Jayaram Bobba authored Jul 06, 2018

* inplace compute

* fix warnings

* Initial support for convolution sum fusion

* Added in-place support for conv sum fusion and test cases

* reverting spurious changes

* Bug fix to account for inplace input in conv sum fusion

* fix compilation error

* Addressed PR feedback

* Handle corner cases for conv sum fusion. Skip computation reuse while using an inplace kernel

* Check node argument for in-place relu assignment

* Addressed PR comments

* Addressed PR feedback

0768a969

Use mkldnn reorder only for transpose/dimshuffles. (#1188) · 5be99c0a

Nishant Patel authored Jul 06, 2018

* Usage of mkldnn reshape updated

* update reshape condition for mkldnn

* Add a test case and order in which conditions are checked

5be99c0a

Collect matched nodes (#1166) · e07637c0
Nick Korovaiko authored Jul 06, 2018
```
* collect matched nodes

* clear m_matched_list

* tests

* address feedback
```
e07637c0
[Py] Expose logical And, Or operations. (#1198) · 137f002b
Adam Rogowiec authored Jul 06, 2018

137f002b

05 Jul, 2018 4 commits
- Cyphers/contrib (#1202) · 7cd38322
  Scott Cyphers authored Jul 05, 2018
```
* Fix short markup

* Minor adjustments, license requirements.
```
  7cd38322
- make logical ops input type aware (#1203) · a7c5eb01
  Nick Korovaiko authored Jul 05, 2018
  
  a7c5eb01
- fix bugs in align_to_block_size function (#1191) · f1ebcd3e
  Fenglei authored Jul 05, 2018
```
* extra *block_size

* change grid_size to threads
```
  f1ebcd3e
- fix namespace error in macro (#1194) · af956916
  Yixing Lao authored Jul 05, 2018
  
  af956916
04 Jul, 2018 1 commit
- [ONNX] add 'Add' operator (#1192) · 15d743f1
  Artur Wojcik authored Jul 04, 2018
  
  15d743f1
03 Jul, 2018 6 commits
- Update documentation link to new ngraph-tf (#1185) · 08cabb12
  Adam Procter authored Jul 03, 2018
  
  08cabb12
- Batch dot operation for rank 3 multiply with rank 2 tensors (#1180) · 238ce788
  Louis Feng authored Jul 03, 2018
```
* hacking to support dot of 3 by 2 inputs with gemm_batch.

* clean up.
```
  238ce788
- nbench cleanup (#1183) · 9d09c7e5
  Robert Kimball authored Jul 03, 2018
```
* nbench cleanup

* update style
```
  9d09c7e5
- TF-flavoured group convolution (#1182) · b6bc86bf
  Nick Korovaiko authored Jul 03, 2018
```
* tf group convolution

* change perms
```
  b6bc86bf
- [Py] API helper function broadcast_to (#1170) · 2fc0bbb4
  tsocha authored Jul 03, 2018
  
  2fc0bbb4
- onnx [2]: add core wrappers (#1169) · c086eb2d
  Artur Wojcik authored Jul 03, 2018
```
* onnx: add core wrappers
Signed-off-by: Artur Wojcik <artur.wojcik@intel.com>

* onnx: add '\n' at end of files
Signed-off-by: Artur Wojcik <artur.wojcik@intel.com>

* onnx: fix compilation with clang
Signed-off-by: Artur Wojcik <artur.wojcik@intel.com>

* onnx: fix code style
Signed-off-by: Artur Wojcik <artur.wojcik@intel.com>
```
  c086eb2d
02 Jul, 2018 4 commits

move sigmoid to core fusion (#1132) · d05b5e39

Sandeep authored Jul 02, 2018

* declare sigmoid for core fusion

* add simple test for sigmoid

* info fusion status

* cp op as main op

* builds as expected

* move sigmoid fusion code

* add reference kernel

* sigmoid bprop reference kernel and clang-format

* add delta to bprop

* fprop called

* compiles bprop

* move tests

* serializer support

* address comments in code

* add doc

* naming similar to core ops

* fix failing test

* fix failing test

* address clang issue

* more changes

* change test macro

d05b5e39

include additional dependency for clang format (#1181) · 18e58ea9
L.S. Cook authored Jul 02, 2018

18e58ea9

MKLDNN BoundedRelu implementation for Relu6 (#1179) · eaa6091c

Pruthvi authored Jul 02, 2018

* 1. Added MKLDNNN BoundedRelu op support for Relu6
2. CpuLayout && CPU assignment pass for BoundedRelu Op
3. Unit test inter v/s CPU for BoundedReluOp
4. MKLDNN and default emitter code for BoundedReluOp

* Removed Debug prints

* 1. Added support for boundedrelu to work on any constant literal
2. unit test case for rank2, rank3, rank4 for bounded relu without serialized graph

* Removed is_six() method

eaa6091c

Conv+bias shape check for better error detection (#1176) · e42e5815

Louis Feng authored Jul 02, 2018

* Reshape bias to 1D for conv + bias bprop fusion

* Reshape goe2 back to 2D before replacing

* added shape checks to validate conv+bias op.

* removed conv+bias backprop merge for separate PR review.

* fixed conv_bias_bprop test.

* minor changes to error messages.

e42e5815