Commits · c6d1af4f6b76722c8419ffcd188d7c1726d29d76 · submodule / ngraph

16 Apr, 2018 8 commits
- Remove collect_tensor_views and clean up CallFrames (#866) · c6d1af4f
  Robert Kimball authored Apr 16, 2018
```
* remove tensor_call from backends

* remove obsolete methods
```
  c6d1af4f
- Fix element type for create_tensor of cached fprop nodes in backprop_derivative (#862) · aadc9ce4
  Adam Procter authored Apr 16, 2018
  
  aadc9ce4
- rename get_input_op to get_argument in python wrapper (#872) · d60cd1d5
  Robert Kimball authored Apr 16, 2018
  
  d60cd1d5
- CMake: Allow build target arch to be overridden (#859) · 99e02417
  Jaikrishnan Menon authored Apr 16, 2018
```
* CMake: Allow build target arch to be overridden

* Add DNGRAPH_TARGET_ARCH option to install docs
```
  99e02417
- get_input_op -> get_argument (#852) · 16571afd
  Nick Korovaiko authored Apr 16, 2018
```
* get_input_op -> get_argument

* more replacing

* more replacing2
```
  16571afd
- working version (#858) · d7216dfc
  Fenglei authored Apr 16, 2018
  
  d7216dfc
- [Py] Modify default paths (#845) · c7438a66
  tsocha authored Apr 16, 2018
```
* Update default paths in setup.py

* Update defaults arguments in tox
```
  c7438a66
- Update python wrapper to new Backend API (#863) · b5a0d734
  Robert Kimball authored Apr 16, 2018
```
* remove obsolete

* change to use new Backend API

* rename parameter
```
  b5a0d734
13 Apr, 2018 7 commits

Remove legacy Backend API (#848) · ec501913

Robert Kimball authored Apr 13, 2018

* remove deprecated

* remove all legacy Backend API usage

remove deprecated files

* pull in changes from master

* fix GPU calls

* disable tests in convolution generator

* update per PR comments. Enable performance counter feature.

* update per PR comments

* fix build error

* fix conditionally compiled test :(

ec501913

BatchNorm documentation (#856) · 1e091f6f
Scott Cyphers authored Apr 13, 2018
```
* BatchNorm documentation

* Fix typo, install URL

* Switch to desired BatchNorm
```
1e091f6f
make sure matcher respects argument order for non-commutative ops (#847) · b32b5c23
Nick Korovaiko authored Apr 13, 2018

b32b5c23
added the reference OS marker to the image name defined in the contrib/docker/Makefile (#841) · 638f36ee
DawnStone authored Apr 13, 2018
```
fixed variable settings in contrib/docker/make-dimage.sh script
```
638f36ee

[Py] Add python wrapper for nGraph Reduce operation. (#827) · c80a1076

arogowie-intel authored Apr 13, 2018

* Add python wrapper for nGraph Reduce operation.

- Add UT.

* Refactoring.

- Add UT case with default reduction on all axes.

* Extend `reduce` operation signature to also accept `Function` object.

- Add UT case.

* Fix formatting errors.

c80a1076

Add backend call validation and unit tests (#857) · e7cf2662
Robert Kimball authored Apr 13, 2018

e7cf2662

Add GPURuntimeContext and GPUPrimitiveEmitter to the gpu transformer (#837) · 026bede0

Chris Sullivan authored Apr 13, 2018

* Begin prototype of cudnn_emitter.

* Added GPURuntimeContext to gpu_external_function for passing through to JIT functions.

* gpu_emitters now utilize gpu runtime context.

* Moved cublas and cudnn handles into GPURuntimeContext pointer and out of callframe EntryPoint.

* Added CUDNNEmitter, comparable to MKLDNNEmitter,
which allows for cudnn kernels to be defined via
lambda primitives that are emitted and
subsequently called during graph execution.
An example implementation is provided for op::Sum.

* Added GPURuntimeContext to gpu_external_function for passing through to JIT functions.

* gpu_emitters now utilize gpu runtime context.

* Moved cublas and cudnn handles into GPURuntimeContext pointer and out of callframe EntryPoint.

* GPURuntimeContext should be stored as unique_ptr in external function.

* Extract raw pointer from unique for cudnn_emitter.

* Removing unrelated code from PR.

* GPURuntimeContext needs to be a strict C interface in case
the native compiler and clang are utilizing different glibc ABIs.
Updated to reflect this.

* Added cudnn::primitive typedef for better readability.

* Moved allocation of CudaFunctionPool to external function
so that it is available during gpu emission.

* Fixed too-late initialization of cudart.

* CUDNNEmitter moved into superset class GPUPrimitiveEmitter.
The GPUPrimitiveEmitter handles the emission of all gpu primitives,
including cudnn, cuda, and cublas. CUBLASEmitter support not yet included.

* Added unordered_map for cacheing primitives in the gpu_emitter.

* Added dtor to GPUPrimitiveEmitter to cleanup compiled functions.

* Adding back a serialized model graph that was accidentally rem* Added a few additional helpers to use ngraph::row_major_strides.

* added whitespace per @fengleitian's comment

* Remove implicit type conversions from size_t to int.

* Add op::MaxPool, op::MaxPoolBackprop and op::Pad to GPU transformer (#817)

* Added pooling for 1 and 2dimensions. 1d uses a cuda kernel and 2d utilizes cudnn.
Padding is not yet supported.

* Normalized call signature on gpu emission for 1d max pool. Added a few comments.

* Max pool backprop impl. inprogress. Amend this commit.

* Max pool backprop implemented. Note that cuDNN
requests the output tensor for the maxpool operation but it is not required for computation.

* Formatting and invokation for maxpool changed.

* Fixed too-late initialization of cudart.

* Added padding kernel that is used with maxpool. Need to investigate remaining tests.

* Changed dimensionality check to correctly
determine if data is 1d or not.

* Added 3d MaxPooling (forward), verified by forcing 2d case to use Nd pooling routines.

* Added 3d MaxPooling (backward), verified by forcing 2d case to use Nd pooling routines.

* Moved cudnn prologues for maxpool into ngraph runtime and out of primitive so
that the only execution occuring on the JIT runtime is the evaluation of the op kernel.

* Refactored forward and backward pooling into single CUDNNEmitter::build_pooling interface
with a runtime switch to determine if the op is forward or backward propagation.

* Cache preconstructed cudnn kernel for maxpool if it has already been constructed.

* Forgot to add padding arrays back into cudnn kernel for MaxPool in the 2d case.

* Fixed namespace issues and use join(...,'_')

* Refactored 4d/Nd tensor descriptor builder into single function.

* Changed conditionals and comments. Now throws if MaxPool on more than 3 spatial dimensions is requested.

* Fixed forward declare for GPURuntimeContext (class -> struct).

* Clang complains about missing braces on brace-initializer. Fixed implicit conversions.

* Fixed implicit conversions (clang).

* Reverting changes on autodiff test for maxpool. @Krovatkin will update later.

026bede0

12 Apr, 2018 6 commits

CPU: Fix element count calculation (#850) · dfae57c1
Jaikrishnan Menon authored Apr 12, 2018

dfae57c1

gpu slice (#843) · 041dd524

Fenglei authored Apr 12, 2018

* add slice op, first version

* change size to output size

* fix bugs

* working version

* using exist function for join and strides

* clang format

* revert accidental change

041dd524

RecurrentGraphRewrite + tests (#833) · b14d5665

Nick Korovaiko authored Apr 12, 2018

* add a getter for root node

* recurrent graph rewrite

* fix perms, rename match_root -> get_match_root

* fix comp errors

* make match_root return the topmost match; fix tests

b14d5665

gpu convolution support nd(n<4) (#824) · b9b7845c

Fenglei authored Apr 12, 2018

* add convolution in progress

* enable 1 test

* convolution in progress

* use filter descripter

* filter discreptor bug fix

* tensor format

* add missed dimension calculator

* forward convolution 4d without dilation and padding working

* data dilation(deconvolution) and enable some test

* add backprop convolution data and filter

* backprop can compile

* pass unit test, but still have problem on padding

* 2d, symmtric padding, no data dilation works now

* clean up code

* extend gpu convolution to nd

* fix some bugs

* working version for upto 3d convolution, code format.

* remove nunecessary changes

* add restriction for data dilation and asymmetric padding

* clang format

* support upto 3D convolution for now

* change comments to not implemented

* change comments to not implemented

* add quary for additional GPU workspace for convolution

* clang format

* code format

* using row_major_strides

* using join

* fix bug for join

* refactor dimension calculation

b9b7845c

[Py] Enable ngraph-cpp ops in Python API (#820) · 9ffb5145
tsocha authored Apr 12, 2018
```
* Enable BatchNorm op

* Enable function call op

* Enable get output element op
```
9ffb5145
CPU: Eliminate slices (#849) · eec19220
Jaikrishnan Menon authored Apr 12, 2018

eec19220

10 Apr, 2018 6 commits

Use new backend API in graph partition (#844) · 6e1b6058
Yixing Lao authored Apr 10, 2018
```
* new backend API in graph partition

* update API
```
6e1b6058
remove old branch references (#840) · da6cf5cb
Matthew Brookhart authored Apr 10, 2018

da6cf5cb

Zero Dimension Tensor Elimination (#617) · 2d75f665

Nick Korovaiko authored Apr 10, 2018

*  zero dimension tensor elimination init

* more ops + refactor + tests

* revert pattern.cpp

* add internal zero-length test

* address Scott's feedback

* fix comp errors

* proper static init

* get rid of unique-ptr

* refactor hashmap into virtual get_default_values on op classes

* fix formatting

2d75f665

back out api change (#842) · 96604f12
Robert Kimball authored Apr 10, 2018
```
* back out api change
```
96604f12

Remove the no longer supported alternative installation method for (#831) · db788de8

Sang Ik Lee authored Apr 10, 2018

* Remove the no longer supported alternative installation method for
python binding.

* Put back CMakeLists.txt as it is used by travis ci dockerfile.

* Remove python/CMakeLists.txt and update Travis CI

db788de8

Optimize 4D Reshape (#836) · b29f7220

Jaikrishnan Menon authored Apr 10, 2018

* CPU: Optimize 4D "nGraph" Reshapes (shuffle+reshape)

* CPU: Add kernel sources

* CPU: Replace 2D with 3D reshape

* CPU: Fixes

* CPU: Simplify

b29f7220

09 Apr, 2018 10 commits

remove parameter check from Function::get_ops() (#834) · 877ac969
Robert Kimball authored Apr 09, 2018
```
* remove parameter check from Function::get_ops()

* create validate pass to hold parameter validation
```
877ac969

Becky/enable more python gpu tests (#830) · e5c3769d

raramer01 authored Apr 09, 2018

* unskipping passing gpu tests

* skipping failing gpu tests

* import pytest as needed

* fix style issues

* unskip passing test

* add additional skip reason, unable to compile

e5c3769d

Repackaging match_recurring_pattern into RecurrentMatcher (#832) · 10ef07e6

Nick Korovaiko authored Apr 09, 2018

* repacking recurrent matching as a standalone class

* RecurrentMatcher

* add a getter for root node

* address Scott's feedback

10ef07e6

Editing so far for review and feedback (#813) · a2ab7b50

L.S. Cook authored Apr 09, 2018

* WIP editing so far for review and feedback

* Add missing env var export for neon install new process

* Add modified venv setup for TF

* More edits for FW integration and landpage

* Revise from PR feedback

* More PR feedback and editing for clarity

* Minor rewording, clearer explanation

* Final pass edit

* more editing

a2ab7b50

add GPU backend support for contrib/docker make process (#814) · 24afb41e

DawnStone authored Apr 09, 2018

* adding support for GPU backend to contrib/docker

added gpu dockerfiles

renamed Dockerfile for centos74

fixed NGRAPH_GPU_ENABLE cmake flag name

* Check for GPU support on the host system and fall back to CPU if not present

* removed double option for PREBUILT_LLVM

* updated README.md with additional references for GPU support

* added clarifying comments

cleaned up duplicate settings

* removed deprecated targets from the contrib/docker/Makefile

* resolved absolute vs. conditional assignment for variables based on reference OS

* removed example using a custom DOCKERFILE from README file

24afb41e

New backend/transformer API (#739) · 777600c6

Robert Kimball authored Apr 09, 2018

* force backend compile() to make a copy of the graph

fix copy_with_new_args on ops that have function pointers internal

update unit test for new backend API

add unit test for multiple simulataneous backends

* move get_subdevices virtual method to Manager class

* update GPU to latest

* update call methods

* add remove_compiled_function()

777600c6

Merge pull request #818 from NervanaSystems/tsocha/tox-update · ca4a83ea
Michał Karzyński authored Apr 09, 2018
```
[Py] Change Python version for tox
```
ca4a83ea
[Py] Tox hotfix, Update python version for tox · 545dc0b6
Tomasz Socha authored Apr 09, 2018

545dc0b6
Use less complex pass base where possible (#829) · 76047c77
Robert Kimball authored Apr 09, 2018

76047c77

Fuse zero-padded convolution backprop filters (#828) · 81c0ef79

Jaikrishnan Menon authored Apr 09, 2018

* CPU: Fuse zero-padded convolution backprop filters

* CPU: Add a testcase for zero-padded convolution backprop filters fusion

81c0ef79

08 Apr, 2018 1 commit
- start kahan summation with 0 instead of 1e-8 (#835) · 1adb84a1
  Matthew Brookhart authored Apr 08, 2018
  
  1adb84a1
06 Apr, 2018 2 commits

Support for Recurring Patterns (#782) · a8cd0e94

Nick Korovaiko authored Apr 06, 2018

* initial support for recurring matching

* fix a bug where patterns weren't populated w/ matched nodes; add recurrent tests

* add a missing newline

* address feedback

* fix function comment

a8cd0e94

[Py] Python nGraph operations wrappers. (#821) · fa6c2a60

arogowie-intel authored Apr 06, 2018

* Add/update Python wrappers for nGraph operations.

- NotEqual, OneHot, Power, Sqrt, Relu, Sign, Sin, Sinh, Tan, Subtract, Select, Tanh, Sum, Reduce,
Softmax, ReplaceSlice, Reverse
- Add UT for Relu, Sign, Sin, Sinh, Sqrt, Tan, Tanh,

* Add UT for cases when Cos and Sin are giving incorrect results.

* Alphabetically sorted imports.

* Small refactoring.

- Update docstrings
- Remove unnecesary auxiliary local variable.

fa6c2a60