Commits · 83e7dba5b8b6120fdd14d4abdab7fe0b951e603e · submodule / ngraph

09 Jul, 2018 1 commit
- Cache functions so the backend does not need to recompile (#1209) · ffe3a631
  Robert Kimball authored 6 years ago
```
* Cache some generated functions in backwards tests to speed performance

* more caching
```
  ffe3a631
02 Jul, 2018 1 commit

move sigmoid to core fusion (#1132) · d05b5e39

Sandeep authored 6 years ago

* declare sigmoid for core fusion

* add simple test for sigmoid

* info fusion status

* cp op as main op

* builds as expected

* move sigmoid fusion code

* add reference kernel

* sigmoid bprop reference kernel and clang-format

* add delta to bprop

* fprop called

* compiles bprop

* move tests

* serializer support

* address comments in code

* add doc

* naming similar to core ops

* fix failing test

* fix failing test

* address clang issue

* more changes

* change test macro

d05b5e39

02 Jun, 2018 1 commit
- Floating point comparison with ULP, adding close_f and all_close_f (#1068) · b8e28555
  Yixing Lao authored 6 years ago
  
  b8e28555
18 May, 2018 1 commit

Enable reverse_sequence for Interpreter (#977) · cd59bfe4

Nick Korovaiko authored 6 years ago

* use reference kernel for reverse_sequence for int

* move tests

* resolve CI errors

* TEST to NGRAPH_TEST

cd59bfe4

10 May, 2018 2 commits

Move test_control to test lib (#989) · 6d01a3bf
Yixing Lao authored 6 years ago
```
* test_control in util
```
6d01a3bf

New manifest driven method for disabling backend unit tests (#983) · 44b75607

Robert Kimball authored 6 years ago

* Add mechanism for disabling specific backend unit tests from a manifest file.
Populate the test manifest files for CPU, GPU and INTERPRETER.

* update docs for new manifest controlled transformer unit tests

44b75607

09 May, 2018 1 commit

CUDNN and CUDA kernels for AvgPool (forward/backward) (#951) · b1b3d4d6

Chris Sullivan authored 6 years ago

* Added op::AvgPool cudnn impl. which works for 2-3 spatial dimesions and no/symmetric padding. Enabled tests.

* Added cuda-c implementation of average pool which handles 1-3 spatial
dimensions as well as asymmetric padding. This commit also introduces
several helper functions for performing fast integer division and
fast constant memory access.

* Formatting. Removed bool that was used for testing to force the cuda impl. over cudnn.

* Added CUDNN AvgPoolBackprop implementation.

* Removed inline enum in preference of a helper struct. Removed instances of multiple declarations on a single line. Updated comments.

* Removed _prefix to helper functions in anonymous namespace.

b1b3d4d6

01 May, 2018 1 commit
- Add autodiff for the arc trig ops (#935) · 0c3bc7d0
  Matthew Brookhart authored 6 years ago
  
  0c3bc7d0
24 Apr, 2018 1 commit
- Update to enable pass backend unit tests (#904) · 1eb9f9bf
  Robert Kimball authored 6 years ago
```
* get all ops working

* enable autodiff tests for IE backend
```
  1eb9f9bf
21 Apr, 2018 1 commit

Add Inference Engine (IE) backend (#883) · 3d590dea

Adam Straw authored 6 years ago

* ie backend and manager with passing unit tests except for select/function

* fix function_call and select

* simplify implemenation by removing support for convert and select

* remove manager

3d590dea

16 Apr, 2018 1 commit
- Fix element type for create_tensor of cached fprop nodes in backprop_derivative (#862) · aadc9ce4
  Adam Procter authored 6 years ago
  
  aadc9ce4
13 Apr, 2018 2 commits

Remove legacy Backend API (#848) · ec501913

Robert Kimball authored 6 years ago

* remove deprecated

* remove all legacy Backend API usage

remove deprecated files

* pull in changes from master

* fix GPU calls

* disable tests in convolution generator

* update per PR comments. Enable performance counter feature.

* update per PR comments

* fix build error

* fix conditionally compiled test :(

ec501913

Add GPURuntimeContext and GPUPrimitiveEmitter to the gpu transformer (#837) · 026bede0

Chris Sullivan authored 6 years ago

* Begin prototype of cudnn_emitter.

* Added GPURuntimeContext to gpu_external_function for passing through to JIT functions.

* gpu_emitters now utilize gpu runtime context.

* Moved cublas and cudnn handles into GPURuntimeContext pointer and out of callframe EntryPoint.

* Added CUDNNEmitter, comparable to MKLDNNEmitter,
which allows for cudnn kernels to be defined via
lambda primitives that are emitted and
subsequently called during graph execution.
An example implementation is provided for op::Sum.

* Added GPURuntimeContext to gpu_external_function for passing through to JIT functions.

* gpu_emitters now utilize gpu runtime context.

* Moved cublas and cudnn handles into GPURuntimeContext pointer and out of callframe EntryPoint.

* GPURuntimeContext should be stored as unique_ptr in external function.

* Extract raw pointer from unique for cudnn_emitter.

* Removing unrelated code from PR.

* GPURuntimeContext needs to be a strict C interface in case
the native compiler and clang are utilizing different glibc ABIs.
Updated to reflect this.

* Added cudnn::primitive typedef for better readability.

* Moved allocation of CudaFunctionPool to external function
so that it is available during gpu emission.

* Fixed too-late initialization of cudart.

* CUDNNEmitter moved into superset class GPUPrimitiveEmitter.
The GPUPrimitiveEmitter handles the emission of all gpu primitives,
including cudnn, cuda, and cublas. CUBLASEmitter support not yet included.

* Added unordered_map for cacheing primitives in the gpu_emitter.

* Added dtor to GPUPrimitiveEmitter to cleanup compiled functions.

* Adding back a serialized model graph that was accidentally rem* Added a few additional helpers to use ngraph::row_major_strides.

* added whitespace per @fengleitian's comment

* Remove implicit type conversions from size_t to int.

* Add op::MaxPool, op::MaxPoolBackprop and op::Pad to GPU transformer (#817)

* Added pooling for 1 and 2dimensions. 1d uses a cuda kernel and 2d utilizes cudnn.
Padding is not yet supported.

* Normalized call signature on gpu emission for 1d max pool. Added a few comments.

* Max pool backprop impl. inprogress. Amend this commit.

* Max pool backprop implemented. Note that cuDNN
requests the output tensor for the maxpool operation but it is not required for computation.

* Formatting and invokation for maxpool changed.

* Fixed too-late initialization of cudart.

* Added padding kernel that is used with maxpool. Need to investigate remaining tests.

* Changed dimensionality check to correctly
determine if data is 1d or not.

* Added 3d MaxPooling (forward), verified by forcing 2d case to use Nd pooling routines.

* Added 3d MaxPooling (backward), verified by forcing 2d case to use Nd pooling routines.

* Moved cudnn prologues for maxpool into ngraph runtime and out of primitive so
that the only execution occuring on the JIT runtime is the evaluation of the op kernel.

* Refactored forward and backward pooling into single CUDNNEmitter::build_pooling interface
with a runtime switch to determine if the op is forward or backward propagation.

* Cache preconstructed cudnn kernel for maxpool if it has already been constructed.

* Forgot to add padding arrays back into cudnn kernel for MaxPool in the 2d case.

* Fixed namespace issues and use join(...,'_')

* Refactored 4d/Nd tensor descriptor builder into single function.

* Changed conditionals and comments. Now throws if MaxPool on more than 3 spatial dimensions is requested.

* Fixed forward declare for GPURuntimeContext (class -> struct).

* Clang complains about missing braces on brace-initializer. Fixed implicit conversions.

* Fixed implicit conversions (clang).

* Reverting changes on autodiff test for maxpool. @Krovatkin will update later.

026bede0

04 Apr, 2018 1 commit

Support multi-output ops in Adjoints (#796) · 5f0e8dc3

Nick Korovaiko authored 6 years ago

* refactor Adjoints to support multi-output ops

* passing tests

* switch to generate_adjoints(deltas) and backprop_node

* remove debugging code

* fix error msg

* fix typo adjoitns

* fix comp errors in mnist_mlp

5f0e8dc3

26 Mar, 2018 1 commit

Registers NNP_TESTER backend for NNP Testing (#729) · 2e8c6286

Yixing Lao authored 6 years ago

* registers nnp tester

* check output size

* rename to nnp tester

* revert changes in graph_util

2e8c6286

21 Mar, 2018 2 commits
- CallFrame order (#702) · 12876342
  Yixing Lao authored 6 years ago
```
Adjust CallFrame argument order to match Function
```
  12876342
- Directory rename (#701) · 6b0b64b4
  Robert Kimball authored 6 years ago
```
* rename directories to be consistent
* rename reference namespace to match directory
```
  6b0b64b4
20 Mar, 2018 1 commit

rename to nnp (#688) · bb831262

Sandeep authored 6 years ago

* topolotical-sort based node clustering

* cmake builds

* Argon manager renamed to NNP along with placement

* nnp dir cmake changes

* tests pass

* more renames

* somemore renames

* reslove redefination

* revert to ARGON_API

* more PR comments and remove nnp-fusion tests as redundant

* update path

* fix format

bb831262

14 Mar, 2018 1 commit
- gpu add onehot op (#638) · a86a9050
  Fenglei authored 6 years ago
```
* add onehot op

* refactor broadcast and onehot op
```
  a86a9050
09 Mar, 2018 1 commit
- fix bug for 2d2d2 dot, enable some bprop dot tests · 9fd64b6f
  fenglei.tian authored 6 years ago
  
  9fd64b6f
08 Mar, 2018 3 commits

enable supported backward tests · dd5c77e0
fenglei.tian authored 6 years ago

dd5c77e0
add sign op, fix constant bug · dd5a6769
fenglei.tian authored 6 years ago

dd5a6769

GPU op::Result implementation (#611) · 905cafd2

Chris Sullivan authored 6 years ago

* Added GPU emitter for op::Result.
For now it simply copies the output tensor.

All but 3 tests now pass. The remaining
failing tests are:
* GPU.dot_0_0
* GPU.dot_matrix_2x0_0x2
* GPU.dot_2x0_0

* Removed call to handle memory aliasing in gpu_external_function.

* fix gpu emitter bug that will return in the middle of function

* Merge pull request #609 from NervanaSystems/tfl/fix_return_bug

fix gpu emitter bug that will return in the middle of function

* GPU backend skips added for recent softmax test and updated aliased output test that uses op::Constant.

905cafd2

02 Mar, 2018 1 commit
- add softmax op (#542) · 0c43f175
  adstraw authored 6 years ago
```
add softmax op and documentation
```
  0c43f175
27 Feb, 2018 1 commit
- Replace using aliases with actual classes (#428) · ec6b3eae
  Scott Cyphers authored 6 years ago
```
* Replace using aliases with actual classes
```
  ec6b3eae
26 Feb, 2018 1 commit

Initial support for hybrid transformer (#526) · 7f08b97b

Yixing Lao authored 6 years ago

* initial support for hybrid transformer

* add broadcast_vector_rowwise_reversed for hybrid test

* headerc

* get function placement fix

* conv ref test generator graph node in labmda fuction

* rename map_parameter_to_source_node

* type change map_parameter_to_source_node

* use interpreter for numerical derivative

* better comments

7f08b97b

23 Feb, 2018 1 commit
- Fixes NGMX-338: Adds option to AvgPool padding. · d2d0196b
  Christian Convey authored 7 years ago
  
  d2d0196b
21 Feb, 2018 3 commits
- update test and cmake · 790dcd6c
  fenglei.tian authored 7 years ago
  
  790dcd6c
- add skip gpu test micro for new tests · e40f9c50
  fenglei.tian authored 7 years ago
  
  e40f9c50
- moving Relu op form Argon backend with CoreFusion (#489) · 4e29c153
  Sandeep authored 7 years ago
```
* relu for interpreter

* relu in serializer

* core fusion

* relu backprop

* relu backprop and test interpreter

* core fusion for CPU

* COREFusion -> CoreFusion

* relu MKL dnn
```
  4e29c153
20 Feb, 2018 1 commit
- Addressed PR review comments · 67fb65b8
  pthoreho authored 7 years ago
  
  67fb65b8
16 Feb, 2018 2 commits
- style fix · c6672b3d
  pthoreho authored 7 years ago
  
  c6672b3d
- - Added test for max pooling to verify mkldnn maxpool implementation · 03f6c0ab
  pthoreho authored 7 years ago
```
- added workaround to attach the maxpool workspace for bprop delta propogation
```
  03f6c0ab
14 Feb, 2018 1 commit
- skip tests for GPU · 2a89f8b4
  fenglei.tian authored 7 years ago
  
  2a89f8b4
12 Feb, 2018 1 commit
- fix Shape declarations (#488) · 00fb503f
  Robert Kimball authored 7 years ago
```
* fix Shape declarations
```
  00fb503f
08 Feb, 2018 1 commit
- Add LICENSE and switch to Intel Copyright (#466) · d9a9d2d7
  Jennifer Myers authored 7 years ago
  
  d9a9d2d7
07 Feb, 2018 1 commit

CPU backprop tests (#456) · 7a7e27d7

Adam Procter authored 7 years ago

* Enable CPU backprop tests

* Fix to dot codegen for cases where n_reduction_axes != 1

7a7e27d7

06 Feb, 2018 1 commit

modify existing autodiff unit tests to test fprop cache (#354) · dfb88350

adstraw authored 7 years ago

* modify existing autodiff unit tests to test fprop cache

* cleanup

* fix compile error introduced with bad merge

* remove invalid negative/negative backwards power test

dfb88350

31 Jan, 2018 1 commit

Back Propagation for Average Pooling (#407) · b408a08e

Nick Korovaiko authored 7 years ago

* bprop for avg pool

remove debug statements + formatting

* fix CPU test failures

* numeric tests

* use make_shared; unprotect c-tor

b408a08e

23 Jan, 2018 1 commit

convolution backprop (#404) · 72a2ce72

adstraw authored 7 years ago

* fix convlution reference script

* convolution backprop

* cleanup

* fix build warnings

* Missing include

* fix build warning part 2

* move numeric_compare to its own header
code review feedback

* fix build warnings 3

* fix build warnings 4

* clang-format

* cast to avoid implicit cast warning

72a2ce72