Commits · e7abc0f3520f083b0267529446df9ff4f584954e · submodule / ngraph

14 Mar, 2018 1 commit

Yet another serialization option (#619) · 28602f31

Robert Kimball authored 6 years ago

* Add cpio file read/write class and unit tests

add reserializer

Add unit test for serialize constants to cpio file. Fix bug in serializer if function has no parameters.

28602f31

05 Mar, 2018 1 commit

Include cleanup (#583) · cec89708

Robert Kimball authored 6 years ago

* cleanup

* cleanup

* fix all headers to be standalone as far as includes go

* include cleanup

* cleanup includes

* cleanup

* include tester

* wip

* cleanup

* cleanup

* cleanup

cec89708

28 Feb, 2018 1 commit
- Move json.hpp out of ngraph source directory (#549) · aeee2039
  Robert Kimball authored 6 years ago
```
* make json lib an external project
* add env var to turn warnings to errors
```
  aeee2039
27 Feb, 2018 1 commit
- nvrtc name in cmake, enable mkldnn only for CPU · c082fe1e
  fenglei.tian authored 6 years ago
  
  c082fe1e
26 Feb, 2018 1 commit

Initial support for hybrid transformer (#526) · 7f08b97b

Yixing Lao authored 6 years ago

* initial support for hybrid transformer

* add broadcast_vector_rowwise_reversed for hybrid test

* headerc

* get function placement fix

* conv ref test generator graph node in labmda fuction

* rename map_parameter_to_source_node

* type change map_parameter_to_source_node

* use interpreter for numerical derivative

* better comments

7f08b97b

21 Feb, 2018 1 commit

moving Relu op form Argon backend with CoreFusion (#489) · 4e29c153

Sandeep authored 6 years ago

* relu for interpreter

* relu in serializer

* core fusion

* relu backprop

* relu backprop and test interpreter

* core fusion for CPU

* COREFusion -> CoreFusion

* relu MKL dnn

4e29c153

20 Feb, 2018 1 commit
- refactor benchmark util · 348dd27f
  Ashok Emani authored 6 years ago
  
  348dd27f
14 Feb, 2018 3 commits
- skip tests for GPU · 2a89f8b4
  fenglei.tian authored 6 years ago
  
  2a89f8b4
- Allow caching of external dependencies (everything but TBB, which I can't figure out yet) (#473) · 2fe7f0f3
  Adam Procter authored 6 years ago
  
  2fe7f0f3
- add AllReduce op and MPI support (#425) · b9c5b9d3
  Sevin F. Varoglu authored 6 years ago
```
- enable distributed ngraph (MPI)
- add AllReduce op to ngraph core, interpreter and CPU backend
- add AllReduce unit test
```
  b9c5b9d3
13 Feb, 2018 3 commits
- disable gpu tests for now, since most will be fail · a6d78dd7
  fenglei.tian authored 6 years ago
  
  a6d78dd7
- cleanup code · 2db7022e
  fenglei.tian authored 6 years ago
  
  2db7022e
- cleanup code · f7d97aa1
  fenglei.tian authored 6 years ago
  
  f7d97aa1
09 Feb, 2018 5 commits
- GPU kernels for reshape, GEMM, EW ADD/Mult, Maximum (#440) · da50410b
  Tristan Webb authored 6 years ago
```
* GPU kernels for reshape, GEMM, EW ADD/Mult, Maximum

(A + B) * C test now with cuBLAS
Additional gemm and gemv calls
cmake updates for cuDNN calls
memcpy wrappers in gpu_util

Additional passing tests:
aliased outputs, parameter, constant tensor memcopy
```
  da50410b
- Remove execute permissions from non-executable files (#474) · e054366e
  Adam Procter authored 6 years ago
  
  e054366e
- Fix pep8 warning in copyright · c7a3a76b
  Jennifer Myers authored 6 years ago
  
  c7a3a76b
- add rentime cuda kernel compile · e63322d9
  fenglei.tian authored 6 years ago
  
  e63322d9
- ensures the argon repositoy is present (#471) · 3f3c580e
  Sandeep authored 6 years ago
  
  3f3c580e
08 Feb, 2018 2 commits
- Add LICENSE and switch to Intel Copyright (#466) · d9a9d2d7
  Jennifer Myers authored 6 years ago
  
  d9a9d2d7
- enable additional argon backend tests (#465) · 2712d6f6
  Sandeep authored 6 years ago
  
  2712d6f6
07 Feb, 2018 1 commit

CPU backprop tests (#456) · 7a7e27d7

Adam Procter authored 6 years ago

* Enable CPU backprop tests

* Fix to dot codegen for cases where n_reduction_axes != 1

7a7e27d7

06 Feb, 2018 1 commit
- argon fusion test for Relu (#447) · c3364269
  Sandeep authored 6 years ago
```
* test relu fusion for argon backend and enable permutation over max op
```
  c3364269
05 Feb, 2018 1 commit

inline_function_call (#439) · bef56921

Nick Korovaiko authored 6 years ago

inline

Inliner pass + tests

debugging

fix inliner failures due to the fact a random function is picked as an outermost one

copyright headers

bef56921

02 Feb, 2018 1 commit

GPU kernels for reshape, GEMM, EW ADD/Mult, Maximum · 1f6284ff

Tristan Webb authored 6 years ago

GPU ew add and mult cuBLAS calls

GPU (A + B) * C with cuBLAS

Additional gemm and gemv calls

cmake updates for cuDNN calls

kernels WIP

params for dot gemm

more kernel WIP

memcpy wrappers

aliased outputs, parameter, constant tensor memcopy

comment cleanup

remove cruft

gpu faster gemm

MNIST WIP

Cleanup

1f6284ff

01 Feb, 2018 1 commit

Reshape Transformations + Simplification pass (#427) · f5930d37

Nick Korovaiko authored 6 years ago

* simplification pass

* serializer change to test models

* some small test fixes

* addressing Scott's feedback

* missed one nn

* formatting fixes

* simplification -> reshape_elimination

f5930d37

30 Jan, 2018 1 commit

fuse dot(a,b) + c (#418) · ea29c6e3

Nick Korovaiko authored 6 years ago

cblas_gemm working on mlp

rebase & small fixes

enable debug output

support replacing function's outputs

productizing CPUFusion

addressing Bob and Jayaram's feedback

removing json used for simplification tests

adding comments

fixing formatting errors and removing dead code

TODO msg

removing serializer changes

ea29c6e3

24 Jan, 2018 1 commit

Drwebb/gpu backend dot op (#413) · 94d80ffa

Tristan Webb authored 7 years ago

* Drwebb/gpu backend dot op (#387)

* GPU Dot prod emitter switch statement

* cuBLAS dot kernel call

* Flush out arg substitution into gpu dot kernel call

* Drwebb/gpu backend dot op (#392)

* Take in CodeWriter into gpu op emitters

* Introduce GPU function gen based on pass functions

* Additional gpu emitter stubs

* link cublas in to unit test and ngraph

* Use static code gen methods for GPU, add new GPU op stubs

* use pass manager to declare functions / cublas Updates

* Prune down gpu_external_function wip

* Switch back to GPU tensor views in GPU backend

* Pass in cublas handle to GPU external function

* cuMalloc memory in gpu tensor view

* Use cuda runtime malloc and free for tensor view managment c

* change GPU tensor view init, and use GPU tensor view for GPU call frame

* include headers as system dirs

* GPU tensor printing utility function

* cublasSetPointer to device mode / Fix copyright notification lowercasing

* Passing GPU dot product test using cuBLAS

Clean up

* Changes from review

94d80ffa

19 Jan, 2018 1 commit

Add flag to enable memory sanitizer (#393) · 0f836183

Robert Kimball authored 7 years ago

* cleanup in-memory header files

* add switch to enable memory sanitizer (works like valgrind)

* removed header file cleanup as it was causing a segfault on program termination

0f836183

11 Jan, 2018 2 commits
- add interpreter nan check option (#368) · 74850150
  Robert Kimball authored 7 years ago
```
* add interpreter nan check option

* add unit test
```
  74850150
- Better error message from runtime::Manager. · a2d97200
  Christian Convey authored 7 years ago
  
  a2d97200
09 Jan, 2018 1 commit

Remove an optimization for caching a list of ordered ops (#360) · 7e89f1bb

Nick Korovaiko authored 7 years ago

* remove caching of ordered_ops

* graph_util logging msgs

* small cleanup

* remove files for the TopologicalSort pass

* remove NGRAPH_DEBUG from graph_util.hpp

7e89f1bb

05 Jan, 2018 1 commit

Drwebb/gpu runtime boilerplate (#314) · feab44b5

Tristan Webb authored 7 years ago

* Simple boilerplate for GPU runtime files

  - GPUBackend
  - GPU ExternalFunction
  - GPUManager
  - GPUCallFrame

* Test for construction all GPU runtime classes

* Comment out calls, constructors haven't been defined

* Clang CUDA source example to later test compiling

Clang cuda example from:
https://gist.github.com/anonymous/855e277884eb6b388cd2f00d956c2fd4

* Initial nvptx compiler copied from CPU compiler sources

* Define FunctionMap and Instruction for gpu external function

* Rename Compiler -> NVPTXCompiler for gpu compile. Add call to compile for test

* Rename StaticCompiler -> NVPTXStaticCompiler for GPU code gen

* CAdd nvptx_compiler and nvptx_execution_engine to gpu sources

* Compiling source unit test using hardcoded PTX

* (a+b)*c test for GPU

* WIP Fix compile

* rmed accidentally included file

* Fix compile, and LLVM link errosr from nvptx_compiler.cpp

* Stub out parts needed for GPU manager

* Test GPU runtime method stubs

* Cleanup

* Add GPU runtime to same cmake block as GPU, include CUDA headers if GPU enabled

* Kill reflexive assertion

* change GPU naming convention to match CPU

* Snake case functions and identifiers in test case

* Change element type to match changes in master

* Make CUDA headers accessible for codegen with GPU transformer

* clang-format

* apply-code-format

feab44b5

29 Dec, 2017 1 commit

Get value types out of public API, multi-values from Function (#340) · d092cb91

Scott Cyphers authored 7 years ago

* Function can have multiple results
Remove external use of ValueType, TupleType, Tuple
Remove many external uses of Output and Input

* corresponding CPU backend changes

* Update master changes.

* Remove type arg from Function, add changes.md

* Merge changes.

* Move bodies to .cpp, add brief doc

* Merge CPU changes.

* Remove xla includes from non-xla files

* Remove xla from tests

* First part of xla tuple support

* change fprop_cache to assume multi-output bprop functions

* New wrappers for handling tuples with XLA

* Review comments

* remove old xla files

* fix merge errors

* hand edit models to use multi output instead of tuples

d092cb91

28 Dec, 2017 1 commit
- support build from ngraph repo with argon as external · 1c5abc19
  Yixing Lao authored 7 years ago
  
  1c5abc19
21 Dec, 2017 2 commits

Remove NGVM from src (#330) · 15959e73
Robert Kimball authored 7 years ago
```
* remove ngvm

* remove NGVM from cmake
```
15959e73

Fix autodiff uninitialized data error (#329) · 3269387e

Robert Kimball authored 7 years ago

* fix autodiff on non-NGVM backends. NGVM initializes all tensors to zero on allocation while the other backends do not. Had to initialize vector before use.

* change autodiff tests to use INTERPRETER

3269387e

18 Dec, 2017 1 commit

Convolution forward prop (#294) · 122db5ff

Adam Procter authored 7 years ago

* Test GitHub-JIRA integration, nothing useful in this commit

NGTF-388 #comment Testing JIRA integration

* WIP on convolution

* Type checking for convolution

* Docstrings for convolution

* Add convolution reference kernel; it works on some unit tests copied and pasted from my old branch.

* Bugfix for dilated conv, and improvement to conv test generation

* Remove get_arguments calls from convolution stuff

* Add convolution to CPU; also a few fixes to the test generation stuff

* Add copyright header to convolution ref script

* Move copyright header to the correct place

* A few more tests

* Remove fallback behavior of blanking out the convolution ref file, since we're not generating it from the build system anymore

* Delete stale comment

* Merge stuff for the convolution ref script

* Clean up rebase mess

* Review comments

* Review comment (n_foo -> foo_count)

122db5ff

13 Dec, 2017 1 commit
- Codegen for >2D concat following ref kernel pattern (#296) · fdab16db
  Adam Procter authored 7 years ago
  
  fdab16db
12 Dec, 2017 1 commit
- MNIST MLP benchmark test · 0014de5f
  Robert Kimball authored 7 years ago
```
LSTM benchmark test

performance counters
```
  0014de5f
05 Dec, 2017 1 commit

New Interpreter backend (#287) · 025a1b92

Robert Kimball authored 7 years ago

* New Interpreter backend

* PR review comments

* More RP fixes

* oops

* make autodiff tests backend aware

* wip

* wip

* more ops

* wip

* fix merge error

* merge fixes

025a1b92