Commits · ab810bb5c47c222498ba91bacf183158981343e1 · submodule / ngraph · GitLab

26 Feb, 2018 1 commit

Initial support for hybrid transformer (#526) · 7f08b97b

Yixing Lao authored 6 years ago

* initial support for hybrid transformer

* add broadcast_vector_rowwise_reversed for hybrid test

* headerc

* get function placement fix

* conv ref test generator graph node in labmda fuction

* rename map_parameter_to_source_node

* type change map_parameter_to_source_node

* use interpreter for numerical derivative

* better comments

7f08b97b

21 Feb, 2018 1 commit

moving Relu op form Argon backend with CoreFusion (#489) · 4e29c153

Sandeep authored 6 years ago

* relu for interpreter

* relu in serializer

* core fusion

* relu backprop

* relu backprop and test interpreter

* core fusion for CPU

* COREFusion -> CoreFusion

* relu MKL dnn

4e29c153

20 Feb, 2018 1 commit
- refactor benchmark util · 348dd27f
  Ashok Emani authored 6 years ago
  
  348dd27f
14 Feb, 2018 3 commits
- skip tests for GPU · 2a89f8b4
  fenglei.tian authored 6 years ago
  
  2a89f8b4
- Allow caching of external dependencies (everything but TBB, which I can't figure out yet) (#473) · 2fe7f0f3
  Adam Procter authored 6 years ago
  
  Unverified
  
  2fe7f0f3
- add AllReduce op and MPI support (#425) · b9c5b9d3
  Sevin F. Varoglu authored 6 years ago
```
- enable distributed ngraph (MPI)
- add AllReduce op to ngraph core, interpreter and CPU backend
- add AllReduce unit test
```
  Unverified
  
  b9c5b9d3
13 Feb, 2018 3 commits
- disable gpu tests for now, since most will be fail · a6d78dd7
  fenglei.tian authored 6 years ago
  
  a6d78dd7
- cleanup code · 2db7022e
  fenglei.tian authored 6 years ago
  
  2db7022e
- cleanup code · f7d97aa1
  fenglei.tian authored 6 years ago
  
  f7d97aa1
09 Feb, 2018 4 commits
- GPU kernels for reshape, GEMM, EW ADD/Mult, Maximum (#440) · da50410b
  Tristan Webb authored 6 years ago
```
* GPU kernels for reshape, GEMM, EW ADD/Mult, Maximum

(A + B) * C test now with cuBLAS
Additional gemm and gemv calls
cmake updates for cuDNN calls
memcpy wrappers in gpu_util

Additional passing tests:
aliased outputs, parameter, constant tensor memcopy
```
  Unverified
  
  da50410b
- Remove execute permissions from non-executable files (#474) · e054366e
  Adam Procter authored 6 years ago
  
  e054366e
- Fix pep8 warning in copyright · c7a3a76b
  Jennifer Myers authored 6 years ago
  
  c7a3a76b
- ensures the argon repositoy is present (#471) · 3f3c580e
  Sandeep authored 6 years ago
  
  Unverified
  
  3f3c580e
08 Feb, 2018 2 commits
- Add LICENSE and switch to Intel Copyright (#466) · d9a9d2d7
  Jennifer Myers authored 6 years ago
  
  d9a9d2d7
- enable additional argon backend tests (#465) · 2712d6f6
  Sandeep authored 6 years ago
  
  Unverified
  
  2712d6f6
07 Feb, 2018 1 commit

CPU backprop tests (#456) · 7a7e27d7

Adam Procter authored 6 years ago

* Enable CPU backprop tests

* Fix to dot codegen for cases where n_reduction_axes != 1

7a7e27d7

06 Feb, 2018 1 commit
- argon fusion test for Relu (#447) · c3364269
  Sandeep authored 6 years ago
```
* test relu fusion for argon backend and enable permutation over max op
```
  Unverified
  
  c3364269
05 Feb, 2018 1 commit

inline_function_call (#439) · bef56921

Nick Korovaiko authored 6 years ago

inline

Inliner pass + tests

debugging

fix inliner failures due to the fact a random function is picked as an outermost one

copyright headers

bef56921

02 Feb, 2018 1 commit

GPU kernels for reshape, GEMM, EW ADD/Mult, Maximum · 1f6284ff

Tristan Webb authored 7 years ago

GPU ew add and mult cuBLAS calls

GPU (A + B) * C with cuBLAS

Additional gemm and gemv calls

cmake updates for cuDNN calls

kernels WIP

params for dot gemm

more kernel WIP

memcpy wrappers

aliased outputs, parameter, constant tensor memcopy

comment cleanup

remove cruft

gpu faster gemm

MNIST WIP

Cleanup

1f6284ff

01 Feb, 2018 1 commit

Reshape Transformations + Simplification pass (#427) · f5930d37

Nick Korovaiko authored 7 years ago

* simplification pass

* serializer change to test models

* some small test fixes

* addressing Scott's feedback

* missed one nn

* formatting fixes

* simplification -> reshape_elimination

f5930d37

30 Jan, 2018 1 commit

fuse dot(a,b) + c (#418) · ea29c6e3

Nick Korovaiko authored 7 years ago

cblas_gemm working on mlp

rebase & small fixes

enable debug output

support replacing function's outputs

productizing CPUFusion

addressing Bob and Jayaram's feedback

removing json used for simplification tests

adding comments

fixing formatting errors and removing dead code

TODO msg

removing serializer changes

ea29c6e3

24 Jan, 2018 1 commit

Drwebb/gpu backend dot op (#413) · 94d80ffa

Tristan Webb authored 7 years ago

* Drwebb/gpu backend dot op (#387)

* GPU Dot prod emitter switch statement

* cuBLAS dot kernel call

* Flush out arg substitution into gpu dot kernel call

* Drwebb/gpu backend dot op (#392)

* Take in CodeWriter into gpu op emitters

* Introduce GPU function gen based on pass functions

* Additional gpu emitter stubs

* link cublas in to unit test and ngraph

* Use static code gen methods for GPU, add new GPU op stubs

* use pass manager to declare functions / cublas Updates

* Prune down gpu_external_function wip

* Switch back to GPU tensor views in GPU backend

* Pass in cublas handle to GPU external function

* cuMalloc memory in gpu tensor view

* Use cuda runtime malloc and free for tensor view managment c

* change GPU tensor view init, and use GPU tensor view for GPU call frame

* include headers as system dirs

* GPU tensor printing utility function

* cublasSetPointer to device mode / Fix copyright notification lowercasing

* Passing GPU dot product test using cuBLAS

Clean up

* Changes from review

94d80ffa

19 Jan, 2018 1 commit

Add flag to enable memory sanitizer (#393) · 0f836183

Robert Kimball authored 7 years ago

* cleanup in-memory header files

* add switch to enable memory sanitizer (works like valgrind)

* removed header file cleanup as it was causing a segfault on program termination

0f836183

11 Jan, 2018 2 commits
- add interpreter nan check option (#368) · 74850150
  Robert Kimball authored 7 years ago
```
* add interpreter nan check option

* add unit test
```
  Unverified
  
  74850150
- Better error message from runtime::Manager. · a2d97200
  Christian Convey authored 7 years ago
  
  a2d97200
09 Jan, 2018 1 commit

Remove an optimization for caching a list of ordered ops (#360) · 7e89f1bb

Nick Korovaiko authored 7 years ago

* remove caching of ordered_ops

* graph_util logging msgs

* small cleanup

* remove files for the TopologicalSort pass

* remove NGRAPH_DEBUG from graph_util.hpp

7e89f1bb

05 Jan, 2018 1 commit

Drwebb/gpu runtime boilerplate (#314) · feab44b5

Tristan Webb authored 7 years ago

* Simple boilerplate for GPU runtime files

  - GPUBackend
  - GPU ExternalFunction
  - GPUManager
  - GPUCallFrame

* Test for construction all GPU runtime classes

* Comment out calls, constructors haven't been defined

* Clang CUDA source example to later test compiling

Clang cuda example from:
https://gist.github.com/anonymous/855e277884eb6b388cd2f00d956c2fd4

* Initial nvptx compiler copied from CPU compiler sources

* Define FunctionMap and Instruction for gpu external function

* Rename Compiler -> NVPTXCompiler for gpu compile. Add call to compile for test

* Rename StaticCompiler -> NVPTXStaticCompiler for GPU code gen

* CAdd nvptx_compiler and nvptx_execution_engine to gpu sources

* Compiling source unit test using hardcoded PTX

* (a+b)*c test for GPU

* WIP Fix compile

* rmed accidentally included file

* Fix compile, and LLVM link errosr from nvptx_compiler.cpp

* Stub out parts needed for GPU manager

* Test GPU runtime method stubs

* Cleanup

* Add GPU runtime to same cmake block as GPU, include CUDA headers if GPU enabled

* Kill reflexive assertion

* change GPU naming convention to match CPU

* Snake case functions and identifiers in test case

* Change element type to match changes in master

* Make CUDA headers accessible for codegen with GPU transformer

* clang-format

* apply-code-format

feab44b5

29 Dec, 2017 1 commit

Get value types out of public API, multi-values from Function (#340) · d092cb91

Scott Cyphers authored 7 years ago

* Function can have multiple results
Remove external use of ValueType, TupleType, Tuple
Remove many external uses of Output and Input

* corresponding CPU backend changes

* Update master changes.

* Remove type arg from Function, add changes.md

* Merge changes.

* Move bodies to .cpp, add brief doc

* Merge CPU changes.

* Remove xla includes from non-xla files

* Remove xla from tests

* First part of xla tuple support

* change fprop_cache to assume multi-output bprop functions

* New wrappers for handling tuples with XLA

* Review comments

* remove old xla files

* fix merge errors

* hand edit models to use multi output instead of tuples

d092cb91

28 Dec, 2017 1 commit
- support build from ngraph repo with argon as external · 1c5abc19
  Yixing Lao authored 7 years ago
  
  1c5abc19
21 Dec, 2017 2 commits

Remove NGVM from src (#330) · 15959e73
Robert Kimball authored 7 years ago
```
* remove ngvm

* remove NGVM from cmake
```
15959e73

Fix autodiff uninitialized data error (#329) · 3269387e

Robert Kimball authored 7 years ago

* fix autodiff on non-NGVM backends. NGVM initializes all tensors to zero on allocation while the other backends do not. Had to initialize vector before use.

* change autodiff tests to use INTERPRETER

3269387e

18 Dec, 2017 1 commit

Convolution forward prop (#294) · 122db5ff

Adam Procter authored 7 years ago

* Test GitHub-JIRA integration, nothing useful in this commit

NGTF-388 #comment Testing JIRA integration

* WIP on convolution

* Type checking for convolution

* Docstrings for convolution

* Add convolution reference kernel; it works on some unit tests copied and pasted from my old branch.

* Bugfix for dilated conv, and improvement to conv test generation

* Remove get_arguments calls from convolution stuff

* Add convolution to CPU; also a few fixes to the test generation stuff

* Add copyright header to convolution ref script

* Move copyright header to the correct place

* A few more tests

* Remove fallback behavior of blanking out the convolution ref file, since we're not generating it from the build system anymore

* Delete stale comment

* Merge stuff for the convolution ref script

* Clean up rebase mess

* Review comments

* Review comment (n_foo -> foo_count)

122db5ff

13 Dec, 2017 1 commit
- Codegen for >2D concat following ref kernel pattern (#296) · fdab16db
  Adam Procter authored 7 years ago
  
  Unverified
  
  fdab16db
12 Dec, 2017 1 commit
- MNIST MLP benchmark test · 0014de5f
  Robert Kimball authored 7 years ago
```
LSTM benchmark test

performance counters
```
  0014de5f
05 Dec, 2017 1 commit

New Interpreter backend (#287) · 025a1b92

Robert Kimball authored 7 years ago

* New Interpreter backend

* PR review comments

* More RP fixes

* oops

* make autodiff tests backend aware

* wip

* wip

* more ops

* wip

* fix merge error

* merge fixes

025a1b92

04 Dec, 2017 1 commit

Finish de-Eigenization (#282) · 7b305e3e

Adam Procter authored 7 years ago

* Simpler kernel for broadcast

* Fixed behavior for integer divide-by-zero, added unit tests

* Strided and higher-dimensional slice (just tested to 3D)

* Higher-dimensional sum

* Replace-slice de-Eigenized; NOT TESTED AT HIGHER DIMENSIONS YET

* Correct sum behavior when eliminating zero-length axes; add unit tests; also, add higher-dim unit tests for replace-slice

* Higher-dimensional reduce, 'cause hey, why not?

* Remove BroadcastScalarInstruction

* Adding test for an observed failure at trivial sum on 5-tensors

* De-Eigenized and higher-dimmified concat

* Replace 'auto' in the kernels

* temporary delete to ease merge

* Re-insert tests that were deleted to ease merge

* Refactor view-iteration

* De-Eigenize reshape

* Rework divide kernel to use std::enable_if to distinguish between floating and non-floating types

* Update docs to reflect newly implemented cases in several ops

* Rename parameters to View for more clarity; remove axis_walk_order (it's redundant)

* Formatting

* More terminological rejiggering

* De-Eigenize scalar-tensor product

* De-Eigenize dot

* Update docstrings

* Remove 'implementation status' tables from docstrings

* Change step -> strides everywhere for consistent terminology

* Formatting

* Replace asserts in view.cpp with exceptions

* Fix typo

* Fix incorrect result type in dot1d test (ouch...)

* Add missing support for Float64 to ngvm/external_function

* Add int16 and uint16 (how was this missing?)

* A few more additions relative to the missing element types

* Disable tests that will not pass on CPU; they can still be run with test/unit-test --gtest_also_run_disabled_tests --gtest_filter='DISABLED_NGVM.*'

* Move project_ and inject_ functions to common.[ch]pp, not view.[ch]pp

* Rename View to CoordinateTransform

* Add prefix ++ and += to CoordinateIterator

7b305e3e

30 Nov, 2017 2 commits
- Adding Numpy Style Transpose to the Builder (#271) · d4153c91
  Matthew Brookhart authored 7 years ago
```
* Add numpy_transpose to the builder for numpy-stype transpose operations

* fix docstring

* make sure to throw the error
```
  Unverified
  
  d4153c91
- De-Eigenize broadcast, and extend it to higher dimensions · b485bb33
  Adam Procter authored 7 years ago
  
  b485bb33
29 Nov, 2017 1 commit
- Adds autobroadcast builder. · 4630c37d
  Christian Convey authored 7 years ago
  
  4630c37d
28 Nov, 2017 1 commit

REBASE: graph pattern matcher half I/O half arguments/users (#269) · 3e68842b

Nick Korovaiko authored 7 years ago

* Start of pattern matcher

recursive graph matcher, pattern node

add matcher.cpp

add files for matcher, graph_rewrite

add const to on_match_class

fix comp errors

reshuffle pattern matching code across corresponding files

fix comment

run clang-format

graph_rewrite replace_node

getting simple test cases to work

op/pattern.cpp

toward graph_rewrite tests

older matcher API

before clean up tests

before rebase

build bbrks

more tests

clean up

more clean-up

more cleanup 2

more clean up 3

clean up 4

clang errors

clang errors2

apply code format

move match_class to matcher

major clean up after moving match_class to matcher.cpp

removing tracing changes

rebased as of 11/8

make matcher use i/o descs to traverse the graph; change replace_io

switching to io tds

graph_rewrite tests fail

all tests pass

formatting

unhandle outputs explicitly for now

reset permissions back to 0644; bad bad windows

fixes after rebase

* fixes

* addressing Scott's feedback

3e68842b