Commits · 7e89f1bbd62ce8f28c6b6c4a22cf021ed86c967c · submodule / ngraph

09 Jan, 2018 3 commits

Remove an optimization for caching a list of ordered ops (#360) · 7e89f1bb

Nick Korovaiko authored Jan 09, 2018

* remove caching of ordered_ops

* graph_util logging msgs

* small cleanup

* remove files for the TopologicalSort pass

* remove NGRAPH_DEBUG from graph_util.hpp

7e89f1bb

Fixes minor bugs in XLA-specific code. (#361) · 8627c495
Christian Convey authored Jan 09, 2018

8627c495

Optimizations to reduce compile time (#357) · 7f3dc2d7

Robert Kimball authored Jan 09, 2018

* much faster compile time
* Remove all variables and just directly access inputs, output, and temps.
* compare layouts when checking if two ops are equal
* make performance counters available to all backends

7f3dc2d7

08 Jan, 2018 2 commits
- Definitions of XLA ConvNet MNIST ops (#324) · 524d04fc
  Adam Procter authored Jan 08, 2018
  
  524d04fc
- Optimize the Coordinate class to prevent copies (#358) · 686ee9ab
  Robert Kimball authored Jan 08, 2018
  
  686ee9ab
06 Jan, 2018 1 commit
- fix boolean ops to return the input element::type instead of float32 (#356) · 07ba1bef
  Matthew Brookhart authored Jan 06, 2018
  
  07ba1bef
05 Jan, 2018 4 commits

Zero padding for convolution (#352) · 8c4ae5ea
Adam Procter authored Jan 05, 2018

8c4ae5ea
Remove descriptor::Value and runtime::Value (#355) · 06f9efd9
Robert Kimball authored Jan 05, 2018
```
* general cleanup

* remove runtime::Value

* more cleanup

* more cleanup
```
06f9efd9
Remove unused args from Input (#353) · f4bb3e46
Robert Kimball authored Jan 05, 2018
```
* cleanup

* remove arg_index

* remove argno from Input

* uncleanup
```
f4bb3e46

Drwebb/gpu runtime boilerplate (#314) · feab44b5

Tristan Webb authored Jan 05, 2018

* Simple boilerplate for GPU runtime files

  - GPUBackend
  - GPU ExternalFunction
  - GPUManager
  - GPUCallFrame

* Test for construction all GPU runtime classes

* Comment out calls, constructors haven't been defined

* Clang CUDA source example to later test compiling

Clang cuda example from:
https://gist.github.com/anonymous/855e277884eb6b388cd2f00d956c2fd4

* Initial nvptx compiler copied from CPU compiler sources

* Define FunctionMap and Instruction for gpu external function

* Rename Compiler -> NVPTXCompiler for gpu compile. Add call to compile for test

* Rename StaticCompiler -> NVPTXStaticCompiler for GPU code gen

* CAdd nvptx_compiler and nvptx_execution_engine to gpu sources

* Compiling source unit test using hardcoded PTX

* (a+b)*c test for GPU

* WIP Fix compile

* rmed accidentally included file

* Fix compile, and LLVM link errosr from nvptx_compiler.cpp

* Stub out parts needed for GPU manager

* Test GPU runtime method stubs

* Cleanup

* Add GPU runtime to same cmake block as GPU, include CUDA headers if GPU enabled

* Kill reflexive assertion

* change GPU naming convention to match CPU

* Snake case functions and identifiers in test case

* Change element type to match changes in master

* Make CUDA headers accessible for codegen with GPU transformer

* clang-format

* apply-code-format

feab44b5

04 Jan, 2018 2 commits
- add missing ops to serializer (#351) · 2218cf9f
  Robert Kimball authored Jan 04, 2018
  
  2218cf9f
- prerequisites for html build during docs development (#349) · fe33af85
  DawnStone authored Jan 04, 2018
```
* updated the sphinx version using pip install in Dockerfile.ngraph_cpp

added a make target to build the docs to the contrib/docker/Makefile

* avoid upgrade pip message during build
```
  fe33af85
03 Jan, 2018 1 commit
- bump argon version (#348) · c6bfa697
  Yixing Lao authored Jan 03, 2018
  
  c6bfa697
02 Jan, 2018 1 commit
- Fix a logic bug introduced by #325 (#347) · 2f0a262e
  Matthew Brookhart authored Jan 02, 2018
  
  2f0a262e
30 Dec, 2017 2 commits

Forward prop for max pooling (#305) · d901282e

Adam Procter authored Dec 30, 2017

* Definition and type checking for max pool

* Implement kernel, integrate into INTERPRETER, add a few unit tests, make function result type mismatch error message more informative (still need to update tests to reflect that)

* Temporarily delete unit tests to ease merge

* Temporarily delete unit tests to ease merge

* Restore deleted unit tests

* Fix a broken error message check in the unit tests

* Update to handle various TensorViewType-related things going away; add NGVM support

* Add codegen case

* Change various get_blah_shape methods to return const refs, and while we're here, make a similar change where it should have been done in convolution

* Use NDArray for max-pool tests

d901282e

recreate ops (#325) · 66d06693

varun-intel authored Dec 30, 2017

* recreate ops

* style

* recompute ops

* style

* fix

* recreate ops

* style

* recompute ops

* style

* fix

* some

* more

* style

* remove a line

* const

* style

* NodeMap was using non-standard operator[] behavior.

* Missing include

66d06693

29 Dec, 2017 2 commits

Get value types out of public API, multi-values from Function (#340) · d092cb91

Scott Cyphers authored Dec 29, 2017

* Function can have multiple results
Remove external use of ValueType, TupleType, Tuple
Remove many external uses of Output and Input

* corresponding CPU backend changes

* Update master changes.

* Remove type arg from Function, add changes.md

* Merge changes.

* Move bodies to .cpp, add brief doc

* Merge CPU changes.

* Remove xla includes from non-xla files

* Remove xla from tests

* First part of xla tuple support

* change fprop_cache to assume multi-output bprop functions

* New wrappers for handling tuples with XLA

* Review comments

* remove old xla files

* fix merge errors

* hand edit models to use multi output instead of tuples

d092cb91

Remove LLVM/Clang dependency in headers (#341) · 7c59ca2e
Yixing Lao authored Dec 29, 2017
```
* remove llvm/clang dependency in headers

* copy elision
```
7c59ca2e

28 Dec, 2017 6 commits

support build from ngraph repo with argon as external · 1c5abc19
Yixing Lao authored Dec 27, 2017

1c5abc19
Add bigger models to performance benchmarks (#342) · 2d2fc8c2
Robert Kimball authored Dec 28, 2017
```
* add larger test models
```
2d2fc8c2
Move header resource to .rodata (#344) · 19a10d79
Jai Menon authored Dec 28, 2017
```
This avoids bloating .data and clears the path
for code model fixes later
```
19a10d79
Rewrite the way constants are emitted in the CPU backend (#332) · 603a7d1a
Robert Kimball authored Dec 28, 2017
```
* wip

* constants as globals

* const emitter rewrite
```
603a7d1a

Build and execute TBB flow graphs in the CPU backend (#304) · c2c33748

Jai Menon authored Dec 28, 2017

* CMake: TBB integration placeholder

* CMake: Integrate TBB

* CMake: Indent

* CMake: Rewrite TBB integration

* CMake: More TBB integration changes

* CMake: Install TBB headers and DSOs

* CMake: Don't install the TBB debug DSO

* CMake: Propagate ngraph's configured compiler setting over to MKL-DNN

* CMake: Restore TBB debug DSO installation

* CMake: Add installed headers to search path.
This needs to be cleaned up along with other header search cleanup

* CPU: Build and execute TBB flowgraphs

* CPU: TBB fixes

* CPU: More TBB fixes

* CPU: Allow both TBB and serial codegen for now

* TBB: get_arguments -> get_input_ops

* CPU: Use node methods

* CPU: Add TBB headers in the build directory to the search path

* TBB: Incorporate various changes from master

* CMake: Indentation fix

* CMake: Indentation fix

* CMake: TBB is mandatory so remove additional predicates

* TBB: Add a test

* CMake: Fix linker flags with GCC

c2c33748

Fprop Cache Util Function (#312) · bc63f7bb

Matthew Brookhart authored Dec 28, 2017

* in progress

* working cache_fprop, no tests

* style fix

* all inputs to bprop (except adjoints) are cached from fprop

* fix typos, make sure to check count == 0

* fix code format

bc63f7bb

27 Dec, 2017 5 commits
- Bob/benchmark cleanup (#338) · 8f3da6b8
  Robert Kimball authored Dec 27, 2017
```
* cleanup

* cleanup

* expand

* wip

* undo
```
  8f3da6b8
- Add ReplaceSlice serialization (#339) · ff8a2008
  Robert Kimball authored Dec 27, 2017
  
  ff8a2008
- enable -O3 optimization (#333) · 69a6fb09
  Robert Kimball authored Dec 27, 2017
```
* enable -O3 optimization

* add flags to support release/debug builds
```
  69a6fb09
- Bob/nan (#335) · 2466bacd
  Robert Kimball authored Dec 27, 2017
```
* nan unit test

* fix NAN issue

* add INFINITY support
```
  2466bacd
- Revert "Adds more control for building MKL-DNN. (#322)" (#336) · 556fda0a
  Christian Convey authored Dec 27, 2017
```
This reverts commit 39383029.

It looks like the commit actually suppressed parallel makes
of MKL-DNN, at least in the case where ngraph itself was being
built with parallel make.  It also introduced problems with
make jobserver warnings.
```
  556fda0a
26 Dec, 2017 1 commit
- Embed header files into ngraph (#323) · 8aba2ada
  Robert Kimball authored Dec 26, 2017
```
* add resource file generator and store all headers used by codegen in memory.
```
  8aba2ada
22 Dec, 2017 2 commits
- Serializer emits simple element_type (#331) · 0f2a22e7
  Robert Kimball authored Dec 22, 2017
```
* cleanup

* cleanup

* update serializer to emit small, simple element_type. backwards compatible.

* allow for selecting indenting when serializing
```
  0f2a22e7
- Codegen: #error out if RTTI is enabled (#327) · 752396cd
  Jai Menon authored Dec 22, 2017
  
  752396cd
21 Dec, 2017 6 commits

Remove NGVM from src (#330) · 15959e73
Robert Kimball authored Dec 21, 2017
```
* remove ngvm

* remove NGVM from cmake
```
15959e73

Fix autodiff uninitialized data error (#329) · 3269387e

Robert Kimball authored Dec 21, 2017

* fix autodiff on non-NGVM backends. NGVM initializes all tensors to zero on allocation while the other backends do not. Had to initialize vector before use.

* change autodiff tests to use INTERPRETER

3269387e

set code model back to default as medium is causing the CPU.divide_by_zero_int32… · 04985466

Robert Kimball authored Dec 21, 2017

set code model back to default as medium is causing the CPU.divide_by_zero_int32 unit test to sefault when it throws an exception from the generated code (#328)

04985466

Jmenon/eigen opt (#326) · 588d69a4

Jai Menon authored Dec 21, 2017

* CPU: Optimize Eigen based rowwise vector broadcast

* CPU: Remove the need for transposing the broadcast vector

* CPU: Optimize to a replicate expression

* CPU: Change code model to medium and compile for the host CPU
instead of hardcoding BDW

588d69a4

Element Type simplification (#313) · 41cb4a2d

Robert Kimball authored Dec 21, 2017

* remove ParameterizedConstant
* use simpler element Type definition
* Move TraitedType to NGVM directory

41cb4a2d

20 Dec, 2017 2 commits
- Adds more control for building MKL-DNN. (#322) · 39383029
  Christian Convey authored Dec 20, 2017
```
* Adds CMake variables `MKLDNN_BUILD_COMMAND_EXTRA_FLAGS`
  and `MKLDNN_CMAKE_EXTRA_FLAGS`.
```
  39383029
- Add support for aliased output to CPU and INTERPRETER backends (#320) · d5e814aa
  Robert Kimball authored Dec 20, 2017
```
* aliased output unit test
* add support for aliased outputs to INTERPRETER and CPU
```
  d5e814aa