Commits · 408f3b25f0c67dbb4be610d0adecc2025af770cc · submodule / ngraph

19 Jan, 2018 3 commits

Tristan Webb authored Jan 19, 2018

* Add mention of blob ref of original file from caffe2

* Mention location of source listing originally from LLVM project

408f3b25

Forward prop for average pooling (#380) · 0931b83b

Adam Procter authored Jan 19, 2018

* Average pool type checking and kernel; type checking tests

* Fix and enable average-pool tests

* Docstring fix

* Extend AvgPool op type checking to support padding

* Untested code for padded avg-pool

* Unit tests for padded avg-pool

* Add CPU implementation

* Temp delete

* Docstring fix

* Docstring fix

* Add tests mixing padding and stride

* Temporary cut to ease merge

* Restore temporary cut for merge

* Empty commit to try to force CI to wake up

0931b83b

Fix improper bracketing of certain kernel calls in CPU codegen (#394) · eb74486c
Adam Procter authored Jan 19, 2018

eb74486c

18 Jan, 2018 5 commits
- bprop for MaxPool (#391) · 9264bc16
  Nick Korovaiko authored Jan 18, 2018
  
  9264bc16
- zero-sized tensor tests with multiple data types (#378) · d43a0557
  Matthew Brookhart authored Jan 18, 2018
  
  d43a0557
- Remove references on element::Type (m_element_type) in tensor.hpp,types.hpp,convert.hpp (#389) · ab3e3965
  Nick Korovaiko authored Jan 18, 2018
```
* remove refs on types in tensor,tensor_view_type,convert

* fix build breaks
```
  ab3e3965
- Rewrite of is_functionally_identical behavior (#366) · 46199d5f
  Robert Kimball authored Jan 18, 2018
```
* change the default is_functionally_identical to return false, so an op that forgets to override it gets behavior that may be slower to compile but will at least work
```
  46199d5f
- Yixing/empty tuple (#390) · 9d0d7a7c
  Robert Kimball authored Jan 18, 2018
```
* add test for empty tuple

* fix null function breaking
```
  9d0d7a7c
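
The conservative default described in the `is_functionally_identical` rewrite (#366) above can be illustrated with a minimal sketch. The class names `Op`, `Add`, and `CustomOp` are hypothetical stand-ins, not the actual ngraph classes, and the sketch assumes that the check exists to let the compiler reuse generated code for equivalent ops, which is what the "slower to compile" remark in the commit message suggests.

```cpp
#include <string>

// Hypothetical stand-in for an op base class; not the real ngraph node type.
class Op
{
public:
    virtual ~Op() = default;
    virtual std::string name() const = 0;

    // Conservative default: an op that does not override this is never
    // reported as identical to another op, so nothing is wrongly shared.
    virtual bool is_functionally_identical(const Op&) const { return false; }
};

// An op that opts in by overriding the check.
class Add : public Op
{
public:
    std::string name() const override { return "Add"; }
    bool is_functionally_identical(const Op& other) const override
    {
        // Assumption for this sketch: two Add ops are interchangeable.
        return dynamic_cast<const Add*>(&other) != nullptr;
    }
};

// An op whose author forgot to override the check; it inherits `false`.
class CustomOp : public Op
{
public:
    std::string name() const override { return "CustomOp"; }
};
```

With this default, two `CustomOp` instances are simply treated as distinct: potentially redundant work, but never an incorrect merge.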
17 Jan, 2018 5 commits

add toggle to compiler diagnostic output (#388) · f6a578b4
Robert Kimball authored Jan 17, 2018

f6a578b4
remove a node from users (#379) · 981dabef
varun-intel authored Jan 17, 2018
```
* remove a node from users

* style
```
981dabef
Add mxnet seq2seq serialized model for benchmarking (#385) · 5ad1de22
Robert Kimball authored Jan 17, 2018
```
* add mxnet seq2seq forward and backward

* add benchmarks for seq2seq forward and backward
```
5ad1de22
Numerically stable sum so we can pass mxnet unit tests (#381) · b6c98de1
Matthew Brookhart authored Jan 17, 2018
```
* Numerically stable sum so we can pass mxnet unit tests

* Add a small initial residual
```
b6c98de1

Drwebb/gpu external function (#367) · c5549682

Tristan Webb authored Jan 17, 2018

* Initial GPU_ExternalFunction implementation

Other changes:

Add GPU runtime to same cmake block as GPU, include CUDA headers if GPU enabled

Initial passing (a+b)*c test

Properly link cuda libraries

Simple GPUTensorView implementation

Initial GPU emitter

GPU codegen initial function gen, no kernels yet

Rename GPU emitter and tensor_view_wrapper to match naming convention

* GPU external function based on BASE

* Fix stray base -> gpu

* TensorViewWrapper -> GPU_TensorViewWrapper

* Copy over emitter from base transformer

* Fix for naming dense layout

* Copy kernel emitters from base -> gpu and strip out kernel_utils

* Add aliases to GPU_TensorViewWrappers

* More fixes for naming descriptor::TensorViews

* Move in call_frame implementation from base -> gpu

* apply code format

* GPU codegen running A+B*C

gpu emitters
gpu ctx setup cuda_module kernels
Remove GPU_CF perf counters
Use gpu kernels in external function
Add GPU 1d dot test

Review Changes:
* Remove CPU specific kernel emitting method bodies

* Use copy_data from test/util.cpp, uncomment compileTest

* Use test_utils copy_data function

* Grab function name from pass manager for def, clean up indentation

c5549682

16 Jan, 2018 3 commits
- Add a few more openmp ops (#374) · e433e55a
  Matthew Brookhart authored Jan 16, 2018
```
* Add a few more openmp ops

* fix a warning

* fix merge error
```
  e433e55a
- Implement select-and-scatter (#364) · 29231e11
  Adam Procter authored Jan 16, 2018
  
  29231e11
- Yixing/argon install (#370) · d2b081c8
  Yixing Lao authored Jan 16, 2018
```
* bump argon version

* ask argon to install itself

* bump version again

* argon lib dir

* installs argon to ngraph_dist

* fix path

* upgrade argon version
```
  d2b081c8
14 Jan, 2018 2 commits
- Fix error where compiler's result is properly set to nullptr if compile fails (#375) · 5e80b771
  Robert Kimball authored Jan 14, 2018
```
Add support for reinitializing the compiler if a compile fails, allowing subsequent compiles to succeed
```
  5e80b771
- make CPU emit functions static so they can be called by other backends (#376) · 2775b0bf
  Robert Kimball authored Jan 14, 2018
  
  2775b0bf
12 Jan, 2018 1 commit
- Image batch dilation for convolution (#363) · c682fbf4
  Adam Procter authored Jan 12, 2018
```
Sub-PR: image dilation tests (#362) via @adstraw 
```
  c682fbf4
11 Jan, 2018 2 commits
- add interpreter nan check option (#368) · 74850150
  Robert Kimball authored Jan 11, 2018
```
* add interpreter nan check option

* add unit test
```
  74850150
- Better error message from runtime::Manager. · a2d97200
  Christian Convey authored Jan 11, 2018
  
  a2d97200
10 Jan, 2018 4 commits

Pattern matching for sum (#293) · 4345e39d

Nick Korovaiko authored Jan 10, 2018

* the first stab at pattern for sum

test refactoring, debug msg clean up, formatting fixes

removing v1 and cleaning up v2 + formatting

rollback the changes in reduce_ops

rename v2 -> sum_pred

remove unused funcs

switch to new c-tors

remove TensorViewType

removing an assert

fix a docstring to match a c-tor

* fixes after rebase

4345e39d

Implement reduce-window in interpreter and CPU (#359) · c5ffe8e9
Adam Procter authored Jan 10, 2018

c5ffe8e9
fix some is_functionally_identical methods (#365) · 7b1dc3e3
Robert Kimball authored Jan 10, 2018

7b1dc3e3

Switch from Eigen to OpenMP for loops for DS2 kernels (#345) · 7df687c1

Matthew Brookhart authored Jan 10, 2018

* speed up reduceslice with kernel emitter

* const-ify and fix a clang warning

* add elementwise ops, slice to for loops

* add broadcast codegen

* add Exp

* fix bugs introduced in eigen kernels

* fix another introduced bug in Eigen

* Fix an Atomic Bug with Sum, do some cleanup

* unit tests pass

* Add Reshape Op, passes Tests

* rewrite sum to correctly handle multi-threading

* Code Cleanup

* add some extra unary ops

* Address review comments

* fix an error in the review comment refactor

* Add Power op

* Add most of the Logic Ops

* Make Concat default to OpenMP kernel

* fix n-D reshape issue

7df687c1

09 Jan, 2018 3 commits

Remove an optimization for caching a list of ordered ops (#360) · 7e89f1bb

Nick Korovaiko authored Jan 09, 2018

* remove caching of ordered_ops

* graph_util logging msgs

* small cleanup

* remove files for the TopologicalSort pass

* remove NGRAPH_DEBUG from graph_util.hpp

7e89f1bb

Fixes minor bugs in XLA-specific code. (#361) · 8627c495
Christian Convey authored Jan 09, 2018

8627c495

Optimizations to reduce compile time (#357) · 7f3dc2d7

Robert Kimball authored Jan 09, 2018

* much faster compile time
* Remove all variables and just directly access inputs, output, and temps.
* compare layouts when checking if two ops are equal
* make performance counters available to all backends

7f3dc2d7

08 Jan, 2018 2 commits
- Definitions of XLA ConvNet MNIST ops (#324) · 524d04fc
  Adam Procter authored Jan 08, 2018
  
  524d04fc
- Optimize the Coordinate class to prevent copies (#358) · 686ee9ab
  Robert Kimball authored Jan 08, 2018
  
  686ee9ab
06 Jan, 2018 1 commit
- fix boolean ops to return the input element::type instead of float32 (#356) · 07ba1bef
  Matthew Brookhart authored Jan 06, 2018
  
  07ba1bef
05 Jan, 2018 4 commits

Zero padding for convolution (#352) · 8c4ae5ea
Adam Procter authored Jan 05, 2018

8c4ae5ea
Remove descriptor::Value and runtime::Value (#355) · 06f9efd9
Robert Kimball authored Jan 05, 2018
```
* general cleanup

* remove runtime::Value

* more cleanup

* more cleanup
```
06f9efd9
Remove unused args from Input (#353) · f4bb3e46
Robert Kimball authored Jan 05, 2018
```
* cleanup

* remove arg_index

* remove argno from Input

* uncleanup
```
f4bb3e46

Drwebb/gpu runtime boilerplate (#314) · feab44b5

Tristan Webb authored Jan 05, 2018

* Simple boilerplate for GPU runtime files

  - GPUBackend
  - GPU ExternalFunction
  - GPUManager
  - GPUCallFrame

* Test for construction of all GPU runtime classes

* Comment out calls, constructors haven't been defined

* Clang CUDA source example to later test compiling

Clang cuda example from:
https://gist.github.com/anonymous/855e277884eb6b388cd2f00d956c2fd4

* Initial nvptx compiler copied from CPU compiler sources

* Define FunctionMap and Instruction for gpu external function

* Rename Compiler -> NVPTXCompiler for gpu compile. Add call to compile for test

* Rename StaticCompiler -> NVPTXStaticCompiler for GPU code gen

* Add nvptx_compiler and nvptx_execution_engine to gpu sources

* Compiling source unit test using hardcoded PTX

* (a+b)*c test for GPU

* WIP Fix compile

* Remove accidentally included file

* Fix compile, and LLVM link errors from nvptx_compiler.cpp

* Stub out parts needed for GPU manager

* Test GPU runtime method stubs

* Cleanup

* Add GPU runtime to same cmake block as GPU, include CUDA headers if GPU enabled

* Kill reflexive assertion

* change GPU naming convention to match CPU

* Snake case functions and identifiers in test case

* Change element type to match changes in master

* Make CUDA headers accessible for codegen with GPU transformer

* clang-format

* apply-code-format

feab44b5

04 Jan, 2018 2 commits
- add missing ops to serializer (#351) · 2218cf9f
  Robert Kimball authored Jan 04, 2018
  
  2218cf9f
- prerequisites for html build during docs development (#349) · fe33af85
  DawnStone authored Jan 04, 2018
```
* updated the sphinx version using pip install in Dockerfile.ngraph_cpp

added a make target to build the docs to the contrib/docker/Makefile

* avoid upgrade pip message during build
```
  fe33af85
03 Jan, 2018 1 commit
- bump argon version (#348) · c6bfa697
  Yixing Lao authored Jan 03, 2018
  
  c6bfa697
02 Jan, 2018 1 commit
- Fix a logic bug introduced by #325 (#347) · 2f0a262e
  Matthew Brookhart authored Jan 02, 2018
  
  2f0a262e
30 Dec, 2017 1 commit

Forward prop for max pooling (#305) · d901282e

Adam Procter authored Dec 30, 2017

* Definition and type checking for max pool

* Implement kernel, integrate into INTERPRETER, add a few unit tests, make function result type mismatch error message more informative (still need to update tests to reflect that)

* Temporarily delete unit tests to ease merge

* Temporarily delete unit tests to ease merge

* Restore deleted unit tests

* Fix a broken error message check in the unit tests

* Update to handle various TensorViewType-related things going away; add NGVM support

* Add codegen case

* Change various get_blah_shape methods to return const refs, and while we're here, make a similar change where it should have been done in convolution

* Use NDArray for max-pool tests

d901282e