Files · 94d80ffabedb1c0b5ff551ab6de85448f9999d2f · submodule / ngraph

Drwebb/gpu backend dot op (#413) · 94d80ffa

Tristan Webb authored Jan 24, 2018

* Drwebb/gpu backend dot op (#387)

* GPU Dot prod emitter switch statement

* cuBLAS dot kernel call

* Flush out arg substitution into gpu dot kernel call

* Drwebb/gpu backend dot op (#392)

* Take in CodeWriter into gpu op emitters

* Introduce GPU function gen based on pass functions

* Additional gpu emitter stubs

* link cublas in to unit test and ngraph

* Use static code gen methods for GPU, add new GPU op stubs

* use pass manager to declare functions / cublas Updates

* Prune down gpu_external_function wip

* Switch back to GPU tensor views in GPU backend

* Pass in cublas handle to GPU external function

* cuMalloc memory in gpu tensor view

* Use cuda runtime malloc and free for tensor view managment c

* change GPU tensor view init, and use GPU tensor view for GPU call frame

* include headers as system dirs

* GPU tensor printing utility function

* cublasSetPointer to device mode / Fix copyright notification lowercasing

* Passing GPU dot product test using cuBLAS

Clean up

* Changes from review

94d80ffa

Name	Last commit	Last update
cmake		Loading commit data...
contrib/docker		Loading commit data...
doc		Loading commit data...
maint		Loading commit data...
src		Loading commit data...
test		Loading commit data...
third-party		Loading commit data...
.clang-format		Loading commit data...
.gitignore		Loading commit data...
CMakeLists.txt		Loading commit data...
README-RESNET.rst		Loading commit data...
README.md		Loading commit data...
changes.md		Loading commit data...

README-RESNET.rst