• Tristan Webb's avatar
    Drwebb/gpu backend dot op (#413) · 94d80ffa
    Tristan Webb authored
    * Drwebb/gpu backend dot op (#387)
    
    * GPU Dot prod emitter switch statement
    
    * cuBLAS dot kernel call
    
    * Flush out arg substitution into gpu dot kernel call
    
    * Drwebb/gpu backend dot op (#392)
    
    * Take in CodeWriter into gpu op emitters
    
    * Introduce GPU function gen based on pass functions
    
    * Additional gpu emitter stubs
    
    * link cublas in to unit test and ngraph
    
    * Use static code gen methods for GPU, add new GPU op stubs
    
    * use pass manager to declare functions / cublas Updates
    
    * Prune down gpu_external_function wip
    
    * Switch back to GPU tensor views in GPU backend
    
    * Pass in cublas handle to GPU external function
    
    * cuMalloc memory in gpu tensor view
    
    * Use cuda runtime malloc and free for tensor view managment c
    
    * change GPU tensor view init, and use GPU tensor view for GPU call frame
    
    * include headers as system dirs
    
    * GPU tensor printing utility function
    
    * cublasSetPointer to device mode / Fix copyright notification lowercasing
    
    * Passing GPU dot product test using cuBLAS
    
    Clean up
    
    * Changes from review
    94d80ffa
Name
Last commit
Last update
..
models Loading commit data...
ref_generators Loading commit data...
util Loading commit data...
CMakeLists.txt Loading commit data...
autodiff.in.cpp Loading commit data...
backend_debug_api.cpp Loading commit data...
backend_performance.cpp Loading commit data...
backend_test.in.cpp Loading commit data...
build_graph.cpp Loading commit data...
builder.cpp Loading commit data...
builder_autobroadcast.cpp Loading commit data...
builder_xla.cpp Loading commit data...
codegen.cpp Loading commit data...
convolution_test.in.cpp Loading commit data...
copy.cpp Loading commit data...
cudnn.cpp Loading commit data...
eigen.cpp Loading commit data...
element_type.cpp Loading commit data...
file_util.cpp Loading commit data...
input_output_assign.cpp Loading commit data...
main.cpp Loading commit data...
mkldnn.cpp Loading commit data...
ngraph.cpp Loading commit data...
op.cpp Loading commit data...
pass_liveness.cpp Loading commit data...
pass_manager.cpp Loading commit data...
pass_memory_layout.cpp Loading commit data...
pattern.cpp Loading commit data...
runtime_manager.cpp Loading commit data...
serialize.cpp Loading commit data...
shape.cpp Loading commit data...
tensor.cpp Loading commit data...
type_prop.cpp Loading commit data...
update_reference.sh Loading commit data...
util.cpp Loading commit data...
uuid.cpp Loading commit data...