• Tristan Webb's avatar
    Drwebb/gpu external function (#367) · c5549682
    Tristan Webb authored
    * Initial GPU_ExternalFunction implementation
    
    Other changes:
    
    Add GPU runtime to same cmake block as GPU, include CUDA headers if GPU enabled
    
    Initial passing (a+b)*c test
    
    Properly link cuda libraries
    
    Simple GPUTensorView implementation
    
    Initial GPU emitter
    
    GPU codegen initial function gen, no kernels yet
    
    Rename GPU emitter and tensor_view_wrapper to match naming convention
    
    * GPU external function based on BASE
    
    * Fix stray base -> gpu
    
    * TensorViewWrapper -> GPU_TensorViewWrapper
    
    * Copy over emitter from base transformer
    
    * Fix for naming dense layout
    
    * Copy kernel emitters from base -> gpu and strip out kernel_utils
    
    * Add aliases to GPU_TensorViewWrappers
    
    * More fixes for naming descriptor::TensorViews
    
    * Move in call_frame implementation from base -> gpu
    
    * apply code format
    
    * GPU codegen running A+B*C
    
    gpu emitters
    gpu ctx setup cuda_module kernels
    Remove GPU_CF perf counters
    Use gpu kernels in external function
    Add GPU 1d dot test
    
    Review Changes:
    * Remove CPU specific kernel emitting method bodies
    
    * Use copy_data from test/util.cpp, uncomment compileTest
    
    * Use test_utils copy_data function
    
    * Grab function name from pass manager for def, clean up indentation
    c5549682
Name
Last commit
Last update
..
models Loading commit data...
ref_generators Loading commit data...
util Loading commit data...
CMakeLists.txt Loading commit data...
autodiff.in.cpp Loading commit data...
backend_debug_api.cpp Loading commit data...
backend_performance.cpp Loading commit data...
backend_test.in.cpp Loading commit data...
build_graph.cpp Loading commit data...
builder.cpp Loading commit data...
builder_autobroadcast.cpp Loading commit data...
builder_xla.cpp Loading commit data...
codegen.cpp Loading commit data...
convolution_test.in.cpp Loading commit data...
copy.cpp Loading commit data...
cudnn.cpp Loading commit data...
eigen.cpp Loading commit data...
element_type.cpp Loading commit data...
file_util.cpp Loading commit data...
input_output_assign.cpp Loading commit data...
main.cpp Loading commit data...
mkldnn.cpp Loading commit data...
ngraph.cpp Loading commit data...
op.cpp Loading commit data...
pass_liveness.cpp Loading commit data...
pass_manager.cpp Loading commit data...
pass_memory_layout.cpp Loading commit data...
pattern.cpp Loading commit data...
runtime_manager.cpp Loading commit data...
serialize.cpp Loading commit data...
shape.cpp Loading commit data...
tensor.cpp Loading commit data...
type_prop.cpp Loading commit data...
update_reference.sh Loading commit data...
util.cpp Loading commit data...
uuid.cpp Loading commit data...