• Chris Sullivan's avatar
    Auto. gen. kernel signatures and argument expansion (#1326) · 8476dea0
    Chris Sullivan authored
    * Add GPUKernelArgs for storing kernel arguments.
    
    * Formatting.
    
    * Resolve tensor addresses when extracting arg list via GPUKernelArgs.
    
    * Updated arg list resolution so that placeholder arguments can be added anywhere in the argument list.
    
    * const ref. args and changed add_args to use add_arg. also expanded type_names map.
    
    * GPUKernelArgs bug fix for return values.
    
    * add_placeholders expects pointers for later resolution
    
    * Formatting.
    
    * Add comments to GPUKernelArgs
    
    * Changed GPUKernelArgs interface to use a runtime variable number of arguments.
    
    * Removed/updated comment.
    
    * Address review comments: Remove combined address resolution and argument list retrieval. Remove unecessary extra type entries in type_map.
    
    * Add space between pragma once and includes.
    
    * Broadcast optimization (#1322)
    
    * Implement GPUKernelArgs with op::Broadcast.
    
    * Removed excess type insertion in kernel signature for broadcast impl.
    
    * Support new auto kernel signature generation for op::Broadcast. Add boolean to helpers to determine if parameters are registers or arrays.
    
    * Removed commented code.
    
    * Update broadcast impl. for new GPUKernelArgs interface.
    
    * Updated based on interface change to GPUKernelArgs.
    
    * Formatting.
    
    * CUDNNHostParameters now implement GPUHostParameters. (#1324)
    
    * Formatting.
    8476dea0
Name
Last commit
Last update
.ci/travis/ubuntu Loading commit data...
cmake Loading commit data...
contrib/docker Loading commit data...
doc Loading commit data...
licenses Loading commit data...
maint Loading commit data...
python Loading commit data...
src Loading commit data...
test Loading commit data...
.clang-format Loading commit data...
.gitignore Loading commit data...
.gitmodules Loading commit data...
.travis.yml Loading commit data...
CMakeLists.txt Loading commit data...
CONTRIB.md Loading commit data...
INSTALL.md Loading commit data...
LICENSE Loading commit data...
README.md Loading commit data...
VERSION.in Loading commit data...
changes.md Loading commit data...