    Nd convolution via blocked GEMM for C{d1,...,dn}N layout (#1131) · ae45c984
    Chris Sullivan authored
    * Added blank convolution kernel and refactored coordinate transform kernel helper.
    
    * Added op::Reshape to the CUDAEmitter.
    
    * Added 2-Nd tiled convolution.
    
    * Bug fixes with data_dilation and filter loop. Still need to add test for coverage of register tiling.
    
    * Styling.
    
    * Removed some comments and code added for testing.
    
    * Some tests became enabled in merge, removing them.
Name
.ci/travis/ubuntu
cmake
contrib/docker
doc
licenses
maint
python
src
test
.clang-format
.gitignore
.gitmodules
.travis.yml
CMakeLists.txt
CONTRIB.md
INSTALL.md
LICENSE
README.md
VERSION.in
changes.md