    CUDNN and CUDA kernels for AvgPool (forward/backward) (#951) · b1b3d4d6
    Chris Sullivan authored
    * Added op::AvgPool cudnn impl. which works for 2-3 spatial dimensions and no/symmetric padding. Enabled tests.
    
    * Added cuda-c implementation of average pool which handles 1-3 spatial
    dimensions as well as asymmetric padding. This commit also introduces
    several helper functions for performing fast integer division and
    fast constant memory access.
    
    * Formatting. Removed bool that was used for testing to force the cuda impl. over cudnn.
    
    * Added CUDNN AvgPoolBackprop implementation.
    
    * Removed inline enum in favor of a helper struct. Removed instances of multiple declarations on a single line. Updated comments.
    
    * Removed the _ prefix from helper functions in the anonymous namespace.
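
The asymmetric-padding case mentioned in the commit message can be illustrated with a small CPU reference for one spatial dimension. This is only a sketch under stated assumptions, not the kernel from the commit: the function name `avg_pool_1d`, its parameters, and the `include_padding_in_avg` switch are hypothetical and chosen for illustration, and the code does not use the fast integer division or constant memory helpers the commit introduces.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical reference helper (not taken from the commit): 1-D average
// pooling on the CPU with independent padding below and above the input.
std::vector<float> avg_pool_1d(const std::vector<float>& input,
                               std::size_t window,
                               std::size_t stride,
                               std::size_t pad_below,
                               std::size_t pad_above,
                               bool include_padding_in_avg)
{
    std::vector<float> output;
    const std::size_t padded = input.size() + pad_below + pad_above;
    if (window == 0 || stride == 0 || padded < window)
    {
        return output; // no valid window position
    }

    const std::size_t out_size = (padded - window) / stride + 1;
    output.reserve(out_size);

    for (std::size_t o = 0; o < out_size; ++o)
    {
        const std::size_t start = o * stride; // window start in padded coordinates
        float sum = 0.0f;
        std::size_t valid = 0;

        for (std::size_t k = 0; k < window; ++k)
        {
            const std::size_t p = start + k;
            // Only positions that map back into the real input contribute.
            if (p >= pad_below && p < pad_below + input.size())
            {
                sum += input[p - pad_below];
                ++valid;
            }
        }

        // Assumption: padded elements count as zeros when included in the
        // average; otherwise only the real elements are counted.
        const std::size_t divisor = include_padding_in_avg ? window : valid;
        output.push_back(divisor == 0 ? 0.0f : sum / static_cast<float>(divisor));
    }
    return output;
}
```

For example, calling `avg_pool_1d({1, 2, 3, 4}, 2, 1, 1, 0, false)` pads one element below and none above, so the first window covers one padded slot and one real value. A GPU version would typically assign one output element per thread and compute the same window bounds from the thread index.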
Name
.ci/travis/ubuntu
cmake
contrib/docker
doc
licenses
maint
python
src
test
third-party
.clang-format
.gitignore
.gitmodules
.travis.yml
CMakeLists.txt
INSTALL.md
LICENSE
README.md
VERSION.in
changes.md