-
Fenglei authored
* enable multi datatpye support for Cudnn. refactor binary ops using cudnn * fix bugs * add tests to skip list that CUDNN does not support * not int support on cudnn for backward pooling * no GPU.dot_4d_5d_multi_axis_big_fp64_VERY_SLOW test anymore * clang format * throw if datatype is int8 or int32 for backward pooling * comments * fix list in unit_test.manifest * add type support for alpha, beta * fix bugs * datatype support for alpha, beta * missing () * clang format * batchnorm backward bug fix * remove debug info * change member function name to snake case. remove comments * use nullptr instead of NULL * code style, use cuDNN everywhere in comments * add cudnn host parameters memory manager. * change name to allocate_by_datatype * compiled * debug * fix bug: using list instead of vector, vector address will change each time it resize * add CUDNN_DATA_UINT8 and CUDNN_DATA_UINT8x4
eef2b19d
Name |
Last commit
|
Last update |
---|---|---|
.ci/travis/ubuntu | ||
cmake | ||
contrib/docker | ||
doc | ||
licenses | ||
maint | ||
python | ||
src | ||
test | ||
.clang-format | ||
.gitignore | ||
.gitmodules | ||
.travis.yml | ||
CMakeLists.txt | ||
CONTRIB.md | ||
INSTALL.md | ||
LICENSE | ||
README.md | ||
VERSION.in | ||
changes.md |