• Julien's avatar
    Merge pull request #16218 from JulienMaille:cuda-dnn-for-older-gpus · 4e2ef8c8
    Julien authored
    Enable cuda4dnn on hardware without support for __half
    
    * Enable cuda4dnn on hardware without support for half (ie. compute capability < 5.3)
    
    Update CMakeLists.txt
    
    Lowered minimum CC to 3.0
    
    * UPD: added ifdef on new copy kernel
    
    * added fp16 support detection at runtime
    
    * Clarified #if condition on atomicAdd definition
    
    * More explicit CMake error message
    4e2ef8c8
concat.cu 11.9 KB