    __shfl_up_sync with mask for CUDA >= 9 · 970293a2
    Namgoo Lee authored
    * __shfl_up_sync with proper mask value for CUDA >= 9
    
    * BlockScanInclusive for CUDA >= 9
    
    * compatible_shfl_up for use in integral.hpp
    
    * Use CLAHE in cudev
    
    * Add tests for BlockScan
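
The version-dependent shuffle described in the commit message can be sketched as follows. This is a minimal illustration, not the code from this commit: the names `shfl_up_compat`, `warpScanInclusive`, `scanKernel`, and `runWarpScan` are hypothetical, and the full-warp mask `0xFFFFFFFF` is an assumption that holds only when all 32 lanes of the warp execute the call. The wrapper selects `__shfl_up_sync` when `CUDART_VERSION >= 9000` and falls back to the legacy `__shfl_up` otherwise; a warp-level inclusive scan is then built on top of it.

```cuda
#include <cuda_runtime.h>
#include <vector>

// Hypothetical wrapper (name and signature are illustrative, not from the commit).
// Assumption: the default mask covers the full warp, so every lane must participate.
template <typename T>
__device__ __forceinline__ T shfl_up_compat(T val, unsigned int delta,
                                            int width = 32,
                                            unsigned int mask = 0xFFFFFFFFu)
{
#if defined(CUDART_VERSION) && CUDART_VERSION >= 9000
    // CUDA 9 and later: the participating lanes are named explicitly by the mask.
    return __shfl_up_sync(mask, val, delta, width);
#else
    // Older toolkits: legacy shuffle without a mask argument.
    (void)mask;
    return __shfl_up(val, delta, width);
#endif
}

// Inclusive prefix sum across one warp using log2(32) = 5 shuffle steps.
__device__ int warpScanInclusive(int val)
{
    const unsigned int lane = threadIdx.x & 31u;
    for (unsigned int offset = 1; offset < 32; offset <<= 1)
    {
        // Every lane calls the shuffle; only lanes with a valid source accumulate.
        const int n = shfl_up_compat(val, offset);
        if (lane >= offset)
            val += n;
    }
    return val;
}

__global__ void scanKernel(const int* in, int* out)
{
    out[threadIdx.x] = warpScanInclusive(in[threadIdx.x]);
}

// Host helper: scans exactly 32 values with a single warp and returns the result.
std::vector<int> runWarpScan(const std::vector<int>& h_in)
{
    const size_t bytes = 32 * sizeof(int);
    int* d_in = nullptr;
    int* d_out = nullptr;
    cudaMalloc(&d_in, bytes);
    cudaMalloc(&d_out, bytes);
    cudaMemcpy(d_in, h_in.data(), bytes, cudaMemcpyHostToDevice);

    scanKernel<<<1, 32>>>(d_in, d_out);

    std::vector<int> h_out(32);
    // cudaMemcpy on the default stream waits for the kernel to finish.
    cudaMemcpy(h_out.data(), d_out, bytes, cudaMemcpyDeviceToHost);
    cudaFree(d_in);
    cudaFree(d_out);
    return h_out;
}
```

Calling `runWarpScan` on 32 ones should return the running totals 1 through 32. A block-level scan would additionally need to combine per-warp totals through shared memory, which this sketch leaves out.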
test_scan.cu 4.56 KB