• Fenglei's avatar
    nvgpu optimize reshape v3 (#1617) · 84de3bf4
    Fenglei authored
    * pass args instead of pointer to array
    
    * add 3d tiled reshpae
    
    * working version
    
    * add shared mem version of 2d, 3d reshape
    
    * remove unused code
    
    * style
    
    * resolve commits
    
    * add test for 3D reshape, some 3D reshape will be treat as 2D
    84de3bf4
gpu_cuda_kernel_builder.hpp 11.7 KB