• Fenglei's avatar
    gpu replace slice optimize (#1411) · c46d4546
    Fenglei authored
    * optimize replace slice
    
    * fix bugs
    
    * fix bug
    
    * optimize pad dynamic
    
    * fix bug
    
    * fix bug
    
    * fix bug
    
    * remove *
    
    * add gpu_assignment to pass
    
    * refactor cuda replace slice.
    
    * fix bug
    
    * refactor replace slice
    
    * working version
    
    * clang format
    
    * us layout instead of assignment
    
    * us layout instead of assignment in cmakelist
    
    * update gpu_layout
    
    * fix bugs
    
    * resolve conflict
    
    * GPUShape to NVShape
    
    * using kernel args
    
    * using kernel args
    
    * fix bugs
    
    * fix bugs
    
    * fix bug, remove mkldnn.h from gpu_layout.cpp
    
    * fix bug for pad_below
    
    * remove cast to rep_slice
    
    * fix bugs
    
    * clang format
    
    * change add_in_place_oi_pair({0, 0, false} to add_in_place_oi_pair({0, 0, true};
    c46d4546
gpu_cuda_kernel_builder.cpp 53.9 KB