-
Fenglei authored
* optimize replace slice * fix bugs * fix bug * optimize pad dynamic * fix bug * fix bug * fix bug * remove * * add gpu_assignment to pass * refactor cuda replace slice. * fix bug * refactor replace slice * working version * clang format * us layout instead of assignment * us layout instead of assignment in cmakelist * update gpu_layout * fix bugs * resolve conflict * GPUShape to NVShape * using kernel args * using kernel args * fix bugs * fix bugs * fix bug, remove mkldnn.h from gpu_layout.cpp * fix bug for pad_below * remove cast to rep_slice * fix bugs * clang format * change add_in_place_oi_pair({0, 0, false} to add_in_place_oi_pair({0, 0, true};
c46d4546