Merge pull request #16063 from YashasSamaga:cuda4dnn-shortcut-unequal
support eltwise sum with different number of input channels in CUDA backend

* add shortcut primitive
* add offsets in shortcut kernel
* skip tests involving more than two inputs
* remove redundant modulus operation
* support multiple inputs
* remove whole file indentation
* skip acc in0 trunc test if weighted
* use shortcut iff channels are unequal
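
For readers unfamiliar with the operation, here is a minimal CPU reference sketch of an eltwise sum over inputs with unequal channel counts. It is not the CUDA kernel from this pull request. It assumes NCHW layout, matching batch and spatial dimensions, an output that takes the shape of the first input, and that channels beyond the shared range are simply ignored. The names `Tensor`, `shortcut_add` and `shortcut` are hypothetical and exist only for this illustration.

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

// Hypothetical NCHW tensor for illustration only (not an OpenCV type).
struct Tensor {
    std::size_t n, c, h, w;
    std::vector<float> data; // holds n * c * h * w values
};

// Adds `in` into `out` over the channels both tensors share.
// Assumption: batch size and spatial dimensions match; only channels differ.
void shortcut_add(Tensor& out, const Tensor& in) {
    const std::size_t common = std::min(out.c, in.c);
    const std::size_t plane = out.h * out.w;
    for (std::size_t b = 0; b < out.n; ++b) {
        for (std::size_t ch = 0; ch < common; ++ch) {
            // Per-tensor offsets differ because the channel strides differ.
            const std::size_t out_off = (b * out.c + ch) * plane;
            const std::size_t in_off = (b * in.c + ch) * plane;
            for (std::size_t i = 0; i < plane; ++i)
                out.data[out_off + i] += in.data[in_off + i];
        }
    }
}

// The output starts as a copy of the first input; the remaining inputs
// are accumulated into it one at a time. `inputs` must not be empty.
Tensor shortcut(const std::vector<Tensor>& inputs) {
    Tensor out = inputs.front();
    for (std::size_t k = 1; k < inputs.size(); ++k)
        shortcut_add(out, inputs[k]);
    return out;
}
```

When every input has the same number of channels, this reduces to a plain element-wise sum, which is consistent with the last bullet: the shortcut path is only needed when the channel counts differ.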