modules/dnn/src · 2938860b3f292e600cfb7404faa78660ca86516c · submodule / opencv

Provide a few AVX512 optimized functions for the DNN module · 2938860b

Arjan van de Ven authored Dec 25, 2017

This patch adds AVX512 optimized fastConv as well as the hookups
needed to get these called in the convolution_layer.

AVX512 fastConv is code-identical on a C level to the AVX2 one,
but is measurably faster due to AVX512 having more registers available
to cache results in.
Signed-off-by: Arjan van de Ven <arjan@linux.intel.com>

2938860b

Name	Last commit	Last update
..
caffe		Loading commit data...
darknet		Loading commit data...
layers		Loading commit data...
ocl4dnn		Loading commit data...
opencl		Loading commit data...
tensorflow		Loading commit data...
torch		Loading commit data...
dnn.cpp		Loading commit data...
halide_scheduler.cpp		Loading commit data...
halide_scheduler.hpp		Loading commit data...
init.cpp		Loading commit data...
nms.cpp		Loading commit data...
nms.inl.hpp		Loading commit data...
op_halide.cpp		Loading commit data...
op_halide.hpp		Loading commit data...
precomp.hpp		Loading commit data...