Files · 2938860b3f292e600cfb7404faa78660ca86516c · submodule / opencv

Provide a few AVX512 optimized functions for the DNN module · 2938860b

Arjan van de Ven authored Dec 25, 2017

This patch adds AVX512 optimized fastConv as well as the hookups
needed to get these called in the convolution_layer.

AVX512 fastConv is code-identical on a C level to the AVX2 one,
but is measurably faster due to AVX512 having more registers available
to cache results in.

Signed-off-by: Arjan van de Ven <arjan@linux.intel.com>
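
The "hookups" described above amount to runtime dispatch: the same kernel source is built for more than one instruction set and the caller picks the widest variant the CPU supports. In 64-bit mode AVX2 exposes 16 vector registers and AVX-512 exposes 32, so an identical loop body can keep more accumulators in registers when built for AVX-512. The following is a minimal sketch of that pattern, not the code from this patch. All names in it (`CpuFeatures`, `sketch_avx2`, `sketch_avx512`, `selectFastConv`, `fastConvScalar`) are hypothetical, and both "variants" are plain C++ here so that the sketch stays portable; a real build would compile each copy with different instruction-set flags and query the CPU instead of filling in a struct by hand.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical feature flags; a real build would query the CPU at runtime.
struct CpuFeatures {
    bool avx2 = false;
    bool avx512 = false;
};

using ConvKernel = void (*)(const float* weights, const float* input,
                            float* output, std::size_t outChannels,
                            std::size_t length);

// Shared kernel body: output[c] = dot(row c of weights, input).
inline void fastConvScalar(const float* weights, const float* input,
                           float* output, std::size_t outChannels,
                           std::size_t length) {
    assert(weights && input && output);
    for (std::size_t c = 0; c < outChannels; ++c) {
        float acc = 0.0f;  // accumulator the compiler can keep in a register
        for (std::size_t i = 0; i < length; ++i)
            acc += weights[c * length + i] * input[i];
        output[c] = acc;
    }
}

// Assumption: in a real build each namespace would hold the same source
// compiled with different instruction-set flags. Here both simply forward
// to the scalar body.
namespace sketch_avx2 {
inline void fastConv(const float* w, const float* in, float* out,
                     std::size_t oc, std::size_t len) {
    fastConvScalar(w, in, out, oc, len);
}
}  // namespace sketch_avx2

namespace sketch_avx512 {
inline void fastConv(const float* w, const float* in, float* out,
                     std::size_t oc, std::size_t len) {
    fastConvScalar(w, in, out, oc, len);
}
}  // namespace sketch_avx512

// Pick the widest variant available, falling back to the scalar kernel.
inline ConvKernel selectFastConv(const CpuFeatures& f) {
    if (f.avx512) return sketch_avx512::fastConv;
    if (f.avx2) return sketch_avx2::fastConv;
    return fastConvScalar;
}
```

A caller would fill in `CpuFeatures` once, store the pointer returned by `selectFastConv`, and invoke it for every convolution, so the feature check stays out of the inner loop.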

Name
.github
3rdparty
apps
cmake
data
doc
include
modules
platforms
samples
.gitattributes
.gitignore
.tgitconfig
CMakeLists.txt
CONTRIBUTING.md
LICENSE
README.md

README.md