modules/dnn · b593cae0377f5b7257cf32f5029e0a2609828682 · submodule / opencv_contrib

some further optimizations and cleanups in dnn (#1237) · b593cae0

Vadim Pisarevsky authored Jun 21, 2017

* some further optimizations and cleanups in dnn:
+ got rid of dnn::gemm; it's not perf critical anymore (perhaps)
+ embedded col2im functionality into convolution_layer.cpp, since it's not used anywhere else
+ parallel max pooling. even better performance can be achieved if we knew that max indices are not needed (and they are not needed in most networks)
+ somewhat optimized deconvolution layer: optimized bias addition (merged it with col2im), optimized col2im slightly.
+ hopefully fixed incorrect memory access in fully-connected layer; restored aligned memory reads (they should work fine now)

* hopefully fixed regressions in ENet performance

* fixed some typos in deconvolution; added SIMD optimization for the max pooling layer

* fixed warnings in SIMD-less build configuration

b593cae0

Name	Last commit	Last update
..
3rdparty/protobuf		Loading commit data...
cmake		Loading commit data...
include/opencv2		Loading commit data...
misc		Loading commit data...
perf		Loading commit data...
samples		Loading commit data...
src		Loading commit data...
test		Loading commit data...
tutorials		Loading commit data...
CMakeLists.txt		Loading commit data...
README.md		Loading commit data...

README.md