Vadim Pisarevsky authored
* Enabled convolution & activation fusion.
* A few more optimizations:
  + Optimized the common case when the indices of the max pooling layer are not used; in this case we take the more efficient branch that computes just the maximums over the aperture.
  + Optimized the convolution + activation fusion when the activation is ReLU, which is another common case.
  + Convolution can now be fused with batch norm. It's a zero-cost fusion (see the sketch below). If the batch norm is followed by ReLU, all three (conv + batchnorm + ReLU) are fused together. This modification seriously improved ENet performance.
* Hopefully fixed warnings on Windows.
e551d15c
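The "zero-cost" batch-norm fusion mentioned in the commit message is the standard trick of folding the batch-norm scale and shift into the convolution's own weights and bias, so no separate layer runs at inference time. The sketch below illustrates the arithmetic on plain arrays; the function and variable names are illustrative assumptions, not the actual OpenCV dnn implementation.

```cpp
#include <cmath>
#include <vector>

// Fold batch-norm parameters into convolution weights and bias so that
// conv(x, W', b') == batchnorm(conv(x, W, b)).
// For each output channel c:
//   scale_c = gamma_c / sqrt(var_c + eps)
//   W'_c    = W_c * scale_c
//   b'_c    = (b_c - mean_c) * scale_c + beta_c
// Illustrative sketch only; not the OpenCV dnn code.
void fuseBatchNormIntoConv(std::vector<float>& weights,     // [outCh * weightsPerChannel]
                           std::vector<float>& bias,        // [outCh]
                           const std::vector<float>& gamma, // BN scale, [outCh]
                           const std::vector<float>& beta,  // BN shift, [outCh]
                           const std::vector<float>& mean,  // BN running mean, [outCh]
                           const std::vector<float>& var,   // BN running variance, [outCh]
                           float eps = 1e-5f)
{
    const size_t outCh = bias.size();
    const size_t weightsPerChannel = weights.size() / outCh;
    for (size_t c = 0; c < outCh; ++c)
    {
        const float scale = gamma[c] / std::sqrt(var[c] + eps);
        for (size_t i = 0; i < weightsPerChannel; ++i)
            weights[c * weightsPerChannel + i] *= scale;
        bias[c] = (bias[c] - mean[c]) * scale + beta[c];
    }
}
```

A subsequent ReLU can then be fused as well by simply clamping the convolution's output to zero in the same output loop, which is why conv + batchnorm + ReLU collapses into a single layer.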
| Name |
| --- |
| caffe |
| layers |
| opencl |
| tensorflow |
| torch |
| dnn.cpp |
| halide_scheduler.cpp |
| halide_scheduler.hpp |
| init.cpp |
| op_halide.cpp |
| op_halide.hpp |
| precomp.hpp |