• Vadim Pisarevsky's avatar
    another round of dnn optimization (#9011) · 8b3d6603
    Vadim Pisarevsky authored
    * another round of dnn optimization:
    * increased malloc alignment across OpenCV from 16 to 64 bytes to make it AVX2 and even AVX-512 friendly
    * improved SIMD optimization of pooling layer, optimized average pooling
    * cleaned up convolution layer implementation
    * made activation layer "attacheable" to all other layers, including fully connected and addition layer.
    * fixed bug in the fusion algorithm: "LayerData::consumers" should not be cleared, because it desctibes the topology.
    * greatly optimized permutation layer, which improved SSD performance
    * parallelized element-wise binary/ternary/... ops (sum, prod, max)
    
    * also, added missing copyrights to many of the layer implementation files
    
    * temporarily disabled (again) the check for intermediate blobs consistency; fixed warnings from various builders
    8b3d6603
Name
Last commit
Last update
.github Loading commit data...
3rdparty Loading commit data...
apps Loading commit data...
cmake Loading commit data...
data Loading commit data...
doc Loading commit data...
include Loading commit data...
modules Loading commit data...
platforms Loading commit data...
samples Loading commit data...
.gitattributes Loading commit data...
.gitignore Loading commit data...
.tgitconfig Loading commit data...
CMakeLists.txt Loading commit data...
CONTRIBUTING.md Loading commit data...
LICENSE Loading commit data...
README.md Loading commit data...