• Vadim Pisarevsky's avatar
    another round of dnn optimization (#9011) · 8b3d6603
    Vadim Pisarevsky authored
    * another round of dnn optimization:
    * increased malloc alignment across OpenCV from 16 to 64 bytes to make it AVX2 and even AVX-512 friendly
    * improved SIMD optimization of pooling layer, optimized average pooling
    * cleaned up convolution layer implementation
    * made activation layer "attacheable" to all other layers, including fully connected and addition layer.
    * fixed bug in the fusion algorithm: "LayerData::consumers" should not be cleared, because it desctibes the topology.
    * greatly optimized permutation layer, which improved SSD performance
    * parallelized element-wise binary/ternary/... ops (sum, prod, max)
    
    * also, added missing copyrights to many of the layer implementation files
    
    * temporarily disabled (again) the check for intermediate blobs consistency; fixed warnings from various builders
    8b3d6603
reshape_layer.cpp 9.08 KB