• Vadim Pisarevsky's avatar
    another round of dnn optimization (#9011) · 8b3d6603
    Vadim Pisarevsky authored
    * another round of dnn optimization:
    * increased malloc alignment across OpenCV from 16 to 64 bytes to make it AVX2 and even AVX-512 friendly
    * improved SIMD optimization of pooling layer, optimized average pooling
    * cleaned up convolution layer implementation
    * made activation layer "attacheable" to all other layers, including fully connected and addition layer.
    * fixed bug in the fusion algorithm: "LayerData::consumers" should not be cleared, because it desctibes the topology.
    * greatly optimized permutation layer, which improved SSD performance
    * parallelized element-wise binary/ternary/... ops (sum, prod, max)
    
    * also, added missing copyrights to many of the layer implementation files
    
    * temporarily disabled (again) the check for intermediate blobs consistency; fixed warnings from various builders
    8b3d6603
Name
Last commit
Last update
..
calib3d Loading commit data...
core Loading commit data...
cudaarithm Loading commit data...
cudabgsegm Loading commit data...
cudacodec Loading commit data...
cudafeatures2d Loading commit data...
cudafilters Loading commit data...
cudaimgproc Loading commit data...
cudalegacy Loading commit data...
cudaobjdetect Loading commit data...
cudaoptflow Loading commit data...
cudastereo Loading commit data...
cudawarping Loading commit data...
cudev Loading commit data...
dnn Loading commit data...
features2d Loading commit data...
flann Loading commit data...
highgui Loading commit data...
imgcodecs Loading commit data...
imgproc Loading commit data...
java Loading commit data...
ml Loading commit data...
objdetect Loading commit data...
photo Loading commit data...
python Loading commit data...
shape Loading commit data...
stitching Loading commit data...
superres Loading commit data...
ts Loading commit data...
video Loading commit data...
videoio Loading commit data...
videostab Loading commit data...
viz Loading commit data...
world Loading commit data...
CMakeLists.txt Loading commit data...