• Vadim Pisarevsky's avatar
    some further optimizations and cleanups in dnn (#1237) · b593cae0
    Vadim Pisarevsky authored
    * some further optimizations and cleanups in dnn:
    + got rid of dnn::gemm; it's not perf critical anymore (perhaps)
    + embedded col2im functionality into convolution_layer.cpp, since it's not used anywhere else
    + parallel max pooling. even better performance can be achieved if we knew that max indices are not needed (and they are not needed in most networks)
    + somewhat optimized deconvolution layer: optimized bias addition (merged it with col2im), optimized col2im slightly.
    + hopefully fixed incorrect memory access in fully-connected layer; restored aligned memory reads (they should work fine now)
    
    * hopefully fixed regressions in ENet performance
    
    * fixed some typos in deconvolution; added SIMD optimization for the max pooling layer
    
    * fixed warnings in SIMD-less build configuration
    b593cae0
Name
Last commit
Last update
..
batch_norm_layer.cpp Loading commit data...
blank_layer.cpp Loading commit data...
concat_layer.cpp Loading commit data...
convolution_layer.cpp Loading commit data...
crop_layer.cpp Loading commit data...
detection_output_layer.cpp Loading commit data...
elementwise_layers.cpp Loading commit data...
eltwise_layer.cpp Loading commit data...
flatten_layer.cpp Loading commit data...
fully_connected_layer.cpp Loading commit data...
layers_common.avx2.cpp Loading commit data...
layers_common.cpp Loading commit data...
layers_common.hpp Loading commit data...
lrn_layer.cpp Loading commit data...
max_unpooling_layer.cpp Loading commit data...
mvn_layer.cpp Loading commit data...
normalize_bbox_layer.cpp Loading commit data...
padding_layer.cpp Loading commit data...
permute_layer.cpp Loading commit data...
pooling_layer.cpp Loading commit data...
prior_box_layer.cpp Loading commit data...
recurrent_layers.cpp Loading commit data...
reshape_layer.cpp Loading commit data...
scale_layer.cpp Loading commit data...
shift_layer.cpp Loading commit data...
slice_layer.cpp Loading commit data...
softmax_layer.cpp Loading commit data...
split_layer.cpp Loading commit data...