• Arjan van de Ven's avatar
    Merge pull request #10468 from fenrus75:avx512-2 · a75840d1
    Arjan van de Ven authored
    * Add a 512 bit codepath to the AVX512 fastConv function
    
    this patch adds a 512 wide codepath to the fastConv() function for
    AVX512 use.
    The basic idea is to process the first N * 16 elements of the vector
    with avx512, and then run the rest of the vector using the traditional
    AVX2 codepath.
    
    * dnn: use unaligned AVX512 load (OpenCV aligns data on 32-byte boundary)
    
    * dnn: change "vecsize" condition for AVX512
    
    * dnn: fix indentation
    a75840d1
Name
Last commit
Last update
..
batch_norm_layer.cpp Loading commit data...
blank_layer.cpp Loading commit data...
concat_layer.cpp Loading commit data...
convolution_layer.cpp Loading commit data...
crop_layer.cpp Loading commit data...
detection_output_layer.cpp Loading commit data...
elementwise_layers.cpp Loading commit data...
eltwise_layer.cpp Loading commit data...
flatten_layer.cpp Loading commit data...
fully_connected_layer.cpp Loading commit data...
layers_common.cpp Loading commit data...
layers_common.hpp Loading commit data...
layers_common.simd.hpp Loading commit data...
lrn_layer.cpp Loading commit data...
max_unpooling_layer.cpp Loading commit data...
mvn_layer.cpp Loading commit data...
normalize_bbox_layer.cpp Loading commit data...
padding_layer.cpp Loading commit data...
permute_layer.cpp Loading commit data...
pooling_layer.cpp Loading commit data...
prior_box_layer.cpp Loading commit data...
proposal_layer.cpp Loading commit data...
recurrent_layers.cpp Loading commit data...
region_layer.cpp Loading commit data...
reorg_layer.cpp Loading commit data...
reshape_layer.cpp Loading commit data...
resize_nearest_neighbor_layer.cpp Loading commit data...
scale_layer.cpp Loading commit data...
shift_layer.cpp Loading commit data...
slice_layer.cpp Loading commit data...
softmax_layer.cpp Loading commit data...
split_layer.cpp Loading commit data...