• Vadim Pisarevsky's avatar
    further improvements in split & merge; started using non-temporary store instructions (#12063) · 43820d89
    Vadim Pisarevsky authored
    * 1. changed static const __m128/256 to const __m128/256 to avoid wierd instructions and calls inserted by compiler.
    2. added universal intrinsics that wrap MOVNTPS and other such (non-temporary or "no cache" store) instructions. v_store_interleave() and v_store() got respective flags/overloaded variants
    3. rewrote split & merge to use the "no cache" store instructions. It resulted in dramatic performance improvement when processing big arrays
    
    * hopefully, fixed some test failures where 4-channel v_store_interleave() is used
    
    * added missing implementation of the new universal intrinsics (v_store_aligned_nocache() etc.)
    
    * fixed silly typo in the new intrinsics in intrin_vsx.hpp
    
    * still trying to fix VSX compiler errors
    
    * still trying to fix VSX compiler errors
    
    * still trying to fix VSX compiler errors
    
    * still trying to fix VSX compiler errors
    43820d89
Name
Last commit
Last update
..
calib3d Loading commit data...
core Loading commit data...
cudaarithm Loading commit data...
cudabgsegm Loading commit data...
cudacodec Loading commit data...
cudafeatures2d Loading commit data...
cudafilters Loading commit data...
cudaimgproc Loading commit data...
cudalegacy Loading commit data...
cudaobjdetect Loading commit data...
cudaoptflow Loading commit data...
cudastereo Loading commit data...
cudawarping Loading commit data...
cudev Loading commit data...
dnn Loading commit data...
features2d Loading commit data...
flann Loading commit data...
highgui Loading commit data...
imgcodecs Loading commit data...
imgproc Loading commit data...
java Loading commit data...
js Loading commit data...
ml Loading commit data...
objdetect Loading commit data...
photo Loading commit data...
python Loading commit data...
shape Loading commit data...
stitching Loading commit data...
superres Loading commit data...
ts Loading commit data...
video Loading commit data...
videoio Loading commit data...
videostab Loading commit data...
viz Loading commit data...
world Loading commit data...
CMakeLists.txt Loading commit data...