• Paul E. Murphy's avatar
    fast_math: implement optimized PPC routines · f38a61c6
    Paul E. Murphy authored
    Implement cvRound using inline asm. No compiler support
    exists today to properly optimize this. This results in
    about a 4x speedup over the default rounding. Likewise,
    simplify the growing number of rounding function overloads.
    
    For P9 enabled targets, utilize the classification
    testing instruction to test for Inf/Nan values. Operation
    speedup is about 1.2x for FP32, and 1.5x for FP64 operands.
    
    For P8 targets, fallback to the GCC nan inline. It provides
    a 1.1/1.4x improvement for FP32/FP64 arguments.
    f38a61c6
Name
Last commit
Last update
..
calib3d Loading commit data...
core Loading commit data...
cudaarithm Loading commit data...
cudabgsegm Loading commit data...
cudacodec Loading commit data...
cudafeatures2d Loading commit data...
cudafilters Loading commit data...
cudaimgproc Loading commit data...
cudalegacy Loading commit data...
cudaobjdetect Loading commit data...
cudaoptflow Loading commit data...
cudastereo Loading commit data...
cudawarping Loading commit data...
cudev Loading commit data...
dnn Loading commit data...
features2d Loading commit data...
flann Loading commit data...
highgui Loading commit data...
imgcodecs Loading commit data...
imgproc Loading commit data...
java Loading commit data...
js Loading commit data...
ml Loading commit data...
objdetect Loading commit data...
photo Loading commit data...
python Loading commit data...
shape Loading commit data...
stitching Loading commit data...
superres Loading commit data...
ts Loading commit data...
video Loading commit data...
videoio Loading commit data...
videostab Loading commit data...
viz Loading commit data...
world Loading commit data...
CMakeLists.txt Loading commit data...