• Paul E. Murphy's avatar
    fast_math: implement optimized PPC routines · f38a61c6
    Paul E. Murphy authored
    Implement cvRound using inline asm. No compiler support
    exists today to properly optimize this. This results in
    about a 4x speedup over the default rounding. Likewise,
    simplify the growing number of rounding function overloads.
    
    For P9 enabled targets, utilize the classification
    testing instruction to test for Inf/Nan values. Operation
    speedup is about 1.2x for FP32, and 1.5x for FP64 operands.
    
    For P8 targets, fallback to the GCC nan inline. It provides
    a 1.1/1.4x improvement for FP32/FP64 arguments.
    f38a61c6
Name
Last commit
Last update
.github Loading commit data...
3rdparty Loading commit data...
apps Loading commit data...
cmake Loading commit data...
data Loading commit data...
doc Loading commit data...
include Loading commit data...
modules Loading commit data...
platforms Loading commit data...
samples Loading commit data...
.editorconfig Loading commit data...
.gitattributes Loading commit data...
.gitignore Loading commit data...
CMakeLists.txt Loading commit data...
CONTRIBUTING.md Loading commit data...
LICENSE Loading commit data...
README.md Loading commit data...