- 07 Aug, 2019 3 commits
-
-
Paul E. Murphy authored
Implement cvRound using inline asm, since no compiler support exists today to optimize this properly. This results in about a 4x speedup over the default rounding. Likewise, simplify the growing number of rounding function overloads. For P9-enabled targets, use the classification testing instruction to test for Inf/NaN values. Operation speedup is about 1.2x for FP32 and 1.5x for FP64 operands. For P8 targets, fall back to the GCC nan inline, which provides a 1.1x/1.4x improvement for FP32/FP64 arguments.
-
Paul E. Murphy authored
Add a new macro definition OPENCV_USE_FASTMATH_GCC_BUILTINS to enable usage of GCC inline math functions, if available and requested by the user. Likewise, enable it for POWER. This is nearly always a substantial improvement over using integer manipulation, as most operations can be done in several instructions with no branching. The result is a 1.5-1.8x speedup in the ceil/floor operations, as tested with the AT 12.0-1 (GCC 8.3.1) compiler on P9 LE.
-
Paul E. Murphy authored
Add a basic sanity test to verify the rounding functions work as expected. Likewise, extend the rounding performance test to cover the additional float -> int fast math functions.
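
The trade-off described in the commits above can be illustrated with a minimal sketch. It is not OpenCV's actual code: the function names below are hypothetical, and the only compiler feature assumed is GCC's `__builtin_floor`/`__builtin_ceil`, which the compiler may expand inline to a rounding instruction on targets that have one. The first pair of functions uses the truncate-and-correct integer approach; the second pair delegates to the builtins.

```cpp
#include <cassert>
#include <cmath>

// Hypothetical names for illustration only; not OpenCV's cvFloor/cvCeil.

// Integer-manipulation variant: truncate toward zero, then correct
// the result when truncation moved it in the wrong direction.
inline int floor_by_truncation(double value)
{
    int i = static_cast<int>(value);
    return i - (i > value ? 1 : 0);
}

inline int ceil_by_truncation(double value)
{
    int i = static_cast<int>(value);
    return i + (i < value ? 1 : 0);
}

// Builtin variant: let the compiler choose the instruction sequence.
// Falls back to the standard library when the GCC builtins are unavailable.
inline int floor_by_builtin(double value)
{
#if defined(__GNUC__)
    return static_cast<int>(__builtin_floor(value));
#else
    return static_cast<int>(std::floor(value));
#endif
}

inline int ceil_by_builtin(double value)
{
#if defined(__GNUC__)
    return static_cast<int>(__builtin_ceil(value));
#else
    return static_cast<int>(std::ceil(value));
#endif
}

// Basic sanity check in the spirit of the test commit: both variants
// must agree on a few representative inputs.
inline void check_rounding_variants()
{
    const double samples[] = { -2.0, -1.5, -0.25, 0.0, 0.25, 1.5, 2.0 };
    for (double v : samples)
    {
        assert(floor_by_truncation(v) == floor_by_builtin(v));
        assert(ceil_by_truncation(v) == ceil_by_builtin(v));
    }
}
```

Whether the builtin variant is actually faster depends on the target and compiler flags; the speedups quoted above are the commit author's measurements on POWER, not a property of this sketch.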
-
- 25 Jul, 2019 8 commits
-
-
Alexander Alekhin authored
-
Alexander Alekhin authored
-
Chip Kerchner authored
* Unroll multiply and add instructions in dotProd_32f - 35% faster.
* Eliminate unnecessary v_reduce_sum instructions.
-
Alexander Alekhin authored
-
Dmitry Kurtaev authored
-
Alexander Alekhin authored
-
Alexander Alekhin authored
-
Alexander Alekhin authored
-
- 24 Jul, 2019 2 commits
-
-
Alexander Alekhin authored
-
Hugo Lindström authored
* Improve support for Windows Embedded Compact
* Remove redundant set(WINCE true) and format CMake
-
- 21 Jul, 2019 2 commits
-
-
Alexander Alekhin authored
-
Alexander Alekhin authored
-
- 20 Jul, 2019 2 commits
-
-
Dmitry Kurtaev authored
-
Alexander Alekhin authored
-
- 19 Jul, 2019 5 commits
-
-
Alexander Alekhin authored
-
Lubov Batanina authored
* Added support for the ONNX "ReduceMean" layer (as this is the same as GlobalAveragePool)
* Add ReduceMean test
* Fix ONNX importer
* Fix ReduceMean
* Add assert
* Split test
* Fix split test
-
Alexander Alekhin authored
-
Alexander Alekhin authored
-
Tomoaki Teshima authored
* use SOLVE_EPNP for the initial guess
-
- 18 Jul, 2019 5 commits
-
-
Alexander Alekhin authored
-
Alexander Alekhin authored
-
Vitaly Tuzov authored
WUI-based implementation of initUndistortRectifyMap (#14994)
* Add initUndistortRectifyMap performance test
* Move cv namespace boundaries
* Add wide universal intrinsics based implementation to initUndistortRectifyMap
* Dispatch undistort
-
Lubov Batanina authored
* Fix dumpToFile
* Add test
* Fix test
-
Alexander Alekhin authored
-
- 17 Jul, 2019 1 commit
-
-
Liubov Batanina authored
-
- 16 Jul, 2019 5 commits
-
-
Alexander Alekhin authored
-
Alexander Alekhin authored
-
Chip Kerchner authored
* Reduce store gathering pressures - speeds thresholds by up to 20%
* Rename temporary histogram array and initialize so that MACOSX builder is happy
-
Alexander Alekhin authored
- writeRaw(): support structs - readRaw(): 'len' is buffer limit in bytes (documentation is fixed)
-
Alexander Alekhin authored
-
- 15 Jul, 2019 1 commit
-
-
Alexander Alekhin authored
-
- 12 Jul, 2019 5 commits
-
-
Alexander Alekhin authored
-
Hugo Lindström authored
-
Alexander Alekhin authored
-
Vitaly Tuzov authored
-
Lubov Batanina authored
* Support Deconvolution3D on IE backend
* Add test tag
* Fix tests
-
- 11 Jul, 2019 1 commit
-
-
Alexander Alekhin authored
may cause crashes on ARM platform
-