1. 13 Sep, 2018 1 commit
  2. 10 Sep, 2018 1 commit
      added basic support for CV_16F (the new datatype etc.) (#12463) · 6d7f5871
      Vadim Pisarevsky authored
      * added basic support for CV_16F (the new datatype etc.). CV_USRTYPE1 is now equal to CV_16F, which may break some [rarely used] functionality. We'll see
      
      * fixed a just-introduced bug in norm; reverted erroneous changes in the Torch importer (need to find a better solution)
      
      * addressed some issues found during the PR review
      
      * restored the patch to fix some perf test failures
      6d7f5871
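Why reusing the CV_USRTYPE1 slot matters can be seen from how a matrix type code packs depth and channel count into one integer. The following is a minimal standalone sketch, not OpenCV's own headers: the function and enumerator names are hypothetical, and the numeric depth codes and the 3-bit shift are assumed to mirror the library's definitions, with code 7 being the one CV_16F now occupies.

```cpp
#include <cstdint>

// Depth codes (assumed to mirror OpenCV's; 7 was CV_USRTYPE1 before this commit).
enum Depth { D_8U = 0, D_8S = 1, D_16U = 2, D_16S = 3,
             D_32S = 4, D_32F = 5, D_64F = 6, D_16F = 7 };

// Number of low bits reserved for the depth (assumed, hypothetical name).
static const int kChannelShift = 3;

// Pack depth and channel count into one type code.
static inline int make_type(int depth, int channels) {
    return (depth & 7) + ((channels - 1) << kChannelShift);
}

// Recover the depth from a type code.
static inline int type_depth(int type) { return type & 7; }

// Recover the channel count from a type code.
static inline int type_channels(int type) { return (type >> kChannelShift) + 1; }

// Bytes per single channel element for each depth code.
static inline int elem_size1(int depth) {
    static const int sizes[8] = {1, 1, 2, 2, 4, 4, 8, 2};
    return sizes[depth & 7];
}
```

Since only three bits hold the depth, there is no spare code: any code that still treated depth 7 as a user-defined type would now see it interpreted as a 2-byte half-float element.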
  3. 06 Sep, 2018 1 commit
      Merge pull request #12411 from vpisarev:wide_convert · 80b62a41
      Vadim Pisarevsky authored
      * rewrote Mat::convertTo() and convertScaleAbs() to wide universal intrinsics; added always-available and SIMD-optimized FP16<=>FP32 conversion
      
      * fixed compile warnings
      
      * fix some more compile errors
      
      * slightly relaxed accuracy threshold for int->float conversion (since we now do it using single-precision arithmetic, not double-precision)
      
      * fixed compile errors on iOS, Android and in the baseline C++ version (intrin_cpp.hpp)
      
      * trying to fix ARM-neon builds
      
      * trying to fix ARM-neon builds
      
      * trying to fix ARM-neon builds
      
      * trying to fix ARM-neon builds
      80b62a41
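For readers unfamiliar with what an "always-available" FP16<=>FP32 conversion has to do when no hardware instruction exists, here is a scalar sketch of the bit manipulation involved. It is not the code from this commit: the function names are hypothetical, round-to-nearest-even is an assumed rounding mode, and NaN payloads are collapsed to a single quiet NaN.

```cpp
#include <cstdint>
#include <cstring>

// Convert IEEE 754 binary16 bits to a 32-bit float (hypothetical helper).
static float half_to_float(uint16_t h) {
    uint32_t sign = static_cast<uint32_t>(h & 0x8000u) << 16;
    uint32_t exp  = (h >> 10) & 0x1Fu;
    uint32_t mant = h & 0x3FFu;
    uint32_t bits;
    if (exp == 0) {
        if (mant == 0) {
            bits = sign;                              // signed zero
        } else {
            int shift = 0;                            // normalize a subnormal half
            while ((mant & 0x400u) == 0) { mant <<= 1; ++shift; }
            mant &= 0x3FFu;
            bits = sign | (static_cast<uint32_t>(113 - shift) << 23) | (mant << 13);
        }
    } else if (exp == 31) {
        bits = sign | 0x7F800000u | (mant << 13);     // infinity or NaN
    } else {
        bits = sign | ((exp + 112u) << 23) | (mant << 13);  // rebias 15 -> 127
    }
    float f;
    std::memcpy(&f, &bits, sizeof f);
    return f;
}

// Convert a 32-bit float to binary16 bits, rounding to nearest even (assumed mode).
static uint16_t float_to_half(float f) {
    uint32_t x;
    std::memcpy(&x, &f, sizeof x);
    uint32_t sign = (x >> 16) & 0x8000u;
    uint32_t mag  = x & 0x7FFFFFFFu;
    if (mag > 0x7F800000u) return static_cast<uint16_t>(sign | 0x7E00u);  // NaN
    if (mag >= 0x477FF000u) return static_cast<uint16_t>(sign | 0x7C00u); // >= 65520 -> inf
    if (mag <= 0x33000000u) return static_cast<uint16_t>(sign);           // <= 2^-25 -> zero
    int32_t  exp  = static_cast<int32_t>(mag >> 23) - 127 + 15;
    uint32_t mant = mag & 0x7FFFFFu;
    if (exp <= 0) {                                   // result is a subnormal half
        mant |= 0x800000u;                            // restore the implicit leading 1
        int shift = 14 - exp;                         // in [14, 24]
        uint32_t hm      = mant >> shift;
        uint32_t rem     = mant & ((1u << shift) - 1u);
        uint32_t halfway = 1u << (shift - 1);
        if (rem > halfway || (rem == halfway && (hm & 1u))) ++hm;
        return static_cast<uint16_t>(sign | hm);
    }
    uint32_t h   = (static_cast<uint32_t>(exp) << 10) | (mant >> 13);
    uint32_t rem = mant & 0x1FFFu;
    if (rem > 0x1000u || (rem == 0x1000u && (h & 1u))) ++h;  // may carry into the exponent
    return static_cast<uint16_t>(sign | h);
}
```

A SIMD version would apply the same steps to several lanes at once, or defer to a hardware conversion instruction where the CPU provides one.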
  4. 04 Sep, 2018 1 commit
  5. 02 Sep, 2018 1 commit
  6. 01 Aug, 2018 1 commit
      Fix for SSE2 intrinsics problem in the part of saturation arithmetic processing… · 3ea2586a
      amatyuko authored
      Fix for an SSE2 intrinsics problem in the saturation arithmetic used during 32s->16u packed conversion:
      for large negative values less than -INT_MAX+32767, the sign of the number is lost due to overflow, which leads to
      incorrect saturation to the MAX value instead of zero.
      The issue does not reproduce with CV_ENABLED_INTRINSICS=OFF
      3ea2586a
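One plausible way such a bug arises can be shown with a scalar model. The commit message does not include the code, so the bias trick below is an assumption: it emulates an unsigned 32->16 pack by subtracting 32768, applying a signed saturating pack, and adding 32768 back. All function names are hypothetical.

```cpp
#include <cstdint>
#include <algorithm>

// Reference behaviour: clamp a signed 32-bit value to [0, 65535].
static uint16_t saturate_u16(int32_t v) {
    return static_cast<uint16_t>(std::min<int32_t>(std::max<int32_t>(v, 0), 65535));
}

// What a signed saturating 32->16 pack does to one lane.
static int16_t pack_s16(int32_t v) {
    return static_cast<int16_t>(std::min<int32_t>(std::max<int32_t>(v, -32768), 32767));
}

// Assumed emulation of an unsigned pack via a bias (hypothetical, for illustration).
static uint16_t biased_pack_u16(int32_t v) {
    // SIMD integer subtraction wraps around, so model it with unsigned arithmetic.
    // (Converting back to int32_t is modular on all mainstream compilers.)
    int32_t biased = static_cast<int32_t>(static_cast<uint32_t>(v) - 32768u);
    int16_t packed = pack_s16(biased);
    // Undo the bias; the result is truncated to 16 bits.
    return static_cast<uint16_t>(static_cast<uint16_t>(packed) + 32768u);
}
```

In this model every input from INT32_MIN + 32768 upward matches the reference clamp, while anything below that (that is, below -INT_MAX+32767) wraps to a large positive value during the subtraction and saturates to 65535 instead of 0.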
  7. 31 May, 2018 1 commit
  8. 18 Apr, 2018 1 commit
  9. 07 Apr, 2018 1 commit
  10. 06 Mar, 2018 1 commit
  11. 12 Feb, 2018 2 commits
  12. 14 Dec, 2017 1 commit
      core: remove raw SSE2/NEON implementation from convert.cpp (#9831) · ca1a0a11
      Tomoaki Teshima authored
      * remove raw SSE2/NEON implementation from convert.cpp
        * remove raw implementation from Cvt_SIMD
        * remove raw implementation from cvtScale_SIMD
        * remove raw implementation from cvtScaleAbs_SIMD
        * remove duplicated implementation cvt_<float, short>
        * remove duplicated implementation cvtScale_<short, short, float>
        * add "from double" version of Cvt_SIMD
        * modify the condition of test ConvertScaleAbs
      
      * Update convert.cpp
      
      fixed crash in cvtScaleAbs(8s=>8u)
      
      * fixed compile error on Win32
      
      * fixed several test failures because of accuracy loss in cvtScale(int=>int)
      
      * fixed NEON implementation of v_cvt_f64(int=>double) intrinsic
      
      * another attempt to fix test failures
      
      * keep trying to fix the test failures and the just-introduced compile warnings
      
      * fixed one remaining test (subtractScalar)
      ca1a0a11
  13. 28 Nov, 2017 1 commit
      ocl: avoid unnecessary loading/initializing OpenCL subsystem · 0ed3209b
      Alexander Alekhin authored
      Applies when the application makes no OpenCL/UMat method calls.
      
      The OpenCL subsystem is initialized only when:
      - haveOpenCL() is called from the application
      - useOpenCL() is called from the application
      - the OpenCL allocator is accessed: a UMat is created (an empty UMat is ignored) or UMat <-> Mat conversions are called
      
      OpenCL functions are not called if OPENCV_OPENCL_RUNTIME=disabled
      (independent of the OpenCL linkage type)
      0ed3209b
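The behaviour described in this commit is a form of lazy initialization. The sketch below illustrates the general pattern only: the struct and its members are hypothetical, and the availability probe is a placeholder. Only the OPENCV_OPENCL_RUNTIME variable name and its "disabled" value come from the commit message.

```cpp
#include <cstdlib>
#include <cstring>

// Hypothetical stand-in for a runtime that is costly to load.
struct LazyRuntime {
    bool initialized = false;  // becomes true on first real use
    bool available   = false;  // result of the (placeholder) probe
    int  init_count  = 0;      // how many times initialization actually ran

    // Query entry point: triggers initialization on first call only.
    bool haveRuntime() {
        ensureInit();
        return available;
    }

private:
    void ensureInit() {
        if (initialized) return;           // already done, nothing to load
        initialized = true;
        ++init_count;
        const char* env = std::getenv("OPENCV_OPENCL_RUNTIME");
        if (env && std::strcmp(env, "disabled") == 0) {
            available = false;             // user opted out: never touch the runtime
            return;
        }
        available = true;                  // placeholder for a real device/platform probe
    }
};
```

An application that never calls the query entry point never pays the loading cost. A multi-threaded version would need to guard the first call, for example with `std::call_once`.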
  14. 23 Aug, 2017 1 commit
      ICV2017u3 package update; · a57718e1
      Pavel Vlasov authored
      - Optimizations set change. Now IPP integrations will provide code for SSE42, AVX2 and AVX512 (SKX) CPUs only. For HW below SSE42 IPP code is disabled.
      - Performance regression fixes for IPP code paths;
      - cv::boxFilter integration improvement;
      - cv::filter2D integration improvement;
      a57718e1
  15. 17 Jul, 2017 1 commit
  16. 04 Jul, 2017 1 commit
  17. 12 Jun, 2017 1 commit
  18. 06 Jun, 2017 1 commit
  19. 23 May, 2017 1 commit
  20. 25 Apr, 2017 1 commit
      Update for IPP for OpenCV 2017u2 integration; · 11c2ffaf
      Pavel Vlasov authored
      Updated integrations for:
      cv::split
      cv::merge
      cv::insertChannel
      cv::extractChannel
      cv::Mat::convertTo - now with scaled conversions support
      cv::LUT - disabled due to performance issues
      Mat::copyTo
      Mat::setTo
      cv::flip
      cv::copyMakeBorder - currently disabled
      cv::polarToCart
      cv::pow - ipp pow function was removed due to performance issues
      cv::hal::magnitude32f/64f - disabled for <= SSE42, poor performance
      cv::countNonZero
      cv::minMaxIdx
      cv::norm
      cv::canny - new integration. Disabled for threaded;
      cv::cornerHarris
      cv::boxFilter
      cv::bilateralFilter
      cv::integral
      11c2ffaf
  21. 20 Apr, 2017 1 commit
  22. 19 Apr, 2017 1 commit
  23. 06 Apr, 2017 1 commit
  24. 21 Feb, 2017 1 commit
  25. 16 Dec, 2016 2 commits
  26. 14 Dec, 2016 1 commit
  27. 09 Dec, 2016 1 commit
  28. 06 Dec, 2016 1 commit
  29. 02 Dec, 2016 1 commit
  30. 29 Nov, 2016 3 commits
  31. 29 Sep, 2016 1 commit
  32. 23 Sep, 2016 1 commit
      check FP16 build condition correctly · c7cb116d
      Tomoaki Teshima authored
        * use __GNUC_MINOR__ in correct place to check the version of GCC
        * check processor support of FP16 at run time
        * check compiler support of FP16 and pass correct compiler option
        * rely on ENABLE_AVX on gcc since AVX code is generated when -mf16c is passed
        * guard correctly using ifdef in case of various configuration
        * use v_float16x4 correctly by including the right header file
      c7cb116d
  33. 04 Sep, 2016 1 commit
      use universal intrinsic for FP16 · 903789f7
      Tomoaki Teshima authored
        * use v_float16x4 (universal intrinsic) instead of raw SSE/NEON implementation
        * define v_load_f16/v_store_f16 since v_load can't be distinguished when a short pointer is passed
        * brush up implementation for old compilers (guard correctly)
        * add test for v_load_f16 and round trip conversion of v_float16x4
        * fix conversion error
      903789f7
  34. 24 Aug, 2016 1 commit
  35. 19 Aug, 2016 1 commit
  36. 09 Aug, 2016 1 commit