Commits · 8b87c4b96a1c1025099cf0dd1cfd6b09139202c6 · submodule / opencv

14 Dec, 2017 1 commit

core: remove raw SSE2/NEON implementation from convert.cpp (#9831) · ca1a0a11

Tomoaki Teshima authored 7 years ago

* remove raw SSE2/NEON implementation from convert.cpp
  * remove raw implementation from Cvt_SIMD
  * remove raw implementation from cvtScale_SIMD
  * remove raw implementation from cvtScaleAbs_SIMD
  * remove duplicated implementation cvt_<float, short>
  * remove duplicated implementation cvtScale_<short, short, float>
  * add "from double" version of Cvt_SIMD
  * modify the condition of test ConvertScaleAbs

* Update convert.cpp

fixed crash in cvtScaleAbs(8s=>8u)

* fixed compile error on Win32

* fixed several test failures because of accuracy loss in cvtScale(int=>int)

* fixed NEON implementation of v_cvt_f64(int=>double) intrinsic

* another attempt to fix test failures

* keep trying to fix the test failures and just introduced compile warnings

* fixed one remaining test (subtractScalar)
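
For readers unfamiliar with the 8s=>8u path mentioned above, the sketch below shows the scalar behaviour such a conversion is expected to have, assuming the documented semantics of cv::convertScaleAbs (scale, take the absolute value, saturate to 8 bits). The function names `scaleAbsToU8` and `cvtScaleAbs8s8u` are hypothetical and are not taken from convert.cpp.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <cstdint>

// Saturating |v * alpha + beta| for one signed 8-bit sample (hypothetical helper).
inline std::uint8_t scaleAbsToU8(std::int8_t v, double alpha, double beta) {
    const double r = std::fabs(v * alpha + beta);
    if (!(r < 255.0)) return 255;  // clamp large values (and NaN) before rounding
    return static_cast<std::uint8_t>(std::lrint(r));  // round to nearest integer
}

// Scalar reference loop; a SIMD version would have to match it element by element.
inline void cvtScaleAbs8s8u(const std::int8_t* src, std::uint8_t* dst,
                            std::size_t n, double alpha, double beta) {
    assert(src != nullptr && dst != nullptr);
    for (std::size_t i = 0; i < n; ++i)
        dst[i] = scaleAbsToU8(src[i], alpha, beta);
}
```

A scalar reference like this is the kind of baseline a vectorized implementation can be compared against in a test.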


28 Nov, 2017 1 commit

ocl: avoid unnecessary loading/initializing OpenCL subsystem · 0ed3209b

Alexander Alekhin authored 7 years ago

The OpenCL subsystem is no longer loaded or initialized if the application makes no OpenCL/UMat method calls.

The OpenCL subsystem is initialized when any of the following happens:
- haveOpenCL() is called from the application
- useOpenCL() is called from the application
- the OpenCL allocator is accessed: a UMat is created (an empty UMat is ignored) or a UMat <-> Mat conversion is called

Don't call OpenCL functions if OPENCV_OPENCL_RUNTIME=disabled
(independent of the OpenCL linkage type)
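
The behaviour described in this message is a lazy-initialization pattern. The sketch below illustrates the idea only: `LazyRuntime` is a hypothetical class, not OpenCV's implementation, and the environment value is passed in as a parameter so the example stays self-contained.

```cpp
#include <cassert>
#include <cstring>

// Hypothetical stand-in for an expensive subsystem; not OpenCV's actual class.
class LazyRuntime {
public:
    // envValue mimics the value of OPENCV_OPENCL_RUNTIME (nullptr if unset).
    explicit LazyRuntime(const char* envValue)
        : disabled_(envValue != nullptr && std::strcmp(envValue, "disabled") == 0) {}

    // The first query triggers initialization; later queries reuse the result.
    bool available() {
        if (!initialized_) {
            initialized_ = true;
            available_ = disabled_ ? false : load();
        }
        assert(!disabled_ || !available_);  // a disabled runtime is never available
        return available_;
    }

    bool initialized() const { return initialized_; }
    int loadCount() const { return loadCount_; }

private:
    // Placeholder for the costly step (loading libraries, creating contexts).
    bool load() {
        ++loadCount_;
        return true;
    }

    bool disabled_;
    bool initialized_ = false;
    bool available_ = false;
    int loadCount_ = 0;
};
```

In a real program the constructor argument would typically come from `std::getenv("OPENCV_OPENCL_RUNTIME")`. Nothing is loaded until `available()` is first called, and the costly step is skipped entirely when the value is `disabled`.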


23 Aug, 2017 1 commit

ICV2017u3 package update; · a57718e1

Pavel Vlasov authored 7 years ago

- Optimization set changed: IPP integrations now provide code for SSE42, AVX2 and AVX512 (SKX) CPUs only. IPP code is disabled for hardware below SSE42.
- Performance regression fixes for IPP code paths.
- cv::boxFilter integration improvement.
- cv::filter2D integration improvement.


17 Jul, 2017 1 commit
- core: fix convertTo() AVX2 optimization · b4716b1d
  Alexander Alekhin authored 7 years ago
  
04 Jul, 2017 1 commit
- AVX and SSE4.1 optimized conversion implementations migrated to separate files · 5448d918
  Vitaly Tuzov authored 7 years ago
  
12 Jun, 2017 1 commit
- suppress unreachable code warning · 94848a3e
  Tomoaki Teshima authored 7 years ago
  - fix the define condition based on the comment
06 Jun, 2017 1 commit

update convertFp16 using CV_CPU_CALL_FP16 · e269ef96

Tomoaki Teshima authored 7 years ago

 * avoid link error (move the implementation of software version to header)
 * make getConvertFuncFp16 local (move from precomp.hpp to convert.hpp)
 * fix error on 32bit x86


23 May, 2017 1 commit
- add OpenCL version of convertFp16 and test · d81cdb8e
  Tomoaki Teshima authored 7 years ago
  * disable vector operation for now
  * brush up the implementation based on comment
25 Apr, 2017 1 commit

Update for IPP for OpenCV 2017u2 integration; · 11c2ffaf

Pavel Vlasov authored 7 years ago

Updated integrations for:
- cv::split
- cv::merge
- cv::insertChannel
- cv::extractChannel
- cv::Mat::convertTo - now with scaled conversions support
- cv::LUT - disabled due to performance issues
- Mat::copyTo
- Mat::setTo
- cv::flip
- cv::copyMakeBorder - currently disabled
- cv::polarToCart
- cv::pow - IPP pow function was removed due to performance issues
- cv::hal::magnitude32f/64f - disabled for <= SSE42, poor performance
- cv::countNonZero
- cv::minMaxIdx
- cv::norm
- cv::Canny - new integration. Disabled for threaded
- cv::cornerHarris
- cv::boxFilter
- cv::bilateralFilter
- cv::integral


20 Apr, 2017 1 commit
- IPP for OpenCV 2017u2 initial enabling patch; · 35c72168
  Pavel Vlasov authored 7 years ago
  
19 Apr, 2017 1 commit

Merge pull request #8535 from arnaudbrejeon:std_array · 636ab095

Arnaud Brejeon authored 7 years ago

Add support for std::array<T, N> (#8535)

* Add support for std::array<T, N>

* Add std::array<Mat, N> support

* Remove UMat constructor with std::array parameter
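
As an illustration of what accepting `std::array<T, N>` alongside other containers can look like, here is a minimal sketch using a hypothetical non-owning `ArrayView` type. It is not OpenCV's `InputArray`; it only shows the converting-constructor technique under that assumption.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical read-only view that several container types convert to.
template <typename T>
class ArrayView {
public:
    ArrayView(const T* data, std::size_t size) : data_(data), size_(size) {
        assert(data != nullptr || size == 0);
    }
    ArrayView(const std::vector<T>& v) : data_(v.data()), size_(v.size()) {}

    // The array length N is deduced at compile time from the argument.
    template <std::size_t N>
    ArrayView(const std::array<T, N>& a) : data_(a.data()), size_(N) {}

    const T* begin() const { return data_; }
    const T* end() const { return data_ + size_; }
    std::size_t size() const { return size_; }

private:
    const T* data_;
    std::size_t size_;
};

// One function signature now serves vectors and fixed-size arrays alike.
inline long sum(ArrayView<int> values) {
    long total = 0;
    for (int v : values) total += v;
    return total;
}
```

The view does not own its data, so the container passed in must outlive the call.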


06 Apr, 2017 1 commit
- Extended set of OpenVX HAL calls disabled for small images · bf5b7843
  Vitaly Tuzov authored 8 years ago
  
21 Feb, 2017 1 commit
- OpenVX calls updated to use single common OpenVX context per thread · 9a4b5a45
  Vitaly Tuzov authored 8 years ago
  
16 Dec, 2016 2 commits
- openvx_cvt disabled for Khronos, fixed sstep and dstep usage · fcdbe162
  Rostislav Vasilikhin authored 8 years ago
  
- OpenVX convert enabled · cf5e976f
  Rostislav Vasilikhin authored 8 years ago
  
14 Dec, 2016 1 commit
- OpenVX wrappers rewritten with CV_OVX_RUN, VX_DbgThrow · 8b9422a0
  Rostislav Vasilikhin authored 8 years ago
  
09 Dec, 2016 1 commit
- trying to enable canny_vx adding a new test comparing canny_cv vs canny_vx · 76c38f0c
  apavlenko authored 8 years ago
  
06 Dec, 2016 1 commit

Merge pull request #7794 from savuor:fix/ovx_cvt_continuous · 695b2017

Rostislav Vasilikhin authored 8 years ago

Fixed OpenVX wrapper for Mat::convertTo() (#7794)

* fixed for cases of unrolled (w*h x 1) matrices

* more error handling


02 Dec, 2016 1 commit
- Added OpenVX based processing to LUT · ced81f72
  Vitaly Tuzov authored 8 years ago
  
29 Nov, 2016 3 commits
- fixed: data types, empty input case · 2b56b174
  Rostislav Vasilikhin authored 8 years ago
  
- added OpenVX call to Mat::convertTo() (w/o scaling) · 0a695881
  Rostislav Vasilikhin authored 8 years ago
  
- Solve exception for 3D Mat · c56c0e14
  LaurentBerger authored 8 years ago
  
29 Sep, 2016 1 commit
- build: fix aarch64 build with aarch64-linux-gnu-g++-4.8 · a9ab629f
  Alexander Alekhin authored 8 years ago
  
23 Sep, 2016 1 commit

check FP16 build condition correctly · c7cb116d

Tomoaki Teshima authored 8 years ago

  * use __GNUC_MINOR__ in correct place to check the version of GCC
  * check processor support of FP16 at run time
  * check compiler support of FP16 and pass correct compiler option
  * rely on ENABLE_AVX on gcc since AVX is generated when -mf16c is passed
  * guard correctly using ifdef in case of various configurations
  * use v_float16x4 correctly by including the right header file


04 Sep, 2016 1 commit

use universal intrinsic for FP16 · 903789f7

Tomoaki Teshima authored 8 years ago

  * use v_float16x4 (universal intrinsic) instead of raw SSE/NEON implementation
  * define v_load_f16/v_store_f16 since v_load can't be distinguished when a short pointer is passed
  * brush up implementation on old compiler (guard correctly)
  * add test for v_load_f16 and round trip conversion of v_float16x4
  * fix conversion error


24 Aug, 2016 1 commit
- brush up fp16 implementation · c5d7791b
  Tomoaki Teshima authored 8 years ago
  * DRY
  * switch to Cv32suf and remove fp32Int32
  * add Cv16suf
19 Aug, 2016 1 commit
- Instrumentation for OpenCV API regions and IPP functions; · 30a6cee2
  Pavel Vlasov authored 8 years ago
  
09 Aug, 2016 1 commit

fix build error on JetsonTK1 · 3debc78a

Tomoaki Teshima authored 8 years ago

  * avoid using vld1_f16 and vst1_f16 on gcc 4 series (Ubuntu 14.04)
  * guard correctly with #if
  * use static inline


03 Aug, 2016 1 commit

brush up convertFp16 · 87ca607f

Tomoaki Teshima authored 8 years ago

  * raise an error when the wrong bit depth is passed
  * raise a build error when the wrong depth is specified for cvtScaleHalf_
  * remove unnecessary safe check in cvtScaleHalf_
  * use intrinsic instead of direct pointer access
  * update the explanation


29 Jul, 2016 1 commit
- show CPU feature correctly when FP16 is available · c57f8780
  Tomoaki Teshima authored 8 years ago
  * make sure that CV_FP16 has the correct meaning
  * check FP16 feature correctly
20 Jul, 2016 1 commit
- fix android pack build · 2ec63e4d
  Alexander Alekhin authored 8 years ago
  
08 Jul, 2016 1 commit
- bigdata: add test, resolve split/merge issue · 5f269d08
  Alexander Alekhin authored 8 years ago
  
08 Jun, 2016 1 commit
- fix run time error on Mac · d0a83909
  Tomoaki Teshima authored 8 years ago
  * integrate HW version and SW version to same function
07 Jun, 2016 2 commits

fix to support a wider range of compilers · fd76ed5c

Tomoaki Teshima authored 8 years ago

  * check compiler more strictly
  * use gcc version of fp16 conversion if it's possible (gcc 4.7 and later)
  * use current SW implementation in other cases


fix warning · 6f6eebbc
Tomoaki Teshima authored 8 years ago


06 Jun, 2016 1 commit
- fix corner case when number is small · fbfd3158
  Tomoaki Teshima authored 8 years ago
  
05 Jun, 2016 1 commit
- follow other interface · eccf2fa4
  Tomoaki Teshima authored 8 years ago
  * remove useHW option
  * update test
21 May, 2016 1 commit

add feature to convert FP32(float) to FP16(half) · b2ad7cd9

Tomoaki Teshima authored 8 years ago

  * check compiler support
  * check HW support before executing
  * add test doing round trip conversion from / to FP32
  * handle arrays correctly if the size is not a multiple of 4
  * add declaration to prevent warning
  * make it possible to enable fp16 on 32bit ARM
  * make the conversion possible on unsupported HW, too
  * add test using both HW and SW implementation
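
To make the software fallback and the "not a multiple of 4" handling concrete, here is a minimal sketch of an FP32 to FP16 conversion with round-to-nearest-even. It is an independent illustration, not the code from this commit: the names `floatToHalf`, `halfToFloat` and `convertToHalf` are hypothetical, and the inner 4-element loop only stands in for a 4-lane hardware conversion.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <cstdint>
#include <cstring>

// Software FP32 -> FP16 conversion with round-to-nearest-even.
inline std::uint16_t floatToHalf(float f) {
    std::uint32_t x;
    std::memcpy(&x, &f, sizeof(x));                  // reinterpret the bits safely
    const std::uint32_t sign = (x >> 16) & 0x8000u;
    const std::uint32_t absx = x & 0x7FFFFFFFu;

    if (absx > 0x7F800000u)                          // NaN stays a quiet NaN
        return static_cast<std::uint16_t>(sign | 0x7E00u);
    if (absx >= 0x47800000u)                         // |f| >= 65536 or Inf -> Inf
        return static_cast<std::uint16_t>(sign | 0x7C00u);
    if (absx >= 0x38800000u) {                       // |f| >= 2^-14: normal half
        const std::uint32_t lsb = (absx >> 13) & 1u; // tie-breaking bit
        const std::uint32_t rounded = absx - 0x38000000u + 0x0FFFu + lsb;
        return static_cast<std::uint16_t>(sign | (rounded >> 13));
    }
    // Below 2^-14 the result is a half subnormal or zero.
    const std::uint32_t exp = absx >> 23;
    if (exp < 102u)                                  // below 2^-25: rounds to zero
        return static_cast<std::uint16_t>(sign);
    const std::uint32_t mant = (absx & 0x007FFFFFu) | 0x00800000u;
    const std::uint32_t shift = 126u - exp;          // in the range 14..24
    std::uint32_t q = mant >> shift;
    const std::uint32_t rem = mant & ((1u << shift) - 1u);
    const std::uint32_t half = 1u << (shift - 1u);
    if (rem > half || (rem == half && (q & 1u))) ++q;
    return static_cast<std::uint16_t>(sign | q);
}

// FP16 -> FP32 is exact, so no rounding is needed.
inline float halfToFloat(std::uint16_t h) {
    const std::uint32_t sign = (h & 0x8000u) << 16;
    const std::uint32_t exp = (h >> 10) & 0x1Fu;
    const std::uint32_t mant = h & 0x03FFu;
    if (exp == 0u) {                                 // zero or subnormal: mant * 2^-24
        const float v = std::ldexp(static_cast<float>(mant), -24);
        return sign ? -v : v;
    }
    const std::uint32_t bits = (exp == 0x1Fu)
        ? (sign | 0x7F800000u | (mant << 13))        // Inf or NaN
        : (sign | ((exp + 112u) << 23) | (mant << 13));
    float f;
    std::memcpy(&f, &bits, sizeof(f));
    return f;
}

// Processes full groups of 4, then the remaining 0..3 elements one by one.
inline void convertToHalf(const float* src, std::uint16_t* dst, std::size_t n) {
    assert(src != nullptr && dst != nullptr);
    std::size_t i = 0;
    for (; i + 4 <= n; i += 4)                       // stand-in for a 4-lane SIMD body
        for (std::size_t lane = 0; lane < 4; ++lane)
            dst[i + lane] = floatToHalf(src[i + lane]);
    for (; i < n; ++i)                               // scalar tail
        dst[i] = floatToHalf(src[i]);
}
```

Values that FP16 can represent exactly survive a round trip unchanged, which is the property a round-trip test can check.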


25 Dec, 2015 1 commit
- fix normalize in case of inplace operations · 6997d423
  Alexander Alekhin authored 9 years ago
  fixes #5876
03 Dec, 2015 1 commit

HAL: improvements · b4bcdd10

Maksim Shabunin authored 9 years ago

- added new functions from core module: split, merge, add, sub, mul, div, ...
- added function replacement mechanism
- added example of HAL replacement library
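
One common way to build a function replacement mechanism is a hook that the library calls first and that may decline, in which case the built-in code runs. The sketch below shows that pattern under stated assumptions: every name here (`hal_add8u`, `vendor_add8u`, `HAL_NOT_IMPLEMENTED`, and so on) is hypothetical and chosen for illustration, not copied from OpenCV's headers.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

enum HalStatus { HAL_OK = 0, HAL_NOT_IMPLEMENTED = 1 };

// Default hook shipped with the library: always declines.
inline int hal_default_add8u(const std::uint8_t*, const std::uint8_t*,
                             std::uint8_t*, std::size_t) {
    return HAL_NOT_IMPLEMENTED;
}
#define hal_add8u hal_default_add8u

// ---- what a replacement library's header might contain (hypothetical) ----
static int g_replacementHits = 0;  // counts calls the replacement handled
static int vendor_add8u(const std::uint8_t* a, const std::uint8_t* b,
                        std::uint8_t* dst, std::size_t n) {
    if (n < 4) return HAL_NOT_IMPLEMENTED;  // decline inputs it does not handle
    for (std::size_t i = 0; i < n; ++i) {
        const unsigned s = static_cast<unsigned>(a[i]) + b[i];
        dst[i] = static_cast<std::uint8_t>(s > 255u ? 255u : s);
    }
    ++g_replacementHits;
    return HAL_OK;
}
#undef hal_add8u
#define hal_add8u vendor_add8u
// ---- end of replacement header ----

// Library entry point: try the hook first, fall back to the built-in loop.
inline void add8u(const std::uint8_t* a, const std::uint8_t* b,
                  std::uint8_t* dst, std::size_t n) {
    assert(a != nullptr && b != nullptr && dst != nullptr);
    if (hal_add8u(a, b, dst, n) == HAL_OK) return;
    for (std::size_t i = 0; i < n; ++i) {
        const unsigned s = static_cast<unsigned>(a[i]) + b[i];
        dst[i] = static_cast<std::uint8_t>(s > 255u ? 255u : s);
    }
}
```

Because the hook is selected at build time, callers of `add8u` do not change when a replacement library is added or removed, and results stay the same whichever path runs.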
