Commits · d1a3b530befc66b643188b9e774f30b1884bf266 · submodule / opencv

16 Jan, 2018 1 commit

Propagate calculated Gaussian kernel size(ref). Otherwise, ipp_GaussianBlur will… · 20b32612

Woody Chow authored 7 years ago

Propagate calculated Gaussian kernel size(ref). Otherwise, ipp_GaussianBlur will fail if user doesn't specify a kernel size. (#10579)

20b32612

15 Dec, 2017 1 commit
- Fixed 3 issues found by static analysis · 1033f2b1
  Maksim Shabunin authored 7 years ago
  
  1033f2b1
01 Dec, 2017 7 commits
- remove matrix release · 7bfb3805
  elenagvo authored 7 years ago
  
  7bfb3805
- fix the parameters order · 81519537
  elenagvo authored 7 years ago
  
  81519537
- fix accelerators order · 0f12351a
  elenagvo authored 7 years ago
  
  0f12351a
- remove complex data structs · 7aadbc96
  elenagvo authored 7 years ago
  
  7aadbc96
- call HAL for GaussianBlur is fixed · ce659756
  elenagvo authored 7 years ago
  
  ce659756
- add hal for GaussianBlur · c2c73331
  elenagvo authored 7 years ago
  
  c2c73331
- add HAL for BoxFilter · cb9e110a
  elenagvo authored 7 years ago
  
  cb9e110a
28 Nov, 2017 1 commit

ocl: avoid unnecessary loading/initializing OpenCL subsystem · 0ed3209b

Alexander Alekhin authored 7 years ago

If there are no OpenCL/UMat methods calls from application.

OpenCL subsystem is initialized:
- haveOpenCL() is called from application
- useOpenCL() is called from application
- access to OpenCL allocator: UMat is created (empty UMat is ignored) or UMat <-> Mat conversions are called

Don't call OpenCL functions if OPENCV_OPENCL_RUNTIME=disabled
(independent from OpenCL linkage type)

0ed3209b

23 Nov, 2017 1 commit
- fix the parameters order · 5d0a8d2a
  elenagvo authored 7 years ago
  
  5d0a8d2a
20 Nov, 2017 1 commit
- add HAL for medianBlur · 3a09da71
  elenagvo authored 7 years ago
  
  3a09da71
17 Nov, 2017 1 commit
- change border type for medianBlur to BORDER_ISOLATED · 20c08eab
  elenagvo authored 7 years ago
  
  20c08eab
27 Oct, 2017 1 commit
- imgproc: fix bilateral filter SIMD 32f optimization · cc9ab7e5
  Alexander Alekhin authored 7 years ago
  
  cc9ab7e5
28 Sep, 2017 1 commit

Merge pull request #9714 from tomoaki0705:universalBilateral · 139b3273

Tomoaki Teshima authored 7 years ago

imgproc: use universal intrinsic as much as possible (#9714)

* use universal intrinsic as much as possible
  * make SSE3 part as common as possible with universal intrinsic implementation
  * put the reducing part out of the main loop

* follow the comment
  * fix the typo
  * use v_reduce_sum4

* follow the comment again
  * remove all CV_SSE3 part from smooth.cpp

139b3273

22 Sep, 2017 1 commit
- replace raw SSE2/NEON implementation with universal intrinsic · e932160a
  Tomoaki Teshima authored 7 years ago
  
  e932160a
08 Sep, 2017 1 commit
- Fixed some issues found by static analysis · 248e2c7d
  Maksim Shabunin authored 7 years ago
  
  248e2c7d
23 Aug, 2017 1 commit

ICV2017u3 package update; · a57718e1

Pavel Vlasov authored 7 years ago

- Optimizations set change. Now IPP integrations will provide code for SSE42, AVX2 and AVX512 (SKX) CPUs only. For HW below SSE42 IPP code is disabled.
- Performance regressions fixes for IPP code paths;
- cv::boxFilter integration improvement;
- cv::filter2D integration improvement;

a57718e1

01 Aug, 2017 1 commit

Merge pull request #8951 from hrnr:akaze_part2 · bb6496d9

Jiri Horner authored 7 years ago

[GSOC] Speeding-up AKAZE, part #2 (#8951)

* feature2d: instrument more functions used in AKAZE

* rework Compute_Determinant_Hessian_Response

* this takes 84% of time of Feature_Detection
* run everything in parallel
* compute Scharr kernels just once
* compute sigma more efficiently
* allocate all matrices in evolution without zeroing

* features2d: add one bigger image to tests

* now test have images: 600x768, 900x600 and 1385x700 to cover different resolutions

* explicitly zero Lx and Ly

* add Lflow and Lstep to evolution as in original AKAZE code

* reworked computing keypoints orientation

integrated faster function from https://github.com/h2suzuki/fast_akaze

* use standard fastAtan2 instead of getAngle

* compute keypoints orientation in parallel

* fix visual studio warnings

* replace some wrapped functions with direct calls to OpenCV functions

* improved readability for people familiar with opencv
* do not same image twice in base level

* rework diffusity stencil

* use one pass stencil for diffusity from https://github.com/h2suzuki/fast_akaze
* improve locality in Create_Scale_Space

* always compute determinat od hessian and spacial derivatives

* this needs to be computed always as we need derivatives while computing descriptors
* fixed tests of AKAZE with KAZE descriptors which have been affected by this

Currently it computes all first and second order derivatives together and the determiant of the hessian. For descriptors it would be enough to compute just first order derivates, but it is not probably worth it optimize for scenario where descriptors and keypoints are computed separately, since it is already very inefficient. When computing keypoint and descriptors together it is faster to do it the current way (preserves locality).

* parallelize non linear diffusion computation

* do multiplication right in the nlp diffusity kernel

* rework kfactor computation

* get rid of sharing buffers when creating scale space pyramid, the performace impact is neglegible

* features2d: initialize TBB scheduler in perf tests

* ensures more stable output
* more reasonable profiles, since the first call of parallel_for_ is not getting big performace hit

* compute_kfactor: interleave finding of maximum and computing distance

* no need to go twice through the data

* start to use UMats in AKAZE to leverage OpenCl in the future

* fixed bug that prevented computing determinant for scale pyramid of size 1 (just the base image)
* all descriptors now support writing to uninitialized memory
* use InputArray and OutputArray for input image and descriptors, allows to make use UMAt that user passes to us

* enable use of all existing ocl paths in AKAZE

* all parts that uses ocl-enabled functions should use ocl by now

* imgproc: fix dispatching of IPP version when OCL is disabled

* when OCL is disabled IPP version should be always prefered (even when the dst is UMat)

* get rid of copy in DeterminantHessian response

* this slows CPU version considerably
* do no run in parallel when running with OCL

* store derivations as UMat in pyramid

* enables OCL path computing of determint hessian
* will allow to compute descriptors on GPU in the future

* port diffusivity to OCL

* diffusivity itself is not a blocker, but this saves us downloading and uploading derivations

* implement kernel for nonlinear scalar diffusion step

* download the pyramid from GPU just once

we don't want to downlaod matrices ad hoc from gpu when the function in AKAZE needs it. There is a HUGE mapping overhead and without shared memory support a LOT of unnecessary transfers.

This maps/downloads matrices just once.

* fix bug with uninitialized values in non linear diffusion

* this was causing spurious segfaults in stitching tests due to propagation of NaNs
* added new test, which checks for NaNs (added new debug asserts for NaNs)
* valgrind now says everything is ok

* add nonlinear diffusion step OCL implementation

* Lt in pyramid changed to UMat, it will be downlaoded from GPU along with Lx, Ly
* fix bug in pm_g2 kernel. OpenCV mangles dimensions passed to OpenCL, so we need to check for boundaries in each OCL kernel.

* port computing of determinant to OCL

* computing of determinant is not a blocker, but with this change we don't need to download all spatial derivatives to CPU, we only download determinant
* make Ldet in the pyramid UMat, download it from CPU together with the other parts of the pyramid
* add profiling macros

* fix visual studio warning

* instrument non_linear_diffusion

* remove changes I have made to TEvolution

* TEvolution is used only in KAZE now

* Revert "features2d: initialize TBB scheduler in perf tests"

This reverts commit ba81e2a711ae009ce3c5459775627b6423112669.

bb6496d9

10 Jul, 2017 1 commit
- build: detect Android via '__ANDROID__' macro · a4a47b53
  Alexander Alekhin authored 7 years ago
```
https://sourceforge.net/p/predef/wiki/OperatingSystems
```
  a4a47b53
28 Jun, 2017 1 commit
- Fixed several issues found by static analysis · a769d69a
  Maksim Shabunin authored 7 years ago
  
  a769d69a
01 Jun, 2017 1 commit
- Fallback to single threaded version of IPP gaussian blur / bilateral filter when… · f743603b
  Woody Chow authored 7 years ago
```
Fallback to single threaded version of IPP gaussian blur / bilateral filter when the mutlithreaded version cannot be called.
```
  f743603b
31 May, 2017 1 commit
- Multithread IPP gaussian blur · d22fb5f9
  Woody Chow authored 7 years ago
  
  d22fb5f9
25 May, 2017 1 commit
- Moved size restrictions for OpenVX processed images to corresponding cpp files · 1d62a025
  Vitaly Tuzov authored 7 years ago
  
  1d62a025
23 May, 2017 1 commit
- replaced SSE2 code with universal intrinsics; improved accuracy of the box… · 883d925f
  Vadim Pisarevsky authored 7 years ago
```
replaced SSE2 code with universal intrinsics; improved accuracy of the box filter; it should now be bit-exact
```
  883d925f
25 Apr, 2017 1 commit

Update for IPP for OpenCV 2017u2 integration; · 11c2ffaf

Pavel Vlasov authored 7 years ago

Updated integrations for:
cv::split
cv::merge
cv::insertChannel
cv::extractChannel
cv::Mat::convertTo - now with scaled conversions support
cv::LUT - disabled due to performance issues
Mat::copyTo
Mat::setTo
cv::flip
cv::copyMakeBorder - currently disabled
cv::polarToCart
cv::pow - ipp pow function was removed due to performance issues
cv::hal::magnitude32f/64f - disabled for <= SSE42, poor performance
cv::countNonZero
cv::minMaxIdx
cv::norm
cv::canny - new integration. Disabled for threaded;
cv::cornerHarris
cv::boxFilter
cv::bilateralFilter
cv::integral

11c2ffaf

20 Apr, 2017 1 commit
- IPP for OpenCV 2017u2 initial enabling patch; · 35c72168
  Pavel Vlasov authored 7 years ago
  
  35c72168
11 Apr, 2017 2 commits
- Disabled vxuConvolution call for sepFilter evaluation · 4c0d833d
  Vitaly Tuzov authored 7 years ago
  
  4c0d833d
- Disabled vxuConvolution call for Sobel, GaussianBlur and Box filter evaluation · 87bb7431
  Vitaly Tuzov authored 7 years ago
  
  87bb7431
06 Apr, 2017 1 commit
- Extended set of OpenVX HAL calls disabled for small images · bf5b7843
  Vitaly Tuzov authored 8 years ago
  
  bf5b7843
28 Feb, 2017 1 commit

fix medianBlur accessviolation · 5169c799

Jejos authored 8 years ago

medianBlur called with "empty" source and ksize >= 7 crashes application with accessviolation. With this extra assert this is avoided and the application may normally catch the thrown exception.

5169c799

21 Feb, 2017 1 commit
- OpenVX calls updated to use single common OpenVX context per thread · 9a4b5a45
  Vitaly Tuzov authored 8 years ago
  
  9a4b5a45
21 Dec, 2016 1 commit

Merge pull request #7802 from terfendail:ovxhal_wrappers_migration · be7d060e

Vitaly Tuzov authored 8 years ago

* OpenVX HAL updated to use generic OpenVX wrappers

* vxErr class from OpenVX HAL replaced with ivx::WrapperError

* reduced usage of vxImage class from OpenVX HAL replaced with ivx::Image

* vxImage class rewritten as ivx::Image subclass that calls swapHandle prior release

* Fix OpenVX HAL build

* Fix for review comments

be7d060e

14 Dec, 2016 1 commit
- OpenVX wrappers rewritten with CV_OVX_RUN, VX_DbgThrow · 8b9422a0
  Rostislav Vasilikhin authored 8 years ago
  
  8b9422a0
09 Dec, 2016 3 commits
- fixing build errors · f3ec56fc
  apavlenko authored 8 years ago
  
  f3ec56fc
- disabling due to accuracy issues · 541d5b02
  apavlenko authored 8 years ago
  
  541d5b02
- fixing compilation · ccd8031a
  apavlenko authored 8 years ago
  
  ccd8031a
06 Dec, 2016 1 commit

5x5 gaussian blur optimization · 396921dd

Li Peng authored 8 years ago

Add new 5x5 gaussian blur kernel for CV_8UC1 format,
it is 50% ~ 70% faster than current ocl kernel in the perf test.
Signed-off-by: Li Peng <peng.li@intel.com>

396921dd

02 Dec, 2016 1 commit
- Added OpenVX based processing to gaussianBlur · afc73969
  Vitaly Tuzov authored 8 years ago
  
  afc73969
30 Nov, 2016 1 commit
- Added OpenVX based processing to boxFilter · 6d55e992
  Vitaly Tuzov authored 8 years ago
  
  6d55e992