Commits · a4e598f474b5986d0be9ba0b0fafcbf5ee82fe02 · submodule / opencv

30 Dec, 2014 4 commits
- use new BufferPool class for some cudaarithm routines · a4e598f4
  Vladislav Vinogradov authored 10 years ago
  
  a4e598f4
- use new getInputMat/getOutputMat/syncOutput methods in cudaarithm routines · 7454189c
  Vladislav Vinogradov authored 10 years ago
  
  7454189c
- remove reciprocal version of cuda::divide · 3d0410c1
  Vladislav Vinogradov authored 10 years ago
```
it might cause errors due to implicit type conversion and the other cuda::divide
overload
```
  3d0410c1
- add auxiliary functions to work with Input/Output arrays: · 00e7816c
  Vladislav Vinogradov authored 10 years ago
```
they allow performing an asynchronous upload/download into a temporary buffer
to get a valid GpuMat object
```
  00e7816c
26 Dec, 2014 2 commits
- disable GeneralizedHoughGuil performance test · fe3f236a
  Vladislav Vinogradov authored 10 years ago
  
  fe3f236a
- improve error reporting in _InputArray methods · f36546db
  Vladislav Vinogradov authored 10 years ago
  
  f36546db
25 Dec, 2014 4 commits
- disable sanity test for GeneralizedHoughGuil · 7c901e39
  Vladislav Vinogradov authored 10 years ago
```
the algorithm is not stable yet
```
  7c901e39
- fix tests for match template · 18d1be45
  Vladislav Vinogradov authored 10 years ago
  
  18d1be45
- fix cuda match template: · 26afa49d
  Vladislav Vinogradov authored 10 years ago
```
use correct types for integral/sum outputs
```
  26afa49d
- rewrite cuda::cvtColor with new device layer and fix test failures · 9b8c3fd6
  Vladislav Vinogradov authored 10 years ago
  
  9b8c3fd6
24 Dec, 2014 1 commit
- Added environment search paths for OpenNI2 for Linux and fixed specific warning · 128e5095
  Maksim Shabunin authored 10 years ago
  
  128e5095
23 Dec, 2014 13 commits
- fix GpuMat::swap method: · e7e0da01
  Vladislav Vinogradov authored 10 years ago
```
add swap instruction for allocator field
```
  e7e0da01
- refactor CV_CUDA_TEST_MAIN, use CV_TEST_MAIN for it · b33f3bb2
  Vladislav Vinogradov authored 10 years ago
```
use CV_CUDA_TEST_MAIN for opencv_test_core to initialize CUDA
device information
```
  b33f3bb2
- add Allocator parameter to cudev::GpuMat_ constructors · 8237418b
  Vladislav Vinogradov authored 10 years ago
  
  8237418b
- add cuda::HostMem::getAllocator method · f054d631
  Vladislav Vinogradov authored 10 years ago
```
it allows using cudaHostAlloc methods for cv::Mat objects
```
  f054d631
- add more FeatureSet constants · 2f8e1798
  Vladislav Vinogradov authored 10 years ago
  
  2f8e1798
- move CUDA core tests to core module · 1be1a289
  Vladislav Vinogradov authored 10 years ago
  
  1be1a289
- rename CudaMem -> HostMem to better reflect its purpose · 53862687
  Vladislav Vinogradov authored 10 years ago
  
  53862687
- move allocMatFromBuf function to farneback.cpp: · 9210d8e5
  Vladislav Vinogradov authored 10 years ago
```
* it is the only place where it is used
* no need to make this function public
```
  9210d8e5
- minor reorganization for CUDA doxygen groups: · 1d82aecf
  Vladislav Vinogradov authored 10 years ago
```
move main CUDA group to modules/core/cuda.hpp
```
  1d82aecf
- mark old CUDA device layer as deprecated and remove it from doxygen documentation · b5ab82fd
  Vladislav Vinogradov authored 10 years ago
```
add a note to use new cudev module as a replacement
```
  b5ab82fd
- fix null stream initialization for multi-gpu systems · 68e08bbe
  Vladislav Vinogradov authored 10 years ago
  
  68e08bbe
- move StackAllocator to cpp file · 05d40946
  Vladislav Vinogradov authored 10 years ago
```
it is an internal class, no need to export it
```
  05d40946
- fix cuda::BufferPool deinitialization · 7ed38b97
  Vladislav Vinogradov authored 10 years ago
```
The deinitialization of BufferPool internal objects is controlled by a global
object, but it depends on other global objects, which leads to errors
caused by the undefined deinitialization order of global objects.

The initialization of the global objects is merged into a single class, which
performs initialization and deinitialization in the correct order.
```
  7ed38b97
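The fix described in 7ed38b97 follows a common C++ pattern: objects that depend on each other are owned by one class, so their construction and destruction order is fixed by member declaration order. The sketch below is only an illustration of that pattern, not the actual OpenCV code; `DeviceRegistry`, `PoolStorage`, `CudaGlobals`, and `runLifetime` are hypothetical names introduced here.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Records construction/destruction events so the order can be inspected.
static std::vector<std::string>& eventLog()
{
    static std::vector<std::string> log;
    return log;
}

// Hypothetical stand-in for a resource that other objects depend on.
struct DeviceRegistry
{
    DeviceRegistry()  { eventLog().push_back("registry up"); }
    ~DeviceRegistry() { alive = false; eventLog().push_back("registry down"); }
    bool alive = true;
};

// Hypothetical stand-in for a pool that still needs the registry while shutting down.
struct PoolStorage
{
    explicit PoolStorage(DeviceRegistry& r) : registry(r) { eventLog().push_back("pool up"); }
    ~PoolStorage()
    {
        assert(registry.alive); // the dependency must outlive the pool
        eventLog().push_back("pool down");
    }
    DeviceRegistry& registry;
};

// Single owner (assumed design): members are constructed in declaration order
// and destroyed in reverse order, so the pool always goes away first.
struct CudaGlobals
{
    DeviceRegistry registry;
    PoolStorage pool{registry};
};

// Runs one full lifetime of the owner and returns the recorded events.
std::vector<std::string> runLifetime()
{
    eventLog().clear();
    {
        CudaGlobals globals;
    }
    return eventLog();
}
```

With two separate globals in different translation units, the relative order of the two destructors would not be guaranteed; with a single owner it is determined by the language rules.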
22 Dec, 2014 4 commits
- fix crash when sample point out of image boundaries · d71e0017
  Jiri Drbalek authored 10 years ago
  
  d71e0017
- increase epsilons for tests due to different optimizations (IPP vs CUDA, float vs double) · ec33c4ae
  Vladislav Vinogradov authored 10 years ago
  
  ec33c4ae
- update cudev color conversions according to the latest changes in CPU code · 25f33a7e
  Vladislav Vinogradov authored 10 years ago
  
  25f33a7e
- disable -Wshadow warning for CUDA modules: · 48c9c24d
  Vladislav Vinogradov authored 10 years ago
```
it is generated by CUDA headers and we can't fix it
```
  48c9c24d
21 Dec, 2014 1 commit
- Change DescriptorExtractor_ORB regression test · fffe2464
  orestis authored 10 years ago
  
  to compensate for NEON IEEE 754 non-compliance.
  Also changed the comparison between max valid and calculated distance to
  make the error message more accurate (in case curMaxDist == maxDist)
  
  fffe2464
20 Dec, 2014 1 commit
- Change gaussianBlur5x5 perf test epsilon · 9811a739
  orestis authored 10 years ago
  
  Set it to 1 instead of 0.001, as is already done in gaussianBlur3x3. That
  will allow integer destination matrices that are not exactly the same,
  but very close to the expected result, to pass the test.
  
  9811a739
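The effect of the epsilon change in 9811a739 can be illustrated with a small tolerance check. This is a minimal sketch that assumes an element-wise absolute-difference comparison; `withinEpsilon` is a hypothetical helper, not the function the OpenCV perf framework uses.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdlib>
#include <vector>

// Returns true when every pair of corresponding elements differs by at most eps.
// Hypothetical helper: it only illustrates how the tolerance affects integer data.
bool withinEpsilon(const std::vector<int>& actual,
                   const std::vector<int>& expected,
                   double eps)
{
    assert(actual.size() == expected.size());
    for (std::size_t i = 0; i < actual.size(); ++i)
    {
        // For integer data the smallest non-zero difference is 1,
        // so any eps below 1 behaves like an exact comparison.
        if (std::abs(actual[i] - expected[i]) > eps)
            return false;
    }
    return true;
}
```

For integer outputs, a tolerance of 0.001 rejects even an off-by-one rounding difference, while a tolerance of 1 accepts it and still rejects larger deviations.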

19 Dec, 2014 10 commits
- SymmRowSmallVec_32f 1x5 asymm · 9c6da035
  orestis authored 10 years ago
```
NEON speedup: 2.31x
Auto-vect speedup: 2.26x

Test kernel: [-0.9432, -1.1528, 0, 1.1528, 0.9432]
```
  9c6da035
- SymmRowSmallVec_32f 1x5 · 13c08551
  orestis authored 10 years ago
```
NEON speedup: 2.36x
Auto-vect speedup: 2.36x

Test kernel: [0.1, 0.2408, 0.3184, 0.2408, 0.1]
```
  13c08551
- SymmColumnVec_32f16s asymm · ed0ce481
  orestis authored 10 years ago
```
NEON speedup: 9.46x
Auto-vect speedup: 1x

Test kernel: [-0.9432, -1.1528, 0, 1.1528, 0.9432]
```
  ed0ce481
- SymmColumnVec_32f16s · a2a13179
  orestis authored 10 years ago
```
NEON speedup: 8.64x
Auto-vect speedup: 1x

Test kernel: [0.1, 0.2408, 0.3184, 0.2408, 0.1]
```
  a2a13179
- SymmColumnSmallVec_32s16s 3x1 asymm · 37e01845
  orestis authored 10 years ago
```
NEON speedup: 2.12x
Auto-vect speedup: 1.01x

Test kernel: [-2, 0, 2]
```
  37e01845
- SymmColumnSmallVec_32s16s [-1, 0, 1] · 4443d6b0
  orestis authored 10 years ago
```
NEON speedup: 3.27x
Auto-vect speedup: 1.01x
```
  4443d6b0
- SymmColumnSmallVec_32s16s 3x1 · 99e782e6
  orestis authored 10 years ago
```
NEON speedup: 1.75x
Auto-vect speedup: 1x
```
  99e782e6
- SymmColumnSmallVec_32s16s [3, 10, 3] Scharr · 33dfeb85
  orestis authored 10 years ago
```
NEON speedup: 2.04x
Auto-vect speedup: 1x
```
  33dfeb85
- SymmColumnSmallVec_32s16s [1, -2, 1] · 61a7f48b
  orestis authored 10 years ago
```
NEON speedup: 2.75x
Auto-vect speedup: 1.01x
```
  61a7f48b
- SymmColumnSmallVec_32s16s [1, 2, 1] · 4f906372
  orestis authored 10 years ago
```
NEON speedup: 2.66x
Auto-vect speedup: 1x
```
  4f906372
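The 19 Dec commits vectorize filters whose kernels are symmetric or anti-symmetric around the centre tap. A scalar reference for the 1x5 row case might look like the sketch below; it is an illustration under stated assumptions rather than the OpenCV implementation. The function names are hypothetical, and borders are handled by producing only the valid output positions.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Symmetric 5-tap row filter, kernel = [k2, k1, k0, k1, k2].
// Mirrored taps share a coefficient, so their samples are added first.
std::vector<float> rowFilterSymm5(const std::vector<float>& src,
                                  float k0, float k1, float k2)
{
    assert(src.size() >= 5);
    std::vector<float> dst(src.size() - 4); // valid positions only
    for (std::size_t i = 0; i < dst.size(); ++i)
    {
        const float* s = &src[i + 2]; // points at the centre sample
        dst[i] = k0 * s[0] + k1 * (s[-1] + s[1]) + k2 * (s[-2] + s[2]);
    }
    return dst;
}

// Anti-symmetric 5-tap row filter, kernel = [-k2, -k1, 0, k1, k2].
// Mirrored taps have opposite signs, so their samples are subtracted first.
std::vector<float> rowFilterAsymm5(const std::vector<float>& src,
                                   float k1, float k2)
{
    assert(src.size() >= 5);
    std::vector<float> dst(src.size() - 4); // valid positions only
    for (std::size_t i = 0; i < dst.size(); ++i)
    {
        const float* s = &src[i + 2]; // points at the centre sample
        dst[i] = k1 * (s[1] - s[-1]) + k2 * (s[2] - s[-2]);
    }
    return dst;
}
```

With the test kernels quoted in the commit messages, the symmetric case corresponds to `k0 = 0.3184, k1 = 0.2408, k2 = 0.1` and the anti-symmetric case to `k1 = 1.1528, k2 = 0.9432`. Folding mirrored taps reduces the multiplications per output from five to three (symmetric) or two (anti-symmetric).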