- 30 Dec, 2014 4 commits
-
-
Vladislav Vinogradov authored
-
Vladislav Vinogradov authored
-
Vladislav Vinogradov authored
it might cause errors, due to implicit type conversion and another cuda::divide overload
-
Vladislav Vinogradov authored
they allow to perform asynchronous upload/download into temporary buffer to get valid GpuMat object
-
- 26 Dec, 2014 2 commits
-
-
Vladislav Vinogradov authored
-
Vladislav Vinogradov authored
-
- 25 Dec, 2014 4 commits
-
-
Vladislav Vinogradov authored
the algorithm is not stable yet
-
Vladislav Vinogradov authored
-
Vladislav Vinogradov authored
use correct types for integral/sum outputs
-
Vladislav Vinogradov authored
-
- 24 Dec, 2014 1 commit
-
-
Maksim Shabunin authored
-
- 23 Dec, 2014 13 commits
-
-
Vladislav Vinogradov authored
add swap instruction for allocator field
-
Vladislav Vinogradov authored
use CV_CUDA_TEST_MAIN for opencv_test_core to initialize CUDA device information
-
Vladislav Vinogradov authored
-
Vladislav Vinogradov authored
it allows to use cudaHostAlloc methods for cv::Mat objects
-
Vladislav Vinogradov authored
-
Vladislav Vinogradov authored
-
Vladislav Vinogradov authored
-
Vladislav Vinogradov authored
* it is the only place, where it is used * no need to make this function public
-
Vladislav Vinogradov authored
move main CUDA group to modules/core/cuda.hpp
-
Vladislav Vinogradov authored
add a note to use new cudev module as a replacement
-
Vladislav Vinogradov authored
-
Vladislav Vinogradov authored
it is internal class, no need to export it
-
Vladislav Vinogradov authored
The deinitialization of BufferPool internal objects is controled by global object, but it depends on other global objects, which leads to errors caused by undefined deinitialization order of global objects. I merge global objects initialization into single class, which performs initialization and deinitialization in correct order.
-
- 22 Dec, 2014 4 commits
-
-
Jiri Drbalek authored
-
Vladislav Vinogradov authored
-
Vladislav Vinogradov authored
-
Vladislav Vinogradov authored
it is generated by CUDA headers and we can't fix it
-
- 21 Dec, 2014 1 commit
-
-
orestis authored
to compensate for neon ieee754 non-compliancy. Also changed the comparison between max valid and calculated distance to make the error message more accurate (in case curMaxDist == maxDist)
-
- 20 Dec, 2014 1 commit
-
-
orestis authored
Set it 1 instead of 0.001, as is already done in gaussianBlur3x3. That will allow integer destination matrices that are not exactly the same, but very close to the expected result, to pass the test.
-
- 19 Dec, 2014 10 commits
-
-
orestis authored
NEON speedup: 2.31x Auto-vect speedup: 2.26x Test kernel: [-0.9432, -1.1528, 0, 1.1528, 0.9432]
-
orestis authored
NEON speedup: 2.36x Auto-vect speedup: 2.36x Test kernel: [0.1, 0.2408, 0.3184, 0.2408, 0.1]
-
orestis authored
NEON speedup: 9.46x Auto-vect speedup: 1x Test kernel: [-0.9432, -1.1528, 0, 1.1528, 0.9432]
-
orestis authored
NEON speedup: 8.64x Auto-vect speedup: 1x Test kernel: [0.1, 0.2408, 0.3184, 0.2408, 0.1]
-
orestis authored
NEON speedup: 2.12x Auto-vect speedup: 1.01x Test kernel: [-2, 0, 2]
-
orestis authored
NEON speedup: 3.27x Auto-vect speedup: 1.01x
-
orestis authored
NEON speedup: 1.75x Auto-vect speedup: 1x
-
orestis authored
NEON speedup: 2.04x Auto-vect speedup: 1x
-
orestis authored
NEON speedup: 2.75x Auto-vect speedup: 1.01x
-
orestis authored
NEON speedup: 2.66x Auto-vect speedup: 1x
-