1. 13 Jan, 2017 1 commit
  2. 06 Dec, 2016 1 commit
  3. 02 Dec, 2016 1 commit
  4. 30 Nov, 2016 1 commit
  5. 29 Nov, 2016 1 commit
  6. 23 Nov, 2016 1 commit
  7. 17 Nov, 2016 1 commit
  8. 14 Nov, 2016 1 commit
  9. 08 Nov, 2016 1 commit
  10. 07 Nov, 2016 1 commit
  11. 04 Nov, 2016 2 commits
    • morph ocl kernel for erode and dilate filter · 35198b84
      Li Peng authored
      This kernel handles the CV_8UC1 format with a 3x3 kernel size.
      It is about 33% to 55% faster than the current OCL kernel in the perf tests below:
      
      python ./modules/ts/misc/run.py -t imgproc --gtest_filter=OCL_ErodeFixture*
      python ./modules/ts/misc/run.py -t imgproc --gtest_filter=OCL_DilateFixture*
      
      Also add accuracy test cases for this kernel. The test command is:
      
      ./bin/opencv_test_imgproc --gtest_filter=OCL_Filter/MorphFilter3x3*
      Signed-off-by: Li Peng <peng.li@intel.com>
      35198b84
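For readers unfamiliar with the operation this commit accelerates, the following is a minimal plain-C reference sketch of a 3x3 erosion on a single-channel 8-bit image. It is not the OpenCL kernel from the commit. The function name `erode3x3_u8` is hypothetical, and the border handling (out-of-image neighbours are skipped) is an assumption made for illustration.

```c
#include <stddef.h>
#include <stdint.h>

/* Reference 3x3 erosion for a row-major image with one byte per pixel
 * (the memory layout of CV_8UC1). Each output pixel is the minimum of the
 * input pixels in its 3x3 neighbourhood.
 * Assumption: neighbours that fall outside the image are skipped. */
void erode3x3_u8(const uint8_t *src, uint8_t *dst, int width, int height)
{
    for (int y = 0; y < height; ++y) {
        for (int x = 0; x < width; ++x) {
            uint8_t m = 255;
            for (int dy = -1; dy <= 1; ++dy) {
                for (int dx = -1; dx <= 1; ++dx) {
                    int yy = y + dy;
                    int xx = x + dx;
                    if (yy < 0 || yy >= height || xx < 0 || xx >= width)
                        continue; /* outside the image: ignore */
                    uint8_t v = src[(size_t)yy * (size_t)width + (size_t)xx];
                    if (v < m)
                        m = v;
                }
            }
            dst[(size_t)y * (size_t)width + (size_t)x] = m;
        }
    }
}
```

Dilation would follow the same pattern with the maximum in place of the minimum.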
    • Fix the OpenCL portion to match the C++ code. · 17df65e6
      Tetragramm authored
      Fix a previously undiscovered bug in the C++ code.
      17df65e6
  12. 26 Oct, 2016 1 commit
  13. 17 Oct, 2016 1 commit
  14. 09 Aug, 2016 1 commit
    • ocl: fix Canny for Intel devices · b8e08d5d
      Alexander Alekhin authored
      There is an issue with how the abs(short) function processes a
      negative argument.
      
      Affected OpenCL devices:
      - iGPU: Intel(R) HD Graphics 520 (OpenCL 2.0)
      - CPU: Intel(R) Core(TM) i5-6300U CPU @ 2.40GHz (OpenCL 2.0 (Build 10094))
      b8e08d5d
  15. 24 Apr, 2016 1 commit
    • Add OpenCL support to linearPolar & logPolar · db9f6117
      ohnozzy authored
      Add OpenCL support to linearPolar & logPolar.
      The OpenCL code uses float instead of double so that it does not require
      the cl_khr_fp64 extension, at the cost of a slight loss of precision.
      
      Add explicit conversion
      
      Add explicit conversion from double to float to eliminate a warning
      during compilation.
      db9f6117
  16. 15 Mar, 2016 1 commit
    • fix potential race condition in canny.cl. · 0b08d255
      Zhigang Gong authored
      See the code snippet below:
      
      while(l_counter != 0)
      {
          int mod = l_counter % LOCAL_TOTAL;
          int pix_per_thr = l_counter / LOCAL_TOTAL + ((lid < mod) ? 1 : 0);
      
          for (int i = 0; i < pix_per_thr; ++i)
          {
              int index = atomic_dec(&l_counter) - 1;
              ....
          }
          ....
          barrier(CLK_LOCAL_MEM_FENCE);
      }
      
      If we don't put a barrier before the for loop, some work items may
      enter the loop before the others. Those work items decrement l_counter
      in the for loop, possibly down to zero, so the remaining work items may
      never enter the while loop. If this happens, it breaks the barrier rule,
      which requires all work items to reach the same barrier, and it may hang
      the GPU, depending on the OpenCL platform's implementation.
      
      This issue was raised at:
      https://github.com/Itseez/opencv/issues/5175

      Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>
      0b08d255
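The work split in the snippet above can be checked in plain C. This is a sketch of the arithmetic only, not of the OpenCL kernel or of the fix itself, and the helper name `total_popped` is hypothetical. It shows that when every work item computes its share from the same value of l_counter, the shares add up to exactly that value, so the counter reaches zero once all items finish their for loop. If work items read l_counter at different moments, which a missing barrier allows, they compute their shares from different values and this guarantee no longer holds.

```c
/* Share of the queue that work item `lid` pops in one pass, computed from a
 * snapshot `counter` of l_counter with `local_total` work items in the group.
 * Mirrors the mod / pix_per_thr arithmetic in the commit's snippet. */
int pix_per_thr(int counter, int local_total, int lid)
{
    int mod = counter % local_total;
    return counter / local_total + ((lid < mod) ? 1 : 0);
}

/* Hypothetical helper: total number of pops when all work items use the
 * same snapshot. This always equals `counter`. */
int total_popped(int counter, int local_total)
{
    int total = 0;
    for (int lid = 0; lid < local_total; ++lid)
        total += pix_per_thr(counter, local_total, lid);
    return total;
}
```

For example, with a snapshot of 10 and 4 work items, the shares are 3, 3, 2 and 2.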
  17. 26 May, 2015 2 commits
  18. 22 Apr, 2015 1 commit
  19. 26 Nov, 2014 1 commit
  20. 07 Nov, 2014 1 commit
  21. 06 Nov, 2014 1 commit
  22. 05 Nov, 2014 1 commit
  23. 28 Oct, 2014 1 commit
  24. 27 Oct, 2014 1 commit
  25. 21 Oct, 2014 5 commits
  26. 07 Oct, 2014 1 commit
  27. 29 Sep, 2014 2 commits
  28. 17 Sep, 2014 1 commit
    • Use vload to read unaligned data instead of dereference operator. · c5552788
      Chuanbo Weng authored
      According to section 6.1.5 of the OpenCL 1.2 spec:
          For arguments to a __kernel function declared to be a pointer to a
          data type, the OpenCL compiler can assume that the pointee is always
          appropriately aligned as required by the data type. The behavior of
          an unaligned load or store is undefined, except for the
          vloadn, vload_halfn, vstoren, and vstore_halfn functions defined in
          section 6.12.7.
      
      The original code read data of type T from an address that is not a
      multiple of sizeof(T), so the result was incorrect. With this patch, the cases
      ./opencv_perf_imgproc
      --gtest_filter=OCL_ImgSize_TmplSize_Method_MatType_MatchTemplate.MatchTemplate/*
      work well with beignet 0.9.3.
      Signed-off-by: Chuanbo Weng <chuanbo.weng@intel.com>
      c5552788
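The same alignment concern exists in host-side C, where dereferencing a misaligned pointer is also undefined behaviour. As a rough analog of what vload does in OpenCL C (an illustration, not the commit's code), a byte-wise copy reads the value without assuming any alignment. The function name `load_u32_unaligned` is hypothetical.

```c
#include <stdint.h>
#include <string.h>

/* Read a 32-bit value from an address that may not be 4-byte aligned.
 * memcpy copies byte by byte as far as the language is concerned, so no
 * alignment requirement applies to `p`. */
uint32_t load_u32_unaligned(const uint8_t *p)
{
    uint32_t v;
    memcpy(&v, p, sizeof v);
    return v;
}
```

Casting `p` to `const uint32_t *` and dereferencing it would be the plain-C counterpart of the dereference the commit replaces.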
  29. 11 Sep, 2014 2 commits
  30. 05 Sep, 2014 2 commits
  31. 04 Sep, 2014 1 commit