• Chuanbo Weng's avatar
    Remove unnecessary kercn limitation of 4. · 2d8c89c4
    Chuanbo Weng authored
    When accessing global memory by DWORD4, memory bandwidth
    can be fully utilized on Intel platform. This patch will
    make more image format(e.g. 8UC4) be processed in DWORD4
    by work-item. After applying this patch, 3 subcase of
    ./opencv_perf_core --gtest_filter=OCL_RepeatFixture_Repeat.Repeat/*
    can be speedup on HD4000 graphics card with Beignet:
    OCL_RepeatFixture_Repeat.Repeat/2, 64% improvement.
    OCL_RepeatFixture_Repeat.Repeat/6, 50% improvement.
    OCL_RepeatFixture_Repeat.Repeat/8, 56% improvement.
    Signed-off-by: 's avatarChuanbo Weng <chuanbo.weng@intel.com>
    2d8c89c4
Name
Last commit
Last update
3rdparty Loading commit data...
apps Loading commit data...
cmake Loading commit data...
data Loading commit data...
doc Loading commit data...
include Loading commit data...
modules Loading commit data...
platforms Loading commit data...
samples Loading commit data...
.gitattributes Loading commit data...
.gitignore Loading commit data...
.tgitconfig Loading commit data...
CMakeLists.txt Loading commit data...
LICENSE Loading commit data...
README.md Loading commit data...
index.rst Loading commit data...