• Tomoaki Teshima's avatar
    Merge pull request #9753 from tomoaki0705:universalMatmul · 3cbe60cc
    Tomoaki Teshima authored
    * add accuracy test and performance check for matmul
      * add performance tests for transform and dotProduct
      * add test Core_TransformLargeTest for 8u version of transform
    
    * remove raw SSE2/NEON implementation from matmul.cpp
      * use universal intrinsic instead of raw intrinsic
      * remove unused templated function
      * add v_matmuladd which multiply 3x3 matrix and add 3x1 vector
      * add v_rotate_left/right in universal intrinsic
      * suppress intrinsic on some function and platform
      * add pure SW implementation of new universal intrinsics
      * add test for new universal intrinsics
    
    * core: prevent memory access after the end of buffer
    
    * fix perf tests
    3cbe60cc
Name
Last commit
Last update
..
cuda Loading commit data...
opencl Loading commit data...
algorithm.cpp Loading commit data...
alloc.cpp Loading commit data...
arithm.cpp Loading commit data...
arithm_core.hpp Loading commit data...
arithm_simd.hpp Loading commit data...
array.cpp Loading commit data...
bufferpool.impl.hpp Loading commit data...
command_line_parser.cpp Loading commit data...
conjugate_gradient.cpp Loading commit data...
convert.avx2.cpp Loading commit data...
convert.cpp Loading commit data...
convert.fp16.cpp Loading commit data...
convert.hpp Loading commit data...
convert.sse4_1.cpp Loading commit data...
copy.cpp Loading commit data...
cuda_gpu_mat.cpp Loading commit data...
cuda_host_mem.cpp Loading commit data...
cuda_info.cpp Loading commit data...
cuda_stream.cpp Loading commit data...
datastructs.cpp Loading commit data...
directx.cpp Loading commit data...
directx.inc.hpp Loading commit data...
downhill_simplex.cpp Loading commit data...
dxt.cpp Loading commit data...
gl_core_3_1.cpp Loading commit data...
gl_core_3_1.hpp Loading commit data...
glob.cpp Loading commit data...
hal_internal.cpp Loading commit data...
hal_internal.hpp Loading commit data...
hal_replacement.hpp Loading commit data...
intel_gpu_gemm.inl.hpp Loading commit data...
kmeans.cpp Loading commit data...
lapack.cpp Loading commit data...
lda.cpp Loading commit data...
lpsolver.cpp Loading commit data...
mathfuncs.cpp Loading commit data...
mathfuncs_core.dispatch.cpp Loading commit data...
mathfuncs_core.simd.hpp Loading commit data...
matmul.cpp Loading commit data...
matop.cpp Loading commit data...
matrix.cpp Loading commit data...
matrix_decomp.cpp Loading commit data...
merge.cpp Loading commit data...
ocl.cpp Loading commit data...
ocl_deprecated.hpp Loading commit data...
opengl.cpp Loading commit data...
out.cpp Loading commit data...
ovx.cpp Loading commit data...
parallel.cpp Loading commit data...
parallel_pthreads.cpp Loading commit data...
pca.cpp Loading commit data...
persistence.cpp Loading commit data...
precomp.hpp Loading commit data...
rand.cpp Loading commit data...
softfloat.cpp Loading commit data...
split.cpp Loading commit data...
stat.cpp Loading commit data...
stat.dispatch.cpp Loading commit data...
stat.simd.hpp Loading commit data...
stl.cpp Loading commit data...
system.cpp Loading commit data...
tables.cpp Loading commit data...
trace.cpp Loading commit data...
types.cpp Loading commit data...
umatrix.cpp Loading commit data...
va_intel.cpp Loading commit data...