Tomoaki Teshima authored
* use universal intrinsics instead of raw intrinsics
* add 2-channel de-interleave on the x86 platform
* add v_int32x4 version of v_muladd
* add accumulate version of v_dotprod, based on the commit from seiko2plus on bf1852d
* remove some verification checks in the performance test
* avoid out-of-bounds access while keeping the performance
a82e70cd
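
For readers unfamiliar with the accumulate form of v_dotprod mentioned above, here is a minimal scalar sketch of what it is understood to compute for 16-bit inputs: adjacent lane pairs are multiplied and summed into 32-bit lanes, then a third vector is added. This is not the code from this commit. The names `Int16x8`, `Int32x4` and `dotprod_acc_model` are hypothetical stand-ins used only for illustration, and the sketch assumes the inputs do not overflow 32 bits.

```cpp
#include <array>
#include <cstddef>
#include <cstdint>

// Hypothetical stand-ins for 128-bit registers; these are NOT OpenCV types.
using Int16x8 = std::array<std::int16_t, 8>;
using Int32x4 = std::array<std::int32_t, 4>;

// Scalar model (assumption, for illustration only) of an accumulating dot product:
//   r[i] = a[2i] * b[2i] + a[2i+1] * b[2i+1] + c[i]
// Inputs are assumed small enough that the 32-bit sum does not overflow.
Int32x4 dotprod_acc_model(const Int16x8& a, const Int16x8& b, const Int32x4& c) {
    Int32x4 r{};
    for (std::size_t i = 0; i < r.size(); ++i) {
        // Widen to 32 bits before multiplying, as a widening SIMD multiply would.
        const std::int32_t lo = static_cast<std::int32_t>(a[2 * i]) * b[2 * i];
        const std::int32_t hi = static_cast<std::int32_t>(a[2 * i + 1]) * b[2 * i + 1];
        r[i] = lo + hi + c[i];
    }
    return r;
}
```

Folding the addition into the dot product lets a loop keep a running sum in one register instead of issuing a separate add after every multiply step, which is the usual motivation for an accumulate variant.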