2-channel interleaved load/store for universal intrinsics (float only)
* Added 2-channel ops to match existing 3-channel and 4-channel ops
* v_load_deinterleave() and v_store_interleave()
* Implements float32x4 only on SSE (but all types on NEON and CPP)
* Includes tests
* Will be used to vectorize 2D functions, such as estimateAffine2D()
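
For readers unfamiliar with the operation, here is a minimal scalar sketch of what a 2-channel deinterleaving load and interleaving store do for four float lanes. It is not the code from this change: `Float4`, `load_deinterleave2` and `store_interleave2` are hypothetical names chosen for illustration, and the loops only mirror the general idea of a plain C++ fallback.

```cpp
#include <array>
#include <cstddef>

// Hypothetical 4-lane float register, standing in for a type such as float32x4.
struct Float4 {
    std::array<float, 4> lane;
};

// Read 8 interleaved floats (x0 y0 x1 y1 x2 y2 x3 y3) and split them
// into one register of x values and one register of y values.
inline void load_deinterleave2(const float* ptr, Float4& a, Float4& b) {
    for (std::size_t i = 0; i < 4; ++i) {
        a.lane[i] = ptr[2 * i];      // even positions: first channel
        b.lane[i] = ptr[2 * i + 1];  // odd positions: second channel
    }
}

// Inverse operation: write two registers back as 8 interleaved floats.
inline void store_interleave2(float* ptr, const Float4& a, const Float4& b) {
    for (std::size_t i = 0; i < 4; ++i) {
        ptr[2 * i] = a.lane[i];
        ptr[2 * i + 1] = b.lane[i];
    }
}
```

With 2D points stored as (x, y) pairs, a load of this kind gives one register of x coordinates and one of y coordinates, which is the layout a vectorized 2D routine would typically want before doing per-coordinate arithmetic.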