Unverified Commit 8c24af66 authored by Chip Kerchner's avatar Chip Kerchner Committed by GitHub

Merge pull request #16556 from ChipKerchner:vectorizeIntegralSumPixels

* Vectorize calculating integral for line for single and multiple channels

* Single vector processing for 4-channels - 25-30% faster

* Single vector processing for 4-channels - 25-30% faster

* Fixed AVX512 code for 4 channels

* Disable 3 channel 8UC1 to 32S for SSE2 and SSE3 (slower).  Use new version of 8UC1 to 64F for AVX512.
parent 7ffab23a
This diff is collapsed.
Markdown is supported
0% or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment