- 26 Dec, 2017 1 commit
-
-
Arjan van de Ven authored
This patch adds an AVX512-optimized fastConv, as well as the hookups needed to get it called in the convolution_layer. The AVX512 fastConv is code-identical at the C level to the AVX2 one, but is measurably faster because AVX512 has more registers available to cache results in.
Signed-off-by: Arjan van de Ven <arjan@linux.intel.com>
-
- 25 Dec, 2017 1 commit
-
-
Arjan van de Ven authored
The OpenCV infrastructure mostly has the basics for supporting AVX512 math functions, but they weren't hooked up (likely due to a lack of users). In order to compile the DNN functions for AVX512, a few things need to be hooked up, and this patch does that.
Signed-off-by: Arjan van de Ven <arjan@linux.intel.com>
-
- 22 Dec, 2017 29 commits
-
-
Alexander Alekhin authored
-
Alexander Alekhin authored
3.4.0
-
Alexander Alekhin authored
-
Alexander Alekhin authored
-
Alexander Alekhin authored
-
Alexander Alekhin authored
-
Alexander Alekhin authored
-
Dmitry Kurtaev authored
-
Vitaly Tuzov authored
-
Alexander Alekhin authored
-
Li Peng authored
Signed-off-by: Li Peng <peng.li@intel.com>
-
Alexander Alekhin authored
-
Vadim Pisarevsky authored
-
Vadim Pisarevsky authored
-
Vadim Pisarevsky authored
-
Vadim Pisarevsky authored
-
Vadim Pisarevsky authored
-
Alexander Alekhin authored
-
Alexander Alekhin authored
-
Maksim Shabunin authored
-
berak authored
-
Vitaly Tuzov authored
-
Alexander Alekhin authored
-
Vadim Pisarevsky authored
-
Alexander Alekhin authored
-
Alexander Alekhin authored
-
Vadim Pisarevsky authored
-
wxzs5 authored
-
Vitaly Tuzov authored
-
- 21 Dec, 2017 9 commits
-
-
Tomoaki Teshima authored
* disable the warning in CMake, not in the code using a pragma
-
Alexander Alekhin authored
-
Vadim Pisarevsky authored
-
Vadim Pisarevsky authored
-
Alexander Alekhin authored
-
Tomoaki Teshima authored
* confirmed test failure on Jetson TX1 and TX2
* show the performance but not bit exact result
-
Li Peng authored
Signed-off-by: Li Peng <peng.li@intel.com>
-
Li Peng authored
Signed-off-by: Li Peng <peng.li@intel.com>
-
Dmitry Kurtaev authored
-