    use universal intrinsic for FP16 · 903789f7
    Tomoaki Teshima authored
      * use v_float16x4 (universal intrinsic) instead of raw SSE/NEON implementation
      * define v_load_f16/v_store_f16 since v_load can't be distinguished when short pointer passed
      * brush up implementation on old compiler (guard correctly)
      * add test for v_load_f16 and round trip conversion of v_float16x4
      * fix conversion error
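The commit message notes that a plain `v_load` cannot be told apart when it is given a short pointer, which is why separate `v_load_f16`/`v_store_f16` names were introduced. The sketch below illustrates that naming problem and a round-trip check. It is not OpenCV code: `Int16x4`, `Half4`, `Float4`, `load`, `load_f16`, `store_f16`, `float_to_half`, `half_to_float` and `check_round_trip` are hypothetical stand-ins, and the scalar conversion is a simplification that truncates the mantissa and flushes subnormals to zero.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

// Hypothetical stand-ins for illustration only; these are NOT OpenCV's types.
struct Int16x4 { short lane[4]; };  // four signed 16-bit integers
struct Half4   { short bits[4]; };  // four FP16 values kept as raw bit patterns
struct Float4  { float lane[4]; };  // four single-precision values

// A plain load from a short pointer already means "integer load".
inline Int16x4 load(const short* p) {
    Int16x4 r;
    std::memcpy(r.lane, p, sizeof r.lane);
    return r;
}

// FP16 data sits in the same storage type, and C++ cannot overload on the
// return type alone, so the FP16 load and store get their own names.
inline Half4 load_f16(const short* p) {
    Half4 r;
    std::memcpy(r.bits, p, sizeof r.bits);
    return r;
}

inline void store_f16(short* p, const Half4& v) {
    std::memcpy(p, v.bits, sizeof v.bits);
}

// Simplified scalar float -> half: truncates the mantissa, maps overflow to
// infinity, and flushes values too small for a normal half to signed zero.
inline short float_to_half(float f) {
    std::uint32_t x;
    std::memcpy(&x, &f, sizeof x);
    const std::uint32_t sign = (x >> 16) & 0x8000u;
    const std::uint32_t exp32 = (x >> 23) & 0xFFu;
    const std::uint32_t mant = x & 0x7FFFFFu;
    std::uint32_t h;
    if (exp32 == 0xFFu) {
        h = sign | 0x7C00u | (mant ? 0x200u : 0u);  // infinity or NaN
    } else {
        const int e = static_cast<int>(exp32) - 127 + 15;  // rebias exponent
        if (e >= 31)     h = sign | 0x7C00u;               // overflow
        else if (e <= 0) h = sign;                         // underflow
        else h = sign | (static_cast<std::uint32_t>(e) << 10) | (mant >> 13);
    }
    return static_cast<short>(static_cast<std::uint16_t>(h));
}

// Simplified scalar half -> float: subnormal halves are treated as zero.
inline float half_to_float(short hs) {
    const std::uint32_t h = static_cast<std::uint16_t>(hs);
    const std::uint32_t sign = (h & 0x8000u) << 16;
    const std::uint32_t e = (h >> 10) & 0x1Fu;
    const std::uint32_t mant = h & 0x3FFu;
    std::uint32_t x;
    if (e == 0u)       x = sign;
    else if (e == 31u) x = sign | 0x7F800000u | (mant << 13);
    else               x = sign | ((e - 15u + 127u) << 23) | (mant << 13);
    float f;
    std::memcpy(&f, &x, sizeof f);
    return f;
}

// Round trip: float -> half -> memory -> half -> float.
inline void check_round_trip() {
    const Float4 in = {{1.0f, -2.5f, 0.5f, 1024.0f}};  // exact in FP16
    Half4 h;
    for (int i = 0; i < 4; ++i) h.bits[i] = float_to_half(in.lane[i]);
    short buf[4];
    store_f16(buf, h);
    const Half4 back = load_f16(buf);
    for (int i = 0; i < 4; ++i) assert(half_to_float(back.bits[i]) == in.lane[i]);
}
```

The test values are chosen to be exactly representable in half precision, so the round trip is expected to be lossless. Arbitrary floats would generally come back rounded or truncated.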
Name
.github
3rdparty
apps
cmake
data
doc
include
modules
platforms
samples
.gitattributes
.gitignore
.tgitconfig
CMakeLists.txt
CONTRIBUTING.md
LICENSE
README.md