1. 04 Sep, 2016 1 commit
    • Tomoaki Teshima's avatar
      use universal intrinsic for FP16 · 903789f7
      Tomoaki Teshima authored
        * use v_float16x4 (universal intrinsic) instead of raw SSE/NEON implementation
        * define v_load_f16/v_store_f16 since v_load can't be distinguished when short pointer passed
        * brush up implementation on old compiler (guard correctly)
        * add test for v_load_f16 and round trip conversion of v_float16x4
        * fix conversion error
      903789f7
  2. 02 Sep, 2016 2 commits
  3. 01 Sep, 2016 3 commits
  4. 31 Aug, 2016 3 commits
  5. 30 Aug, 2016 1 commit
    • Tomoaki Teshima's avatar
      add 64F intrinsic in HAL NEON · 7fef96be
      Tomoaki Teshima authored
        * use universal intrinsic for accumulate series using float/double
        * accumulate, accumulateSquare, accumulateProduct and accumulateWeighted
        * add v_cvt_f64_high in both SSE/NEON
        * add test for conversion v_cvt_f64_high in test_intrin.cpp
        * improve some existing universal intrinsic by using new instructions in Aarch64
        * add workaround for Android build in intrin_neon.hpp
      7fef96be
  6. 29 Aug, 2016 4 commits
  7. 27 Aug, 2016 1 commit
  8. 26 Aug, 2016 10 commits
  9. 25 Aug, 2016 7 commits
  10. 24 Aug, 2016 3 commits
  11. 23 Aug, 2016 4 commits
  12. 22 Aug, 2016 1 commit