1. 07 Feb, 2018 1 commit
  2. 24 Jan, 2018 1 commit
  3. 23 Jan, 2018 1 commit
  4. 14 Dec, 2017 1 commit
  5. 15 Nov, 2017 1 commit
  6. 09 Nov, 2017 1 commit
  7. 18 Aug, 2017 1 commit
  8. 08 Mar, 2017 1 commit
  9. 03 Mar, 2017 1 commit
  10. 23 Feb, 2017 1 commit
    • Manojkumar Bhosale's avatar
      Add MSA optimized Interpolate/MergeUV/Misc functions · 45b176d1
      Manojkumar Bhosale authored
      BUG=libyuv:634
      
      Change-Id: If8d60bd57f01fe95bc2fd26196466574195cc126
      
      Performance Gain (vs C auto-vectorized)
      InterpolateRow_MSA      - ~3.3x
      InterpolateRow_Any_MSA  - ~2.5x
      ARGBSetRow_MSA          - ~1.0x
      ARGBSetRow_Any_MSA      - ~1.0x
      ARGBToRGB24Row_MSA      - ~1.9x
      ARGBToRGB24Row_Any_MSA  - ~1.6x
      MergeUVRow_MSA          - ~1.6x
      MergeUVRow_Any_MSA      - ~1.2x
      
      Performance Gain (vs C non-vectorized)
      InterpolateRow_MSA      - ~11.3x
      InterpolateRow_Any_MSA  - ~ 7.9x
      ARGBSetRow_MSA          - ~ 6.2x
      ARGBSetRow_Any_MSA      - ~ 4.0x
      ARGBToRGB24Row_MSA      - ~ 9.9x
      ARGBToRGB24Row_Any_MSA  - ~ 8.4x
      MergeUVRow_MSA          - ~12.7x
      MergeUVRow_Any_MSA      - ~ 8.0x
      
      Change-Id: If8d60bd57f01fe95bc2fd26196466574195cc126
      Reviewed-on: https://chromium-review.googlesource.com/445817Reviewed-by: 's avatarFrank Barchard <fbarchard@google.com>
      Commit-Queue: Frank Barchard <fbarchard@google.com>
      45b176d1
  11. 15 Feb, 2017 1 commit
  12. 11 Feb, 2017 1 commit
  13. 11 Jan, 2017 1 commit
    • Frank Barchard's avatar
      Libyuv MIPS DSPR2 optimizations. · 000d2fa9
      Frank Barchard authored
      Optimized functions:
      
      I444ToARGBRow_DSPR2
      I422ToARGB4444Row_DSPR2
      I422ToARGB1555Row_DSPR2
      NV12ToARGBRow_DSPR2
      BGRAToUVRow_DSPR2
      BGRAToYRow_DSPR2
      ABGRToUVRow_DSPR2
      ARGBToYRow_DSPR2
      ABGRToYRow_DSPR2
      RGBAToUVRow_DSPR2
      RGBAToYRow_DSPR2
      ARGBToUVRow_DSPR2
      RGB24ToARGBRow_DSPR2
      RAWToARGBRow_DSPR2
      RGB565ToARGBRow_DSPR2
      ARGB1555ToARGBRow_DSPR2
      ARGB4444ToARGBRow_DSPR2
      ScaleAddRow_DSPR2
      
      Bug-fixes in functions:
      
      ScaleRowDown2_DSPR2
      ScaleRowDown4_DSPR2
      
      BUG=
      
      Review-Url: https://codereview.chromium.org/2626123003 .
      000d2fa9
  14. 21 Dec, 2016 1 commit
    • Manojkumar Bhosale's avatar
      Add MSA optimized remaining scale row functions · 288bfbef
      Manojkumar Bhosale authored
      R=fbarchard@google.com
      BUG=libyuv:634
      
      Performance Gain (vs C vectorized)
      ScaleRowDown2_MSA            - ~22.3x
      ScaleRowDown2_Any_MSA        - ~19.9x
      ScaleRowDown2Linear_MSA      - ~31.2x
      ScaleRowDown2Linear_Any_MSA  - ~29.4x
      ScaleRowDown2Box_MSA         - ~20.1x
      ScaleRowDown2Box_Any_MSA     - ~19.6x
      ScaleRowDown4_MSA            - ~11.7x
      ScaleRowDown4_Any_MSA        - ~11.2x
      ScaleRowDown4Box_MSA         - ~15.1x
      ScaleRowDown4Box_Any_MSA     - ~15.1x
      ScaleRowDown38_MSA           - ~1x
      ScaleRowDown38_Any_MSA       - ~1x
      ScaleRowDown38_2_Box_MSA     - ~1.7x
      ScaleRowDown38_2_Box_Any_MSA - ~1.7x
      ScaleRowDown38_3_Box_MSA     - ~1.7x
      ScaleRowDown38_3_Box_Any_MSA - ~1.7x
      ScaleAddRow_MSA              - ~1.2x
      ScaleAddRow_Any_MSA          - ~1.15x
      
      Performance Gain (vs C non-vectorized)
      ScaleRowDown2_MSA            - ~22.4x
      ScaleRowDown2_Any_MSA        - ~19.8x
      ScaleRowDown2Linear_MSA      - ~31.6x
      ScaleRowDown2Linear_Any_MSA  - ~29.4x
      ScaleRowDown2Box_MSA         - ~20.1x
      ScaleRowDown2Box_Any_MSA     - ~19.6x
      ScaleRowDown4_MSA            - ~11.7x
      ScaleRowDown4_Any_MSA        - ~11.2x
      ScaleRowDown4Box_MSA         - ~15.1x
      ScaleRowDown4Box_Any_MSA     - ~15.1x
      ScaleRowDown38_MSA           - ~3.2x
      ScaleRowDown38_Any_MSA       - ~3.2x
      ScaleRowDown38_2_Box_MSA     - ~2.4x
      ScaleRowDown38_2_Box_Any_MSA - ~2.3x
      ScaleRowDown38_3_Box_MSA     - ~2.9x
      ScaleRowDown38_3_Box_Any_MSA - ~2.8x
      ScaleAddRow_MSA              - ~8x
      ScaleAddRow_Any_MSA          - ~7.46x
      
      Review-Url: https://codereview.chromium.org/2559683002 .
      288bfbef
  15. 08 Nov, 2016 1 commit
  16. 16 Sep, 2016 1 commit
  17. 05 Feb, 2016 1 commit
  18. 02 Feb, 2016 1 commit
  19. 17 Dec, 2015 1 commit
  20. 16 Dec, 2015 1 commit
  21. 15 Dec, 2015 1 commit
  22. 09 Jun, 2015 1 commit
    • fbarchard@google.com's avatar
      Box filter for YUV use rows with accumulation buffer for better memory behavior.… · 05416e2d
      fbarchard@google.com authored
      Box filter for YUV use rows with accumulation buffer for better memory behavior.  The old code would do columns accumulated into registers, and then store the result once.  This was slow from a memory point of view.  The new code does a row of source at a time, updating an accumulation buffer every row.  The accumulation buffer is small, and should fit cache.  Before each accumulation of N rows, the buffer needs to be reset to zero.  If the memset is a bottleneck, it would be faster to do the first row without an add, storing to the accumulation buffer, and then add for the remaining rows.
      BUG=425
      TESTED=out\release\libyuv_unittest --gtest_filter=*ScaleTo1x1*
      R=harryjin@google.com
      
      Review URL: https://webrtc-codereview.appspot.com/52659004
      
      git-svn-id: http://libyuv.googlecode.com/svn/trunk@1428 16f28f9a-4ce2-e073-06de-1de4eb20be90
      05416e2d
  23. 30 Apr, 2015 1 commit
  24. 28 Apr, 2015 3 commits
  25. 27 Apr, 2015 1 commit
  26. 22 Apr, 2015 1 commit
  27. 17 Apr, 2015 1 commit
  28. 16 Apr, 2015 1 commit
  29. 14 Apr, 2015 1 commit
  30. 13 Apr, 2015 1 commit
  31. 07 Apr, 2015 1 commit
  32. 03 Apr, 2015 1 commit
  33. 02 Apr, 2015 1 commit
  34. 30 Mar, 2015 1 commit
  35. 26 Mar, 2015 1 commit
  36. 24 Mar, 2015 1 commit
  37. 19 Mar, 2015 1 commit
  38. 16 Mar, 2015 1 commit