1. 14 Dec, 2017 1 commit
  2. 18 Aug, 2017 1 commit
  3. 09 Jun, 2017 1 commit
  4. 06 Jun, 2017 1 commit
  5. 11 Jan, 2017 1 commit
    • Libyuv MIPS DSPR2 optimizations. · 000d2fa9
      Frank Barchard authored
      Optimized functions:
      
      I444ToARGBRow_DSPR2
      I422ToARGB4444Row_DSPR2
      I422ToARGB1555Row_DSPR2
      NV12ToARGBRow_DSPR2
      BGRAToUVRow_DSPR2
      BGRAToYRow_DSPR2
      ABGRToUVRow_DSPR2
      ARGBToYRow_DSPR2
      ABGRToYRow_DSPR2
      RGBAToUVRow_DSPR2
      RGBAToYRow_DSPR2
      ARGBToUVRow_DSPR2
      RGB24ToARGBRow_DSPR2
      RAWToARGBRow_DSPR2
      RGB565ToARGBRow_DSPR2
      ARGB1555ToARGBRow_DSPR2
      ARGB4444ToARGBRow_DSPR2
      ScaleAddRow_DSPR2
      
      Bug-fixes in functions:
      
      ScaleRowDown2_DSPR2
      ScaleRowDown4_DSPR2
      
      BUG=
      
      Review-Url: https://codereview.chromium.org/2626123003 .
  6. 21 Dec, 2016 1 commit
    • Add MSA optimized remaining scale row functions · 288bfbef
      Manojkumar Bhosale authored
      R=fbarchard@google.com
      BUG=libyuv:634
      
      Performance Gain (vs C vectorized)
      ScaleRowDown2_MSA            - ~22.3x
      ScaleRowDown2_Any_MSA        - ~19.9x
      ScaleRowDown2Linear_MSA      - ~31.2x
      ScaleRowDown2Linear_Any_MSA  - ~29.4x
      ScaleRowDown2Box_MSA         - ~20.1x
      ScaleRowDown2Box_Any_MSA     - ~19.6x
      ScaleRowDown4_MSA            - ~11.7x
      ScaleRowDown4_Any_MSA        - ~11.2x
      ScaleRowDown4Box_MSA         - ~15.1x
      ScaleRowDown4Box_Any_MSA     - ~15.1x
      ScaleRowDown38_MSA           - ~1x
      ScaleRowDown38_Any_MSA       - ~1x
      ScaleRowDown38_2_Box_MSA     - ~1.7x
      ScaleRowDown38_2_Box_Any_MSA - ~1.7x
      ScaleRowDown38_3_Box_MSA     - ~1.7x
      ScaleRowDown38_3_Box_Any_MSA - ~1.7x
      ScaleAddRow_MSA              - ~1.2x
      ScaleAddRow_Any_MSA          - ~1.15x
      
      Performance Gain (vs C non-vectorized)
      ScaleRowDown2_MSA            - ~22.4x
      ScaleRowDown2_Any_MSA        - ~19.8x
      ScaleRowDown2Linear_MSA      - ~31.6x
      ScaleRowDown2Linear_Any_MSA  - ~29.4x
      ScaleRowDown2Box_MSA         - ~20.1x
      ScaleRowDown2Box_Any_MSA     - ~19.6x
      ScaleRowDown4_MSA            - ~11.7x
      ScaleRowDown4_Any_MSA        - ~11.2x
      ScaleRowDown4Box_MSA         - ~15.1x
      ScaleRowDown4Box_Any_MSA     - ~15.1x
      ScaleRowDown38_MSA           - ~3.2x
      ScaleRowDown38_Any_MSA       - ~3.2x
      ScaleRowDown38_2_Box_MSA     - ~2.4x
      ScaleRowDown38_2_Box_Any_MSA - ~2.3x
      ScaleRowDown38_3_Box_MSA     - ~2.9x
      ScaleRowDown38_3_Box_Any_MSA - ~2.8x
      ScaleAddRow_MSA              - ~8x
      ScaleAddRow_Any_MSA          - ~7.46x
      
      Review-Url: https://codereview.chromium.org/2559683002 .
  7. 07 Dec, 2016 1 commit
    • Add MSA optimized ARGB scaling functions · 56b5bbb0
      Manojkumar Bhosale authored
      R=fbarchard@google.com
      BUG=libyuv:634
      
      Performance Gain (vs C vectorized)
      ScaleARGBRowDown2_MSA           - ~2.6x
      ScaleARGBRowDown2Linear_MSA     - ~7.9x
      ScaleARGBRowDown2Box_MSA        - ~3.7x
      ScaleARGBRowDownEven_MSA        - ~1.2x
      ScaleARGBRowDownEvenBox_MSA     - ~3.5x
      
      ScaleARGBRowDown2_Any_MSA       - ~2.6x
      ScaleARGBRowDown2Linear_Any_MSA - ~7.9x
      ScaleARGBRowDown2Box_Any_MSA    - ~3.6x
      ScaleARGBRowDownEven_Any_MSA    - ~1.2x
      ScaleARGBRowDownEvenBox_Any_MSA - ~3.5x
      
      Performance Gain (vs C non-vectorized)
      ScaleARGBRowDown2_MSA           - 2.6x
      ScaleARGBRowDown2Linear_MSA     - 13.5x
      ScaleARGBRowDown2Box_MSA        - 5.8x
      ScaleARGBRowDownEven_MSA        - 1.2x
      ScaleARGBRowDownEvenBox_MSA     - 3.7x
      
      ScaleARGBRowDown2_Any_MSA       - 2.6x
      ScaleARGBRowDown2Linear_Any_MSA - 13.5x
      ScaleARGBRowDown2Box_Any_MSA    - 5.3x
      ScaleARGBRowDownEven_Any_MSA    - 1.2x
      ScaleARGBRowDownEvenBox_Any_MSA - 3.7x
      
      Review URL: https://codereview.chromium.org/2527983002 .
  8. 08 Nov, 2016 1 commit
  9. 06 Jan, 2016 1 commit
  10. 16 Dec, 2015 1 commit
  11. 15 Dec, 2015 1 commit
  12. 09 Jun, 2015 1 commit
    • Box filter for YUV use rows with accumulation buffer for better memory behavior.… · 05416e2d
      fbarchard@google.com authored
      Box filter for YUV: use rows with an accumulation buffer for better memory behavior.
      The old code accumulated columns into registers and then stored the result once, which was slow from a memory point of view. The new code processes one row of source at a time, updating an accumulation buffer on every row. The accumulation buffer is small and should fit in cache. Before each accumulation of N rows, the buffer needs to be reset to zero. If the memset is a bottleneck, it would be faster to do the first row without an add, storing directly to the accumulation buffer, and then add for the remaining rows.
      BUG=425
      TESTED=out\release\libyuv_unittest --gtest_filter=*ScaleTo1x1*
      R=harryjin@google.com
      
      Review URL: https://webrtc-codereview.appspot.com/52659004
      
      git-svn-id: http://libyuv.googlecode.com/svn/trunk@1428 16f28f9a-4ce2-e073-06de-1de4eb20be90
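The row-at-a-time accumulation described in this commit message can be sketched roughly as follows. This is a minimal illustration, not the libyuv implementation: the names `AddRowSketch` and `BoxDownscaleSketch`, the integer box factors, the 16-bit accumulator type, and the rounding rule are all assumptions made for the example.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical helper: add one row of 8-bit source pixels into a
 * 16-bit accumulation row. */
void AddRowSketch(const uint8_t* src, uint16_t* acc, int width) {
  for (int x = 0; x < width; ++x) {
    acc[x] = (uint16_t)(acc[x] + src[x]);
  }
}

/* Hypothetical box downscale of one 8-bit plane by integer factors
 * box_w x box_h. The caller supplies acc, which must hold src_width
 * uint16_t values. box_h is limited to 257 so that each column sum
 * fits in 16 bits (255 * 257 = 65535). */
void BoxDownscaleSketch(const uint8_t* src, int src_stride,
                        int src_width, int src_height,
                        uint8_t* dst, int dst_stride,
                        int box_w, int box_h, uint16_t* acc) {
  assert(box_w > 0 && box_h > 0 && box_h <= 257);
  int dst_width = src_width / box_w;
  int dst_height = src_height / box_h;
  uint32_t area = (uint32_t)(box_w * box_h);
  for (int dy = 0; dy < dst_height; ++dy) {
    /* Reset the buffer to zero before each group of box_h rows. */
    memset(acc, 0, (size_t)src_width * sizeof(acc[0]));
    /* Vertical pass: one full source row at a time, so memory is read
     * sequentially and the small buffer can stay in cache. */
    for (int r = 0; r < box_h; ++r) {
      AddRowSketch(src + (size_t)(dy * box_h + r) * (size_t)src_stride,
                   acc, src_width);
    }
    /* Horizontal pass: combine box_w column sums, divide with rounding. */
    for (int dx = 0; dx < dst_width; ++dx) {
      uint32_t sum = 0;
      for (int c = 0; c < box_w; ++c) {
        sum += acc[dx * box_w + c];
      }
      dst[(size_t)dy * (size_t)dst_stride + (size_t)dx] =
          (uint8_t)((sum + area / 2) / area);
    }
  }
}
```

The variant suggested at the end of the message would replace the `memset` plus first add with a plain widening copy of the first row into `acc`, then call the add helper only for the remaining `box_h - 1` rows.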
  13. 26 May, 2015 1 commit
  14. 22 May, 2015 1 commit
  15. 30 Apr, 2015 2 commits
  16. 27 Apr, 2015 1 commit
  17. 24 Apr, 2015 1 commit
  18. 22 Apr, 2015 1 commit
  19. 07 Apr, 2015 1 commit
  20. 03 Apr, 2015 1 commit
  21. 31 Mar, 2015 1 commit
  22. 30 Mar, 2015 1 commit
  23. 26 Mar, 2015 1 commit
  24. 24 Mar, 2015 1 commit
  25. 19 Mar, 2015 1 commit