Skip to content

  • Projects
  • Groups
  • Snippets
  • Help
    • Loading...
  • Sign in / Register
F
ffmpeg
  • Project
    • Project
    • Details
    • Activity
    • Cycle Analytics
  • Repository
    • Repository
    • Files
    • Commits
    • Branches
    • Tags
    • Contributors
    • Graph
    • Compare
    • Charts
  • Issues 0
    • Issues 0
    • List
    • Board
    • Labels
    • Milestones
  • Merge Requests 0
    • Merge Requests 0
  • CI / CD
    • CI / CD
    • Pipelines
    • Jobs
    • Schedules
    • Charts
  • Packages
    • Packages
  • Wiki
    • Wiki
  • Snippets
    • Snippets
  • Members
    • Members
  • Collapse sidebar
  • Activity
  • Graph
  • Charts
  • Create a new issue
  • Jobs
  • Commits
  • Issue Boards
  • submodule
  • ffmpeg
  • Repository

Switch branch/tag
  • ffmpeg
  • libavutil
  • x86
  • lls.asm
Find file
BlameHistoryPermalink
  • Ganesh Ajjanagadde's avatar
    lavu/x86/lls: add fma3 optimizations for update_lls · 5989add4
    Ganesh Ajjanagadde authored Jan 13, 2016
    This improves accuracy (very slightly) and speed for processors having
    fma3.
    
    Sample benchmark (fate flac-16-lpc-cholesky, Haswell):
    old:
    5993610 decicycles in ff_lpc_calc_coefs,      64 runs,      0 skips
    5951528 decicycles in ff_lpc_calc_coefs,     128 runs,      0 skips
    
    new:
    5252410 decicycles in ff_lpc_calc_coefs,      64 runs,      0 skips
    5232869 decicycles in ff_lpc_calc_coefs,     128 runs,      0 skips
    
    Tested with FATE and --disable-fma3, also examined contents of
    lavu/lls-test.
    Reviewed-by: 's avatarJames Almer <jamrial@gmail.com>
    Reviewed-by: 's avatarHenrik Gramner <henrik@gramner.com>
    Signed-off-by: 's avatarGanesh Ajjanagadde <gajjanagadde@gmail.com>
    5989add4
lls.asm 7.58 KB
EditWeb IDE

Replace lls.asm

Attach a file by drag & drop or click to upload


Cancel
A new branch will be created in your fork and a new merge request will be started.