diff options
author | Ganesh Ajjanagadde <gajjanagadde@gmail.com> | 2016-01-13 17:59:26 -0500 |
---|---|---|
committer | Ganesh Ajjanagadde <gajjanagadde@gmail.com> | 2016-01-15 16:46:13 -0500 |
commit | 5989add4ab4e8e4daa406a66319b0a3b3faaa73d (patch) | |
tree | fa59b1ead68044bf0a10a9a75b3f112ee36b0a36 /libavutil/arm/float_dsp_neon.S | |
parent | d4ce63a1bf2520be7015df78dd8b042abe456c23 (diff) | |
download | ffmpeg-5989add4ab4e8e4daa406a66319b0a3b3faaa73d.tar.gz |
lavu/x86/lls: add fma3 optimizations for update_lls
This improves accuracy (very slightly) and speed for processors having
fma3.
Sample benchmark (fate flac-16-lpc-cholesky, Haswell):
old:
5993610 decicycles in ff_lpc_calc_coefs, 64 runs, 0 skips
5951528 decicycles in ff_lpc_calc_coefs, 128 runs, 0 skips
new:
5252410 decicycles in ff_lpc_calc_coefs, 64 runs, 0 skips
5232869 decicycles in ff_lpc_calc_coefs, 128 runs, 0 skips
Tested with FATE and --disable-fma3, also examined contents of
lavu/lls-test.
Reviewed-by: James Almer <jamrial@gmail.com>
Reviewed-by: Henrik Gramner <henrik@gramner.com>
Signed-off-by: Ganesh Ajjanagadde <gajjanagadde@gmail.com>
Diffstat (limited to 'libavutil/arm/float_dsp_neon.S')
0 files changed, 0 insertions, 0 deletions