author: Ganesh Ajjanagadde <gajjanagadde@gmail.com>, 2016-01-13 17:59:26 -0500
committer: Ganesh Ajjanagadde <gajjanagadde@gmail.com>, 2016-01-15 16:46:13 -0500
commit: 5989add4ab4e8e4daa406a66319b0a3b3faaa73d
tree: fa59b1ead68044bf0a10a9a75b3f112ee36b0a36 /libavfilter/unsharp_opencl_kernel.h
parent: d4ce63a1bf2520be7015df78dd8b042abe456c23
lavu/x86/lls: add fma3 optimizations for update_lls
This improves accuracy (very slightly) and speed for processors having fma3.

Sample benchmark (fate flac-16-lpc-cholesky, Haswell):
old:
5993610 decicycles in ff_lpc_calc_coefs, 64 runs, 0 skips
5951528 decicycles in ff_lpc_calc_coefs, 128 runs, 0 skips
new:
5252410 decicycles in ff_lpc_calc_coefs, 64 runs, 0 skips
5232869 decicycles in ff_lpc_calc_coefs, 128 runs, 0 skips

Tested with FATE and --disable-fma3, also examined contents of lavu/lls-test.

Reviewed-by: James Almer <jamrial@gmail.com>
Reviewed-by: Henrik Gramner <henrik@gramner.com>
Signed-off-by: Ganesh Ajjanagadde <gajjanagadde@gmail.com>
Diffstat (limited to 'libavfilter/unsharp_opencl_kernel.h')
0 files changed, 0 insertions, 0 deletions