aboutsummaryrefslogtreecommitdiffstats
path: root/libavfilter/deshake_opencl_kernel.h
diff options
context:
space:
mode:
authorGanesh Ajjanagadde <gajjanagadde@gmail.com>2015-12-16 17:39:28 -0800
committerGanesh Ajjanagadde <gajjanagadde@gmail.com>2015-12-30 08:52:19 -0800
commit05434b0eea3f959b8b44be97c56bad6ab6a0dc22 (patch)
tree0d23af996ea83569ee2a88ee4a0029a8e1cd472e /libavfilter/deshake_opencl_kernel.h
parentb272c3a5aabeafb37d5aeecc4524e8a8aeecf9da (diff)
downloadffmpeg-05434b0eea3f959b8b44be97c56bad6ab6a0dc22.tar.gz
lavc/cook: get rid of wasteful pow in init_pow2table
The table is highly structured, so pow (or exp2 for that matter) can entirely be avoided, yielding a ~ 40x speedup with no loss of accuracy. sample benchmark (Haswell, GNU/Linux): new: 4449 decicycles in init_pow2table(loop 1000), 254 runs, 2 skips 4411 decicycles in init_pow2table(loop 1000), 510 runs, 2 skips 4391 decicycles in init_pow2table(loop 1000), 1022 runs, 2 skips old: 183673 decicycles in init_pow2table(loop 1000), 256 runs, 0 skips 182142 decicycles in init_pow2table(loop 1000), 512 runs, 0 skips 182104 decicycles in init_pow2table(loop 1000), 1024 runs, 0 skips Reviewed-by: Clément Bœsch <u@pkh.me> Signed-off-by: Ganesh Ajjanagadde <gajjanagadde@gmail.com>
Diffstat (limited to 'libavfilter/deshake_opencl_kernel.h')
0 files changed, 0 insertions, 0 deletions