diff options
author | Clément Bœsch <u@pkh.me> | 2018-05-06 12:34:54 +0200 |
---|---|---|
committer | Clément Bœsch <u@pkh.me> | 2018-05-08 10:28:06 +0200 |
commit | 43d16aef6395340c248ee79d35f60f9965427a45 (patch) | |
tree | c844c123134ec0e83d695ce4e8e4e7998a8051c6 /libavcodec/vaapi_encode_mjpeg.c | |
parent | f679711c1b516786a39f9e582622a200502fff74 (diff) | |
download | ffmpeg-43d16aef6395340c248ee79d35f60f9965427a45.tar.gz |
lavfi/nlmeans: make compute_safe_ssd_integral_image_c faster
before: ssd_integral_image_c: 49204.6
after: ssd_integral_image_c: 44272.8
Unrolling by 4 made the biggest difference on odroid-c2 (aarch64);
unrolling by 2 or 8 both raised 46k cycles vs 44k for 4.
Additionally, this is a much better reference when writing SIMD (SIMD
vectorization will just target 16 instead of 4).
Diffstat (limited to 'libavcodec/vaapi_encode_mjpeg.c')
0 files changed, 0 insertions, 0 deletions