author Clément Bœsch <u@pkh.me> 2018-05-06 12:34:54 +0200
committer Clément Bœsch <u@pkh.me> 2018-05-08 10:28:06 +0200
commit 43d16aef6395340c248ee79d35f60f9965427a45
tree c844c123134ec0e83d695ce4e8e4e7998a8051c6
parent f679711c1b516786a39f9e582622a200502fff74
lavfi/nlmeans: make compute_safe_ssd_integral_image_c faster

before: ssd_integral_image_c: 49204.6
after:  ssd_integral_image_c: 44272.8

Unrolling by 4 made the biggest difference on odroid-c2 (aarch64);
unrolling by 2 or by 8 both took about 46k cycles, versus 44k for 4.

Additionally, this is a much better reference when writing SIMD (SIMD
vectorization will just target 16 instead of 4).
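
To illustrate the kind of transformation described above, here is a minimal sketch of an integral image of squared differences, written once as a plain loop and once unrolled by 4. It is not the code from this commit: the function names `ssd_integral_ref` and `ssd_integral_unroll4`, the argument layout, and the handling of the first row and of widths that are not a multiple of 4 are all assumptions made for the example.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical scalar reference (not FFmpeg's function):
 * dst[y][x] = sum of (s1 - s2)^2 over rows 0..y and columns 0..x.
 * Strides are given in elements. */
static void ssd_integral_ref(uint32_t *dst, ptrdiff_t dst_stride,
                             const uint8_t *s1, const uint8_t *s2,
                             ptrdiff_t src_stride, int w, int h)
{
    for (int y = 0; y < h; y++) {
        uint32_t row_sum = 0; /* running sum of squared diffs on this row */
        for (int x = 0; x < w; x++) {
            const int d = s1[x] - s2[x];
            row_sum += (uint32_t)(d * d);
            dst[x] = row_sum + (y ? dst[x - dst_stride] : 0);
        }
        s1  += src_stride;
        s2  += src_stride;
        dst += dst_stride;
    }
}

/* Same computation with the inner loop unrolled by 4 (assumed layout).
 * The four differences are independent of each other; only the running
 * sum carries a dependency from one pixel to the next. */
static void ssd_integral_unroll4(uint32_t *dst, ptrdiff_t dst_stride,
                                 const uint8_t *s1, const uint8_t *s2,
                                 ptrdiff_t src_stride, int w, int h)
{
    for (int y = 0; y < h; y++) {
        uint32_t row_sum = 0;
        int x = 0;

        for (; x + 4 <= w; x += 4) {
            const int d0 = s1[x    ] - s2[x    ];
            const int d1 = s1[x + 1] - s2[x + 1];
            const int d2 = s1[x + 2] - s2[x + 2];
            const int d3 = s1[x + 3] - s2[x + 3];

            const uint32_t a0 = row_sum + (uint32_t)(d0 * d0);
            const uint32_t a1 = a0      + (uint32_t)(d1 * d1);
            const uint32_t a2 = a1      + (uint32_t)(d2 * d2);
            const uint32_t a3 = a2      + (uint32_t)(d3 * d3);

            dst[x    ] = a0 + (y ? dst[x     - dst_stride] : 0);
            dst[x + 1] = a1 + (y ? dst[x + 1 - dst_stride] : 0);
            dst[x + 2] = a2 + (y ? dst[x + 2 - dst_stride] : 0);
            dst[x + 3] = a3 + (y ? dst[x + 3 - dst_stride] : 0);

            row_sum = a3;
        }

        /* Tail for widths that are not a multiple of 4. */
        for (; x < w; x++) {
            const int d = s1[x] - s2[x];
            row_sum += (uint32_t)(d * d);
            dst[x] = row_sum + (y ? dst[x - dst_stride] : 0);
        }

        s1  += src_stride;
        s2  += src_stride;
        dst += dst_stride;
    }
}
```

Both functions should produce identical output for any input. Whether the unrolled form is actually faster depends on the compiler and the CPU, which is consistent with the commit's observation that factors of 2 and 8 performed worse than 4 on the tested board.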