diff options
author | Hubert Mazur <hum@semihalf.com> | 2022-10-28 11:34:38 +0000 |
---|---|---|
committer | Martin Storsjö <martin@martin.st> | 2022-11-01 15:24:53 +0200 |
commit | 9ccf8c5bfc3939762e421f4addb6fb71600e2026 (patch) | |
tree | 00cd72a12243a2c67eec34173cfc25eda28305e6 /libavfilter/boxblur.c | |
parent | 1e9cfa5bb0253784798029bda377823ab5d1a529 (diff) | |
download | ffmpeg-9ccf8c5bfc3939762e421f4addb6fb71600e2026.tar.gz |
sw_scale: Add specializations for hscale 16 to 15
Add arm64 neon implementations for hscale 16 to 15 with filter
sizes 4, 8 and X4.
The tests and benchmarks run on AWS Graviton 2 instances.
The results from a checkasm tool are shown below.
hscale_16_to_15__fs_4_dstW_512_c: 6703.5
hscale_16_to_15__fs_4_dstW_512_neon: 2298.0
hscale_16_to_15__fs_8_dstW_512_c: 10983.0
hscale_16_to_15__fs_8_dstW_512_neon: 3216.5
hscale_16_to_15__fs_12_dstW_512_c: 15526.0
hscale_16_to_15__fs_12_dstW_512_neon: 3993.0
hscale_16_to_15__fs_16_dstW_512_c: 20183.5
hscale_16_to_15__fs_16_dstW_512_neon: 5369.7
hscale_16_to_15__fs_32_dstW_512_c: 39315.2
hscale_16_to_15__fs_32_dstW_512_neon: 9511.2
hscale_16_to_15__fs_40_dstW_512_c: 48995.7
hscale_16_to_15__fs_40_dstW_512_neon: 11570.0
(Note, the checkasm tests for these functions haven't been
merged since they fail on x86.)
Signed-off-by: Hubert Mazur <hum@semihalf.com>
Signed-off-by: Martin Storsjö <martin@martin.st>
Diffstat (limited to 'libavfilter/boxblur.c')
0 files changed, 0 insertions, 0 deletions