author | Hubert Mazur <hum@semihalf.com> | 2022-10-28 11:34:39 +0000 |
---|---|---|
committer | Martin Storsjö <martin@martin.st> | 2022-11-01 15:24:58 +0200 |
commit | 2537fdc510ba2b5eca3e5c7a2b6d773498daa418 (patch) | |
tree | 43893edc1ea5a67923fee897910d538b4fce682c /libavcodec/mss3.c | |
parent | 9ccf8c5bfc3939762e421f4addb6fb71600e2026 (diff) | |
download | ffmpeg-2537fdc510ba2b5eca3e5c7a2b6d773498daa418.tar.gz |
sw_scale: Add specializations for hscale 16 to 19
Provide arm64 neon optimized implementations for hscale16To19 with
filter sizes 4, 8 and X4.
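
For orientation, the operation being specialized can be sketched as a scalar loop: each output sample is a dot product of filterSize 16-bit input samples with 16-bit filter coefficients, shifted down and clamped to 19 bits. This is a minimal sketch under stated assumptions, not the code from this commit: the function name, the explicit `shift` parameter, and the 64-bit accumulator are illustrative choices. A shift of 11 assumes 14-bit coefficients and full 16-bit input; "X4" is read here as "any multiple of 4", which matches the benchmarked sizes but is not spelled out in the message.

```c
#include <assert.h>
#include <stdint.h>

/*
 * Hypothetical scalar sketch of a 16-bit to 19-bit horizontal scale.
 * Names and signature are illustrative, not FFmpeg's internal API.
 *
 * dst        : dst_w output samples, each clamped to 19 bits
 * src        : 16-bit input samples
 * filter     : dst_w * filter_size coefficients, row i used for dst[i]
 * filter_pos : index of the first input sample used for dst[i]
 * shift      : right shift applied to the accumulated sum (assumption:
 *              11 for 14-bit coefficients and full 16-bit input)
 */
static void hscale_16_to_19_sketch(int32_t *dst, int dst_w,
                                   const uint16_t *src,
                                   const int16_t *filter,
                                   const int32_t *filter_pos,
                                   int filter_size, int shift)
{
    const int32_t max19 = (1 << 19) - 1;

    assert(filter_size > 0 && shift >= 0);

    for (int i = 0; i < dst_w; i++) {
        const uint16_t *s = src + filter_pos[i];
        const int16_t *f = filter + i * filter_size;
        /* 64-bit accumulator so the sketch cannot overflow. */
        int64_t acc = 0;

        for (int j = 0; j < filter_size; j++)
            acc += (int64_t)s[j] * f[j];

        /* Right shift of a negative value is implementation-defined in C;
         * common compilers perform an arithmetic shift. */
        int64_t v = acc >> shift;
        dst[i] = v > max19 ? max19 : (int32_t)v;
    }
}
```

A fixed filter size of 4 or 8 lets the inner loop be fully unrolled into vector multiply-accumulate operations, which is presumably why those sizes get dedicated routines alongside the generic multiple-of-4 case.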
The tests and benchmarks were run on AWS Graviton 2 instances.
The checkasm results are shown below.
hscale_16_to_19__fs_4_dstW_512_c: 6216.0
hscale_16_to_19__fs_4_dstW_512_neon: 2257.0
hscale_16_to_19__fs_8_dstW_512_c: 10417.7
hscale_16_to_19__fs_8_dstW_512_neon: 3112.5
hscale_16_to_19__fs_12_dstW_512_c: 14890.5
hscale_16_to_19__fs_12_dstW_512_neon: 3899.0
hscale_16_to_19__fs_16_dstW_512_c: 19006.5
hscale_16_to_19__fs_16_dstW_512_neon: 5341.2
hscale_16_to_19__fs_32_dstW_512_c: 36629.5
hscale_16_to_19__fs_32_dstW_512_neon: 9502.7
hscale_16_to_19__fs_40_dstW_512_c: 45477.5
hscale_16_to_19__fs_40_dstW_512_neon: 11552.0
(Note, the checkasm tests for these functions haven't been
merged since they fail on x86.)
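
To read the figures above as speedups, divide each C timing by the matching NEON timing. The sketch below does that; the struct and function names are hypothetical, and the numbers are copied from the checkasm output in this message (checkasm timer units, where lower is faster).

```c
#include <assert.h>

/* One checkasm result pair; field names are illustrative. */
struct bench_row {
    int filter_size;
    double c_time;     /* reference C implementation */
    double neon_time;  /* arm64 NEON implementation */
};

/* Values copied from the checkasm output quoted in the commit message. */
static const struct bench_row bench_rows[] = {
    {  4,  6216.0,  2257.0 },
    {  8, 10417.7,  3112.5 },
    { 12, 14890.5,  3899.0 },
    { 16, 19006.5,  5341.2 },
    { 32, 36629.5,  9502.7 },
    { 40, 45477.5, 11552.0 },
};

/* Ratio of C time to NEON time; values above 1.0 mean NEON is faster. */
static double speedup(const struct bench_row *row)
{
    assert(row->neon_time > 0.0);
    return row->c_time / row->neon_time;
}
```

Calling `speedup(&bench_rows[i])` for each row gives ratios from roughly 2.75 at filter size 4 to roughly 3.94 at filter size 40.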
Signed-off-by: Hubert Mazur <hum@semihalf.com>
Signed-off-by: Martin Storsjö <martin@martin.st>