path: root/libavcodec/huffyuvdsp.c
author: Hubert Mazur <hum@semihalf.com> 2022-10-28 11:34:39 +0000
committer: Martin Storsjö <martin@martin.st> 2022-11-01 15:24:58 +0200
commit: 2537fdc510ba2b5eca3e5c7a2b6d773498daa418
tree: 43893edc1ea5a67923fee897910d538b4fce682c /libavcodec/huffyuvdsp.c
parent: 9ccf8c5bfc3939762e421f4addb6fb71600e2026
sw_scale: Add specializations for hscale 16 to 19

Provide arm64 neon optimized implementations for
hscale16To19 with filter sizes 4, 8 and X4.

The tests and benchmarks run on AWS Graviton 2 instances.
The results from a checkasm tool are shown below.

hscale_16_to_19__fs_4_dstW_512_c: 6216.0
hscale_16_to_19__fs_4_dstW_512_neon: 2257.0
hscale_16_to_19__fs_8_dstW_512_c: 10417.7
hscale_16_to_19__fs_8_dstW_512_neon: 3112.5
hscale_16_to_19__fs_12_dstW_512_c: 14890.5
hscale_16_to_19__fs_12_dstW_512_neon: 3899.0
hscale_16_to_19__fs_16_dstW_512_c: 19006.5
hscale_16_to_19__fs_16_dstW_512_neon: 5341.2
hscale_16_to_19__fs_32_dstW_512_c: 36629.5
hscale_16_to_19__fs_32_dstW_512_neon: 9502.7
hscale_16_to_19__fs_40_dstW_512_c: 45477.5
hscale_16_to_19__fs_40_dstW_512_neon: 11552.0

(Note, the checkasm tests for these functions haven't been
merged since they fail on x86.)

Signed-off-by: Hubert Mazur <hum@semihalf.com>
Signed-off-by: Martin Storsjö <martin@martin.st>
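
For readers unfamiliar with the routine being optimized, the following is a minimal scalar sketch of what a 16-bit to 19-bit horizontal scale computes. It is an illustration under stated assumptions, not the code from this commit: the function name hscale_16_to_19_sketch and its parameter names are hypothetical, and the shift of 11 assumes full 16-bit input samples with 14-bit filter coefficients.

```c
#include <stdint.h>

/*
 * Hypothetical scalar sketch of a 16-bit to 19-bit horizontal scale.
 * For each output sample, filter_size input samples starting at
 * filter_pos[i] are weighted by the matching coefficients and summed.
 *
 * Assumptions (not taken from the commit itself):
 *   - input samples use the full 16 bits,
 *   - coefficients are 14-bit fixed point (1 << 14 represents 1.0),
 *   - so the 30-bit sum is shifted right by 11 to give 19 bits,
 *   - right shift of a negative sum is arithmetic on the target compiler.
 */
static void hscale_16_to_19_sketch(int32_t *dst, int dst_w,
                                   const uint16_t *src,
                                   const int16_t *filter,
                                   const int32_t *filter_pos,
                                   int filter_size)
{
    const int shift = 11;                 /* 16 + 14 - 11 = 19 bits */
    const int32_t max19 = (1 << 19) - 1;  /* upper clip for 19-bit output */

    for (int i = 0; i < dst_w; i++) {
        int32_t acc = 0;

        /* Dot product of one filter row with the source window. */
        for (int j = 0; j < filter_size; j++)
            acc += src[filter_pos[i] + j] * filter[filter_size * i + j];

        acc >>= shift;
        dst[i] = acc < max19 ? acc : max19;
    }
}
```

The inner loop is a short dot product per output sample, which is the part a SIMD specialization for a fixed filter size can unroll and vectorize.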
Diffstat (limited to 'libavcodec/huffyuvdsp.c')
0 files changed, 0 insertions, 0 deletions