aboutsummaryrefslogtreecommitdiffstats
path: root/libavcodec/aarch64/h264dsp_neon.S
diff options
context:
space:
mode:
authorMartin Storsjö <martin@martin.st>2016-11-23 14:03:05 +0200
committerMartin Storsjö <martin@martin.st>2017-03-11 13:14:22 +0200
commitdc47bf3872fde18f6aea33f015a95b70e24dff19 (patch)
tree16840d3da4ba143f03173a4f54d4d865c9aa5674 /libavcodec/aarch64/h264dsp_neon.S
parentf8fcee0daf42d16581ec581bd5ff6373c3a76103 (diff)
downloadffmpeg-dc47bf3872fde18f6aea33f015a95b70e24dff19.tar.gz
aarch64: vp9itxfm: Make the larger core transforms standalone functions
This work is sponsored by, and copyright, Google. This reduces the code size of libavcodec/aarch64/vp9itxfm_neon.o from 19496 to 14740 bytes. This gives a small slowdown of a couple of tens of cycles, but makes it more feasible to add more optimized versions of these transforms. Before: vp9_inv_dct_dct_16x16_sub4_add_neon: 1036.7 vp9_inv_dct_dct_16x16_sub16_add_neon: 1372.2 vp9_inv_dct_dct_32x32_sub4_add_neon: 5180.0 vp9_inv_dct_dct_32x32_sub32_add_neon: 8095.7 After: vp9_inv_dct_dct_16x16_sub4_add_neon: 1051.0 vp9_inv_dct_dct_16x16_sub16_add_neon: 1390.1 vp9_inv_dct_dct_32x32_sub4_add_neon: 5199.9 vp9_inv_dct_dct_32x32_sub32_add_neon: 8125.8 This is cherrypicked from libav commit 115476018d2c97df7e9b4445fe8f6cc7420ab91f. Signed-off-by: Martin Storsjö <martin@martin.st>
Diffstat (limited to 'libavcodec/aarch64/h264dsp_neon.S')
0 files changed, 0 insertions, 0 deletions