author | Martin Storsjö <martin@martin.st> | 2016-11-23 14:03:05 +0200 |
---|---|---|
committer | Martin Storsjö <martin@martin.st> | 2017-03-11 13:14:22 +0200 |
commit | dc47bf3872fde18f6aea33f015a95b70e24dff19 (patch) | |
tree | 16840d3da4ba143f03173a4f54d4d865c9aa5674 /libavcodec/aarch64/h264dsp_neon.S | |
parent | f8fcee0daf42d16581ec581bd5ff6373c3a76103 (diff) | |
download | ffmpeg-dc47bf3872fde18f6aea33f015a95b70e24dff19.tar.gz |
aarch64: vp9itxfm: Make the larger core transforms standalone functions
This work is sponsored by, and copyright, Google.
This reduces the code size of libavcodec/aarch64/vp9itxfm_neon.o from
19496 to 14740 bytes.
This causes a small slowdown (a couple of tens of cycles), but makes
it more feasible to add more optimized versions of these transforms.
Before:
vp9_inv_dct_dct_16x16_sub4_add_neon: 1036.7
vp9_inv_dct_dct_16x16_sub16_add_neon: 1372.2
vp9_inv_dct_dct_32x32_sub4_add_neon: 5180.0
vp9_inv_dct_dct_32x32_sub32_add_neon: 8095.7
After:
vp9_inv_dct_dct_16x16_sub4_add_neon: 1051.0
vp9_inv_dct_dct_16x16_sub16_add_neon: 1390.1
vp9_inv_dct_dct_32x32_sub4_add_neon: 5199.9
vp9_inv_dct_dct_32x32_sub32_add_neon: 8125.8
This is cherry-picked from libav commit
115476018d2c97df7e9b4445fe8f6cc7420ab91f.
Signed-off-by: Martin Storsjö <martin@martin.st>
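The trade-off described in the commit message can be checked directly from the figures it quotes. The following is a minimal sketch, not part of the commit: the variable and function names (`before`, `after`, `slowdown`) are made up for illustration, and the numbers are copied verbatim from the "Before"/"After" lists and the object sizes above.

```python
# Cycle counts copied from the "Before" and "After" lists in the commit message.
before = {
    "vp9_inv_dct_dct_16x16_sub4_add_neon": 1036.7,
    "vp9_inv_dct_dct_16x16_sub16_add_neon": 1372.2,
    "vp9_inv_dct_dct_32x32_sub4_add_neon": 5180.0,
    "vp9_inv_dct_dct_32x32_sub32_add_neon": 8095.7,
}
after = {
    "vp9_inv_dct_dct_16x16_sub4_add_neon": 1051.0,
    "vp9_inv_dct_dct_16x16_sub16_add_neon": 1390.1,
    "vp9_inv_dct_dct_32x32_sub4_add_neon": 5199.9,
    "vp9_inv_dct_dct_32x32_sub32_add_neon": 8125.8,
}

# Size of libavcodec/aarch64/vp9itxfm_neon.o in bytes, as stated in the message.
size_before, size_after = 19496, 14740


def slowdown(before, after):
    """Return {name: (extra_cycles, percent_increase)} for each benchmark."""
    result = {}
    for name, cycles in before.items():
        extra = after[name] - cycles
        result[name] = (round(extra, 1), round(100 * extra / cycles, 2))
    return result


deltas = slowdown(before, after)
saved = size_before - size_after
saved_pct = round(100 * saved / size_before, 1)

for name, (extra, pct) in deltas.items():
    print(f"{name}: +{extra} cycles (+{pct}%)")
print(f"object size: -{saved} bytes (-{saved_pct}%)")
```

With these inputs the per-function cost ranges from 14.3 to 30.1 cycles, which matches the "couple of tens of cycles" in the message, while the object file shrinks by 4756 bytes.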
Diffstat (limited to 'libavcodec/aarch64/h264dsp_neon.S')
0 files changed, 0 insertions, 0 deletions