diff options
author | Martin Storsjö <martin@martin.st> | 2017-02-24 16:10:25 +0200 |
---|---|---|
committer | Martin Storsjö <martin@martin.st> | 2017-03-19 22:54:26 +0200 |
commit | 0f2705e66b1f7f9ae900667c400e46fa0e4f15a7 (patch) | |
tree | 6aae4e968bdbfc39ff0f3e9f37d7eedc9f1a80d5 /libavcodec/libvpx.h | |
parent | 0ea603203d1a46ea36cbaa3fb53d6fc69f5367ad (diff) | |
download | ffmpeg-0f2705e66b1f7f9ae900667c400e46fa0e4f15a7.tar.gz |
aarch64: vp9itxfm16: Make the larger core transforms standalone functions
This work is sponsored by, and copyright, Google.
This reduces the code size of libavcodec/aarch64/vp9itxfm_16bpp_neon.o from
26288 to 21512 bytes.
This gives a small slowdown of a couple of tens of cycles, but makes
it more feasible to add more optimized versions of these transforms.
Before:
vp9_inv_dct_dct_16x16_sub4_add_10_neon: 1887.4
vp9_inv_dct_dct_16x16_sub16_add_10_neon: 2801.5
vp9_inv_dct_dct_32x32_sub4_add_10_neon: 9691.4
vp9_inv_dct_dct_32x32_sub32_add_10_neon: 16154.9
After:
vp9_inv_dct_dct_16x16_sub4_add_10_neon: 1899.5
vp9_inv_dct_dct_16x16_sub16_add_10_neon: 2827.2
vp9_inv_dct_dct_32x32_sub4_add_10_neon: 9714.7
vp9_inv_dct_dct_32x32_sub32_add_10_neon: 16175.9
Signed-off-by: Martin Storsjö <martin@martin.st>
Diffstat (limited to 'libavcodec/libvpx.h')
0 files changed, 0 insertions, 0 deletions