ffmpeg - Mirror of FFmpeg git repo

author	Martin Storsjö <martin@martin.st>	2017-02-24 16:10:25 +0200
committer	Martin Storsjö <martin@martin.st>	2017-03-19 22:54:26 +0200
commit	0f2705e66b1f7f9ae900667c400e46fa0e4f15a7 (patch)
tree	6aae4e968bdbfc39ff0f3e9f37d7eedc9f1a80d5 /libavcodec/libvpx.h
parent	0ea603203d1a46ea36cbaa3fb53d6fc69f5367ad (diff)
download	ffmpeg-0f2705e66b1f7f9ae900667c400e46fa0e4f15a7.tar.gz

aarch64: vp9itxfm16: Make the larger core transforms standalone functions

This work is sponsored by, and copyright, Google.

This reduces the code size of libavcodec/aarch64/vp9itxfm_16bpp_neon.o
from 26288 to 21512 bytes.

This gives a small slowdown of a couple of tens of cycles, but makes
it more feasible to add more optimized versions of these transforms.

Before:
vp9_inv_dct_dct_16x16_sub4_add_10_neon:     1887.4
vp9_inv_dct_dct_16x16_sub16_add_10_neon:    2801.5
vp9_inv_dct_dct_32x32_sub4_add_10_neon:     9691.4
vp9_inv_dct_dct_32x32_sub32_add_10_neon:   16154.9

After:
vp9_inv_dct_dct_16x16_sub4_add_10_neon:     1899.5
vp9_inv_dct_dct_16x16_sub16_add_10_neon:    2827.2
vp9_inv_dct_dct_32x32_sub4_add_10_neon:     9714.7
vp9_inv_dct_dct_32x32_sub32_add_10_neon:   16175.9

Signed-off-by: Martin Storsjö <martin@martin.st>
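
As a rough sanity check of the trade-off described in the commit message, the quoted figures can be worked through with a short calculation. This is only an illustrative sketch: the numbers are copied from the message, while the variable names and the script itself are assumptions and not part of the commit.

```python
# Figures copied from the commit message; all names here are illustrative.
size_before = 26288  # bytes, vp9itxfm_16bpp_neon.o before the change
size_after = 21512   # bytes, after the change

cycles_before = {
    "vp9_inv_dct_dct_16x16_sub4_add_10_neon": 1887.4,
    "vp9_inv_dct_dct_16x16_sub16_add_10_neon": 2801.5,
    "vp9_inv_dct_dct_32x32_sub4_add_10_neon": 9691.4,
    "vp9_inv_dct_dct_32x32_sub32_add_10_neon": 16154.9,
}
cycles_after = {
    "vp9_inv_dct_dct_16x16_sub4_add_10_neon": 1899.5,
    "vp9_inv_dct_dct_16x16_sub16_add_10_neon": 2827.2,
    "vp9_inv_dct_dct_32x32_sub4_add_10_neon": 9714.7,
    "vp9_inv_dct_dct_32x32_sub32_add_10_neon": 16175.9,
}

# Code size saved by sharing the core transforms as standalone functions.
saved_bytes = size_before - size_after
saved_pct = 100.0 * saved_bytes / size_before

# Per-function slowdown in cycles (after minus before), rounded to one decimal.
slowdown = {
    name: round(cycles_after[name] - cycles_before[name], 1)
    for name in cycles_before
}

print(f"object size: -{saved_bytes} bytes ({saved_pct:.1f}% smaller)")
for name, delta in slowdown.items():
    print(f"{name}: +{delta} cycles")
```

Each per-function difference falls between 10 and 30 cycles, which matches the "couple of tens of cycles" wording in the message.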

Diffstat (limited to 'libavcodec/libvpx.h')

0 files changed, 0 insertions, 0 deletions
