author     Rémi Denis-Courmont <remi@remlab.net>  2023-11-15 19:46:18 +0200
committer  Rémi Denis-Courmont <remi@remlab.net>  2023-11-18 22:05:43 +0200
commit     295092b46d8bdaa3638647bb958b35c698045f6b
tree       bc49971b564caba88ebffd3f5a7c5609b572ec35
parent     8a984aca59b4c4842bae50cadb9eb481a455e6de
lavc/flacdsp: R-V V LPC32

The entire set of 32 coefficients and the corresponding past 32 samples
can fit exactly in a single vector (with LMUL=8), but since widening
doubles the needed vector size, we still end up too short with 128-bit
vectors.

This adds a very simple version for future 256+-bit hardware and for
pred_order values up to 16, and a slightly more involved loop for
128-bit hardware with pred_order between 17 and 32.

With 128-bit hardware, the benchmarks look like this:

flac_lpc_32_13_c:       30152.0
flac_lpc_32_13_rvv_i32: 10244.7
flac_lpc_32_16_c:       37314.2
flac_lpc_32_16_rvv_i32: 10126.2
flac_lpc_32_29_c:       61910.0
flac_lpc_32_29_rvv_i32: 14495.2
flac_lpc_32_32_c:       68204.0
flac_lpc_32_32_rvv_i32: 13273.7
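
For readers unfamiliar with the sizing argument, the following is a minimal scalar sketch, not the code added by this commit. The names elems_per_group and lpc32_ref are hypothetical, and the coefficient ordering (coeffs[0] paired with the oldest sample) is an assumption made for illustration. It shows how many elements fit in one register group for a given VLEN, LMUL and element width, and why the 32x32-bit products need a 64-bit accumulator.

```c
#include <assert.h>
#include <stdint.h>

/* Elements of width sew_bits that fit in one vector register group
 * (hypothetical helper: VLEN in bits, times LMUL, divided by element width). */
static int elems_per_group(int vlen_bits, int lmul, int sew_bits)
{
    return vlen_bits * lmul / sew_bits;
}

/* Hypothetical scalar reference for 32-bit LPC reconstruction.
 * On entry decoded[0..pred_order-1] holds warm-up samples and the rest
 * holds residuals; each residual is replaced by residual + prediction. */
static void lpc32_ref(int32_t *decoded, const int32_t *coeffs,
                      int pred_order, int qlevel, int len)
{
    assert(pred_order >= 1 && pred_order <= 32);

    for (int i = pred_order; i < len; i++) {
        int64_t sum = 0; /* 32x32-bit products are widened to 64 bits */

        for (int j = 0; j < pred_order; j++)
            sum += (int64_t)coeffs[j] * decoded[i - pred_order + j];

        /* Assumes an arithmetic right shift for negative sums, which is
         * implementation-defined in C but common in practice. */
        decoded[i] += (int32_t)(sum >> qlevel);
    }
}
```

With VLEN=128 and LMUL=8, elems_per_group gives 32 elements at 32 bits but only 16 at 64 bits, which matches the split at pred_order 16 described above; at VLEN=256 all 32 widened elements fit in one group.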