ffmpeg - Mirror of FFmpeg git repo

diff options

author	Rémi Denis-Courmont <remi@remlab.net>	2024-07-04 21:38:48 +0300
committer	Rémi Denis-Courmont <remi@remlab.net>	2024-07-09 18:03:29 +0300
commit	3606e592ea239ebaf2e30570bf99abeefd52b7e0 (patch)
tree	e09db26616e90263113bc51c536a8d0f466dbce1 /libavformat/spdif.c
parent	85706f5136cf7c88f95843b2634dd3f7d7d2cb6d (diff)
download	ffmpeg-3606e592ea239ebaf2e30570bf99abeefd52b7e0.tar.gz

lavc/h264dsp: R-V V 8-bit h264_weight_pixels

There are two implementations here: - a generic scalable one processing two columns at a time, - a specialised processing one (fixed-size) row at a time. Unsurprisingly, the generic one works out better with smaller widths. With larger widths, the gains from filling vectors are outweighed by the extra cost of strided loads and stores. In other words, memory accesses become the bottleneck. T-Head C908: h264_weight2_8_c: 54.5 h264_weight2_8_rvv_i32: 13.7 h264_weight4_8_c: 101.7 h264_weight4_8_rvv_i32: 27.5 h264_weight8_8_c: 197.0 h264_weight8_8_rvv_i32: 75.5 h264_weight16_8_c: 385.0 h264_weight16_8_rvv_i32: 74.2 SpacemiT X60: h264_weight2_8_c: 48.5 h264_weight2_8_rvv_i32: 8.2 h264_weight4_8_c: 90.7 h264_weight4_8_rvv_i32: 16.5 h264_weight8_8_c: 175.0 h264_weight8_8_rvv_i32: 37.7 h264_weight16_8_c: 342.2 h264_weight16_8_rvv_i32: 66.0

Diffstat (limited to 'libavformat/spdif.c')

0 files changed, 0 insertions, 0 deletions


context:
space:
mode: