diff options
author | Rémi Denis-Courmont <remi@remlab.net> | 2024-07-04 21:38:48 +0300 |
---|---|---|
committer | Rémi Denis-Courmont <remi@remlab.net> | 2024-07-09 18:03:29 +0300 |
commit | 3606e592ea239ebaf2e30570bf99abeefd52b7e0 (patch) | |
tree | e09db26616e90263113bc51c536a8d0f466dbce1 /libavformat/spdif.c | |
parent | 85706f5136cf7c88f95843b2634dd3f7d7d2cb6d (diff) | |
download | ffmpeg-3606e592ea239ebaf2e30570bf99abeefd52b7e0.tar.gz |
lavc/h264dsp: R-V V 8-bit h264_weight_pixels
There are two implementations here:
- a generic scalable one processing two columns at a time,
- a specialised processing one (fixed-size) row at a time.
Unsurprisingly, the generic one works out better with smaller widths.
With larger widths, the gains from filling vectors are outweighed by
the extra cost of strided loads and stores. In other words, memory
accesses become the bottleneck.
T-Head C908:
h264_weight2_8_c: 54.5
h264_weight2_8_rvv_i32: 13.7
h264_weight4_8_c: 101.7
h264_weight4_8_rvv_i32: 27.5
h264_weight8_8_c: 197.0
h264_weight8_8_rvv_i32: 75.5
h264_weight16_8_c: 385.0
h264_weight16_8_rvv_i32: 74.2
SpacemiT X60:
h264_weight2_8_c: 48.5
h264_weight2_8_rvv_i32: 8.2
h264_weight4_8_c: 90.7
h264_weight4_8_rvv_i32: 16.5
h264_weight8_8_c: 175.0
h264_weight8_8_rvv_i32: 37.7
h264_weight16_8_c: 342.2
h264_weight16_8_rvv_i32: 66.0
Diffstat (limited to 'libavformat/spdif.c')
0 files changed, 0 insertions, 0 deletions