diff options
author | Lauri Kasanen <cand@gmx.com> | 2019-03-31 17:01:37 +0300 |
---|---|---|
committer | Lauri Kasanen <cand@gmx.com> | 2019-04-07 09:20:31 +0300 |
commit | 50e672bc54372f2bd228d83cc040fad8c06ad8ed (patch) | |
tree | cb1c81a9d025c1df3fed2afcd1cc17956750a56d /libavcodec/arm/int_neon.S | |
parent | 7c1875143db4573bad2d39f13aadbef7b13628ad (diff) | |
download | ffmpeg-50e672bc54372f2bd228d83cc040fad8c06ad8ed.tar.gz |
swscale/ppc: VSX-optimize non-full-chroma yuv2rgb_1
./ffmpeg -f lavfi -i yuvtestsrc=duration=1:size=1200x1440 -sws_flags fast_bilinear \
-s 1200x1440 -f null -vframes 100 -pix_fmt $i -nostats \
-cpuflags 0 -v error -
32-bit mul, power8 only.
1.8-2.3x speedup:
rgb24
18192 UNITS in yuv2packed1, 32767 runs, 1 skips
9983 UNITS in yuv2packed1, 32760 runs, 8 skips
bgr24
18665 UNITS in yuv2packed1, 32766 runs, 2 skips
9925 UNITS in yuv2packed1, 32763 runs, 5 skips
rgba
20239 UNITS in yuv2packed1, 32767 runs, 1 skips
8794 UNITS in yuv2packed1, 32759 runs, 9 skips
bgra
20354 UNITS in yuv2packed1, 32768 runs, 0 skips
8770 UNITS in yuv2packed1, 32761 runs, 7 skips
argb
20185 UNITS in yuv2packed1, 32768 runs, 0 skips
8761 UNITS in yuv2packed1, 32761 runs, 7 skips
bgra
20360 UNITS in yuv2packed1, 32766 runs, 2 skips
8759 UNITS in yuv2packed1, 32764 runs, 4 skips
This is a low speedup, but the x86 mmx version also gets only ~2x. The mmx version
is also heavily inaccurate, while the vsx version has high accuracy.
Diffstat (limited to 'libavcodec/arm/int_neon.S')
0 files changed, 0 insertions, 0 deletions