diff options
author | James Darnley <james.darnley@gmail.com> | 2013-03-16 21:42:23 +0100 |
---|---|---|
committer | Michael Niedermayer <michaelni@gmx.at> | 2013-03-16 22:32:34 +0100 |
commit | 17e7b495013de644dc49e61673846d6c0c1bde47 (patch) | |
tree | a44ef046b8e233a217be30d1738491355f47b084 /libavcodec/truemotion1data.h | |
parent | 75c7e4583f4fd727d236a12763a265502fe00988 (diff) | |
download | ffmpeg-17e7b495013de644dc49e61673846d6c0c1bde47.tar.gz |
yadif: x86 assembly for 16-bit samples
This is a fairly dumb copy of the assembly for 8-bit samples but it
works and produces identical output to the C version. The options have
been tested on an Athlon64 and a Core2Quad.
Athlon64:
1810385 decicycles in C, 32726 runs, 42 skips
1080744 decicycles in mmx, 32744 runs, 24 skips, 1.7x faster
818315 decicycles in sse2, 32735 runs, 33 skips, 2.2x faster
Core2Quad:
924025 decicycles in C, 32750 runs, 18 skips
623995 decicycles in mmx, 32767 runs, 1 skips, 1.5x faster
406223 decicycles in sse2, 32764 runs, 4 skips, 2.3x faster
387842 decicycles in ssse3, 32767 runs, 1 skips, 2.4x faster
307726 decicycles in sse4, 32763 runs, 5 skips, 3.0x faster
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
Diffstat (limited to 'libavcodec/truemotion1data.h')
0 files changed, 0 insertions, 0 deletions