diff options
author | Christophe Gisquet <christophe.gisquet@gmail.com> | 2014-02-14 15:03:12 +0000 |
---|---|---|
committer | Janne Grunau <janne-libav@jannau.net> | 2014-02-20 14:18:05 +0100 |
commit | 996697e266c8adc0ad9b7fc7568406c7529c97cf (patch) | |
tree | 4b143794e0a28c92722d81a62b8c55b9cbd00cc1 /libavutil/x86/asm.h | |
parent | ef010f08ae53479c54e2f16be5a7e1a809a9e268 (diff) | |
download | ffmpeg-996697e266c8adc0ad9b7fc7568406c7529c97cf.tar.gz |
x86: float dsp: unroll SSE versions
vector_fmul and vector_fmac_scalar are guaranteed that they can process in
batch of 16 elements, but their SSE versions only does 8 at a time.
Therefore, unroll them a bit.
299 to 261c for 256 elements in vector_fmac_scalar on Arrandale/Win64.
Signed-off-by: Janne Grunau <janne-libav@jannau.net>
Diffstat (limited to 'libavutil/x86/asm.h')
0 files changed, 0 insertions, 0 deletions