diff options
author | Ronald S. Bultje <rsbultje@gmail.com> | 2014-01-04 10:08:47 -0500 |
---|---|---|
committer | Ronald S. Bultje <rsbultje@gmail.com> | 2014-01-07 20:43:30 -0500 |
commit | e84d14df10d0408b9e06b33b2f71173188279dda (patch) | |
tree | a242f5baf023923b3fd20a7a231bdd67342b254f /libavcodec/x86/vp56_arith.h | |
parent | b0517467c0b60b9e9e1a660859727e1f512c70d0 (diff) | |
download | ffmpeg-e84d14df10d0408b9e06b33b2f71173188279dda.tar.gz |
vp9/x86: idct_32x32_add_ssse3.
Sub-IDCTs will follow later. ped1080.webm goes from 9.295s to 8.191s
(13.5% faster). The IDCT itself goes from 4372 (intra) or 4337 (inter)
to 403 (intra) or 329 (inter) cycles for the DC-only form, 23755 (intra)
or 23723 (inter) to 3497 (intra) or 3607 (inter) cycles for the no-DC
form, which averages from 23393 (intra) or 16612 (inter) to 3449 (intra)
or 2392 (inter) for all 32x32s together, i.e. about ~7x faster (all
tests done on ped1080p.webm).
Diffstat (limited to 'libavcodec/x86/vp56_arith.h')
0 files changed, 0 insertions, 0 deletions