| Commit message (Expand) | Author | Age | Files | Lines |
* | Fix NASM include directive | Dave Yeo | 2011-08-15 | 1 | -2/+2 |
* | Move x86util.asm from libavcodec/ to libavutil/. | Ronald S. Bultje | 2011-08-12 | 1 | -1/+1 |
* | Move x86inc.asm to libavutil/. | Ronald S. Bultje | 2011-08-12 | 1 | -1/+1 |
* | Modify x86util.asm to ease transitioning to 10-bit H.264 assembly. | Daniel Kang | 2011-05-17 | 1 | -5/+5 |
* | Fix FSF address copy paste error in some license headers. | Diego Biurrun | 2011-05-14 | 1 | -1/+1 |
* | Replace FFmpeg with Libav in licence headers | Mans Rullgard | 2011-03-19 | 1 | -4/+4 |
* | Use "d" suffix for general-purpose registers used with movd. | Reimar Döffinger | 2010-09-05 | 1 | -13/+13 |
* | Mark xmm registers as clobbered in simple loopfilter. Should fix the last | Ronald S. Bultje | 2010-08-24 | 1 | -11/+11 |
* | Fix segfaults in VP8 SIMD code on Win64 (and FATE/win64 failures). | Ronald S. Bultje | 2010-08-23 | 1 | -14/+14 |
* | VP8: move zeroing of luma DC block into the WHT | Jason Garrett-Glaser | 2010-08-02 | 1 | -2/+18 |
* | Use word-writing instead of dword-writing (with two cached but otherwise | Ronald S. Bultje | 2010-07-31 | 1 | -104/+95 |
* | Use pmaddubsw for the mbedge_filter (>=ssse3), 6-10 cycles faster. | Ronald S. Bultje | 2010-07-26 | 1 | -2/+78 |
* | VP8: Much faster SSE2 MC | Jason Garrett-Glaser | 2010-07-26 | 1 | -88/+78 |
* | Enable no-loop memory/register saving for ssse3/sse4 also. | Ronald S. Bultje | 2010-07-26 | 1 | -2/+2 |
* | Save a register (or regsize of stackspace for x86-32) for the no-loop | Ronald S. Bultje | 2010-07-26 | 1 | -16/+24 |
* | Use nested ifs instead of &&, which appears to not work with %ifidn (i.e. this | Ronald S. Bultje | 2010-07-26 | 1 | -3/+9 |
* | Split pextrw macro-spaghetti into several opt-specific macros, this will make | Ronald S. Bultje | 2010-07-26 | 1 | -30/+49 |
* | Fix obvious bug in assignment. Somehow, the test vectors don't test this... | Ronald S. Bultje | 2010-07-25 | 1 | -1/+1 |
* | Fix SPLATB_REG mess. Used to be a if/elseif/elseif/elseif spaghetti, so this | Ronald S. Bultje | 2010-07-24 | 1 | -33/+52 |
* | VP8: optimize DC-only chroma case in the same way as luma. | Jason Garrett-Glaser | 2010-07-23 | 1 | -3/+44 |
* | VP8 asm: cosmetics (spacing) | Jason Garrett-Glaser | 2010-07-23 | 1 | -2/+2 |
* | VP8: 30% faster idct_mb | Jason Garrett-Glaser | 2010-07-23 | 1 | -54/+127 |
* | VP8: clear DCT blocks in iDCT instead of using clear_blocks. | Jason Garrett-Glaser | 2010-07-23 | 1 | -4/+22 |
* | Use pextrw for SSE4 mbedge filter result writing, speedup 5-10cycles on | Ronald S. Bultje | 2010-07-22 | 1 | -5/+30 |
* | Fix and enable horizontal >=SSE2 mbedge loopfilter. | Ronald S. Bultje | 2010-07-22 | 1 | -2/+2 |
* | Eliminate one instruction in VP8 dc_add_sse4 | Jason Garrett-Glaser | 2010-07-21 | 1 | -2/+1 |
* | Various VP8 x86 deblocking speedups | Jason Garrett-Glaser | 2010-07-21 | 1 | -32/+67 |
* | Make mmx VP8 WHT faster | Jason Garrett-Glaser | 2010-07-21 | 1 | -17/+22 |
* | VP8 MBedge loopfilter MMX/MMX2/SSE2 functions for both luma (width=16) | Ronald S. Bultje | 2010-07-20 | 1 | -0/+641 |
* | Chroma (width=8) inner loopfilter MMX/MMX2/SSE2 for VP8 decoder. | Ronald S. Bultje | 2010-07-20 | 1 | -77/+131 |
* | Revert r24339 (it causes fate failures on x86-64) - I'll figure out what's | Ronald S. Bultje | 2010-07-19 | 1 | -108/+32 |
* | Implement chroma (width=8) inner loopfilter MMX/MMX2/SSE2 functions. | Ronald S. Bultje | 2010-07-19 | 1 | -32/+108 |
* | Be more efficient with registers or stack memory. Saves 8/16 bytes stack | Ronald S. Bultje | 2010-07-19 | 1 | -16/+16 |
* | Change function prototypes for width=8 inner and mbedge loopfilter functions | Ronald S. Bultje | 2010-07-19 | 1 | -1/+1 |
* | Attempt to fix x86-64 testsuite on fate. | Ronald S. Bultje | 2010-07-16 | 1 | -1/+1 |
* | Remove duplicate define. | Ronald S. Bultje | 2010-07-16 | 1 | -1/+0 |
* | Revert 24270, it contained some stuff that shouldn't have been in there. | Ronald S. Bultje | 2010-07-16 | 1 | -1/+2 |
* | Remove duplicate define. | Ronald S. Bultje | 2010-07-16 | 1 | -2/+1 |
* | Give x86 r%d registers names, this will simplify implementation of the chroma | Ronald S. Bultje | 2010-07-16 | 1 | -58/+81 |
* | Change return statement, the REP_RET is a mistake since the else case (x86-64, | Ronald S. Bultje | 2010-07-16 | 1 | -3/+1 |
* | VP8 H/V inner loopfilter MMX/MMXEXT/SSE2 optimizations. | Ronald S. Bultje | 2010-07-15 | 1 | -13/+464 |
* | Simple H/V loopfilter for VP8 in MMX, MMX2 and SSE2 (yay for yasm macros). | Ronald S. Bultje | 2010-07-03 | 1 | -0/+306 |
* | SSSE3 versions of vp8 width4 bilinear MC functions | Jason Garrett-Glaser | 2010-07-03 | 1 | -2/+23 |
* | SSSE3 versions of width4 VP8 6-tap MC functions | Jason Garrett-Glaser | 2010-07-02 | 1 | -161/+174 |
* | Use add instead of lshift in mmxext vp8 idct | Jason Garrett-Glaser | 2010-06-29 | 1 | -2/+2 |
* | Remove unused macros (duplicates from the now-LGPL x86util.asm). | Ronald S. Bultje | 2010-06-29 | 1 | -26/+0 |
* | MMX idct_add for VP8. | Ronald S. Bultje | 2010-06-29 | 1 | -0/+89 |
* | Add mmxext version of VP8 DC Hadamard transform | Jason Garrett-Glaser | 2010-06-29 | 1 | -0/+46 |
* | Fix VP8 bilinear mc on x86_64 | Jason Garrett-Glaser | 2010-06-28 | 1 | -6/+6 |
* | Add x86 asm functions for VP8 put_pixels | Jason Garrett-Glaser | 2010-06-28 | 1 | -0/+40 |