| Commit message (Expand) | Author | Age | Files | Lines |
* | x86: add colons after labels | Mans Rullgard | 2012-08-07 | 1 | -15/+15 |
* | x86inc: add SPLATB_LOAD, SPLATB_REG, PSHUFLW macros | Loren Merritt | 2012-07-05 | 1 | -21/+0 |
* | vp8dsp x86: perform rounding shift with a single instruction | Christophe GISQUET | 2012-04-04 | 1 | -10/+8 |
* | vp8: convert mbedge loopfilter x86 assembly to use named arguments. | Ronald S. Bultje | 2012-03-10 | 1 | -235/+212 |
* | vp8: convert inner loopfilter x86 assembly to use named arguments. | Ronald S. Bultje | 2012-03-10 | 1 | -187/+164 |
* | vp8: convert simple loopfilter x86 assembly to use named arguments. | Ronald S. Bultje | 2012-03-03 | 1 | -25/+30 |
* | vp8: convert idct x86 assembly to use named arguments. | Ronald S. Bultje | 2012-03-03 | 1 | -97/+103 |
* | vp8: convert mc x86 assembly to use named arguments. | Ronald S. Bultje | 2012-03-03 | 1 | -272/+272 |
* | vp8: convert loopfilter x86 assembly to use cpuflags(). | Ronald S. Bultje | 2012-03-03 | 1 | -201/+158 |
* | vp8: convert idct/mc x86 assembly to use cpuflags(). | Ronald S. Bultje | 2012-03-03 | 1 | -60/+68 |
* | vp8: disable mmx functions with sse/sse2 counterparts on x86-64. | Ronald S. Bultje | 2012-03-02 | 1 | -0/+15 |
* | Fix NASM include directive | Dave Yeo | 2011-08-15 | 1 | -2/+2 |
* | Move x86util.asm from libavcodec/ to libavutil/. | Ronald S. Bultje | 2011-08-12 | 1 | -1/+1 |
* | Move x86inc.asm to libavutil/. | Ronald S. Bultje | 2011-08-12 | 1 | -1/+1 |
* | Modify x86util.asm to ease transitioning to 10-bit H.264 assembly. | Daniel Kang | 2011-05-17 | 1 | -5/+5 |
* | Fix FSF address copy paste error in some license headers. | Diego Biurrun | 2011-05-14 | 1 | -1/+1 |
* | Replace FFmpeg with Libav in licence headers | Mans Rullgard | 2011-03-19 | 1 | -4/+4 |
* | Use "d" suffix for general-purpose registers used with movd. | Reimar Döffinger | 2010-09-05 | 1 | -13/+13 |
* | Mark xmm registers as clobbered in simple loopfilter. Should fix the last | Ronald S. Bultje | 2010-08-24 | 1 | -11/+11 |
* | Fix segfaults in VP8 SIMD code on Win64 (and FATE/win64 failures). | Ronald S. Bultje | 2010-08-23 | 1 | -14/+14 |
* | VP8: move zeroing of luma DC block into the WHT | Jason Garrett-Glaser | 2010-08-02 | 1 | -2/+18 |
* | Use word-writing instead of dword-writing (with two cached but otherwise | Ronald S. Bultje | 2010-07-31 | 1 | -104/+95 |
* | Use pmaddubsw for the mbedge_filter (>=ssse3), 6-10 cycles faster. | Ronald S. Bultje | 2010-07-26 | 1 | -2/+78 |
* | VP8: Much faster SSE2 MC | Jason Garrett-Glaser | 2010-07-26 | 1 | -88/+78 |
* | Enable no-loop memory/register saving for ssse3/sse4 also. | Ronald S. Bultje | 2010-07-26 | 1 | -2/+2 |
* | Save a register (or regsize of stackspace for x86-32) for the no-loop | Ronald S. Bultje | 2010-07-26 | 1 | -16/+24 |
* | Use nested ifs instead of &&, which appears to not work with %ifidn (i.e. this | Ronald S. Bultje | 2010-07-26 | 1 | -3/+9 |
* | Split pextrw macro-spaghetti into several opt-specific macros, this will make | Ronald S. Bultje | 2010-07-26 | 1 | -30/+49 |
* | Fix obvious bug in assignment. Somehow, the test vectors don't test this... | Ronald S. Bultje | 2010-07-25 | 1 | -1/+1 |
* | Fix SPLATB_REG mess. Used to be a if/elseif/elseif/elseif spaghetti, so this | Ronald S. Bultje | 2010-07-24 | 1 | -33/+52 |
* | VP8: optimize DC-only chroma case in the same way as luma. | Jason Garrett-Glaser | 2010-07-23 | 1 | -3/+44 |
* | VP8 asm: cosmetics (spacing) | Jason Garrett-Glaser | 2010-07-23 | 1 | -2/+2 |
* | VP8: 30% faster idct_mb | Jason Garrett-Glaser | 2010-07-23 | 1 | -54/+127 |
* | VP8: clear DCT blocks in iDCT instead of using clear_blocks. | Jason Garrett-Glaser | 2010-07-23 | 1 | -4/+22 |
* | Use pextrw for SSE4 mbedge filter result writing, speedup 5-10cycles on | Ronald S. Bultje | 2010-07-22 | 1 | -5/+30 |
* | Fix and enable horizontal >=SSE2 mbedge loopfilter. | Ronald S. Bultje | 2010-07-22 | 1 | -2/+2 |
* | Eliminate one instruction in VP8 dc_add_sse4 | Jason Garrett-Glaser | 2010-07-21 | 1 | -2/+1 |
* | Various VP8 x86 deblocking speedups | Jason Garrett-Glaser | 2010-07-21 | 1 | -32/+67 |
* | Make mmx VP8 WHT faster | Jason Garrett-Glaser | 2010-07-21 | 1 | -17/+22 |
* | VP8 MBedge loopfilter MMX/MMX2/SSE2 functions for both luma (width=16) | Ronald S. Bultje | 2010-07-20 | 1 | -0/+641 |
* | Chroma (width=8) inner loopfilter MMX/MMX2/SSE2 for VP8 decoder. | Ronald S. Bultje | 2010-07-20 | 1 | -77/+131 |
* | Revert r24339 (it causes fate failures on x86-64) - I'll figure out what's | Ronald S. Bultje | 2010-07-19 | 1 | -108/+32 |
* | Implement chroma (width=8) inner loopfilter MMX/MMX2/SSE2 functions. | Ronald S. Bultje | 2010-07-19 | 1 | -32/+108 |
* | Be more efficient with registers or stack memory. Saves 8/16 bytes stack | Ronald S. Bultje | 2010-07-19 | 1 | -16/+16 |
* | Change function prototypes for width=8 inner and mbedge loopfilter functions | Ronald S. Bultje | 2010-07-19 | 1 | -1/+1 |
* | Attempt to fix x86-64 testsuite on fate. | Ronald S. Bultje | 2010-07-16 | 1 | -1/+1 |
* | Remove duplicate define. | Ronald S. Bultje | 2010-07-16 | 1 | -1/+0 |
* | Revert 24270, it contained some stuff that shouldn't have been in there. | Ronald S. Bultje | 2010-07-16 | 1 | -1/+2 |
* | Remove duplicate define. | Ronald S. Bultje | 2010-07-16 | 1 | -2/+1 |
* | Give x86 r%d registers names, this will simplify implementation of the chroma | Ronald S. Bultje | 2010-07-16 | 1 | -58/+81 |