path: root/libavcodec/x86
Commit message | Author | Age | Files | Lines
* Fix typos when converting inline asm to yasm, fixes MMX-only fate-ea-vp61. | Ronald S. Bultje | 2010-08-26 | 1 | -5/+5
* Revert r24931, it broke Win32 and some BSD compiles (yay fate). | Ronald S. Bultje | 2010-08-25 | 1 | -1/+0
* Mark xmm6 and xmm7 as clobbered in ff_vp3_idct_sse2(), which is contributing | Ronald S. Bultje | 2010-08-25 | 1 | -0/+1
* VP6: fix vp6_filter_diag4_mmx/sse on 64-bit | Måns Rullgård | 2010-08-25 | 1 | -0/+3
* Move vp6_filter_diag4() x86 SIMD code from inline ASM to YASM. This should | Ronald S. Bultje | 2010-08-25 | 7 | -270/+178
* Move vp6_filter_diag4() from DSPContext to VP56DSPContext. | Ronald S. Bultje | 2010-08-25 | 3 | -12/+46
* Remove global mm_flags variable | Måns Rullgård | 2010-08-24 | 10 | -10/+14
* Mark xmm registers as clobbered in simple loopfilter. Should fix the last | Ronald S. Bultje | 2010-08-24 | 1 | -11/+11
* imdct/x86: Use "s->mdct_size" instead of "1 << s->mdct_bits". | Alex Converse | 2010-08-23 | 2 | -3/+3
* Fix segfaults in VP8 SIMD code on Win64 (and FATE/win64 failures). | Ronald S. Bultje | 2010-08-23 | 1 | -14/+14
* Convert ff_imdct_half_sse() to yasm. | Alex Converse | 2010-08-22 | 2 | -108/+195
* VP5/6/8: ~7% faster arithmetic decoding | Jason Garrett-Glaser | 2010-08-12 | 1 | -1/+1
* Split h264dsp and h264pred in configure. | Jason Garrett-Glaser | 2010-08-07 | 2 | -2/+3
* Add file missing in r24702 | Jason Garrett-Glaser | 2010-08-05 | 1 | -0/+170
* H.264: SSE2/SSSE3 weighted prediction asm | Eli Friedman | 2010-08-05 | 3 | -0/+23
* Move cavs dsp functions to their own struct | Måns Rullgård | 2010-08-03 | 3 | -10/+11
* VP5/6/8: add one inline missed in r24677 | Jason Garrett-Glaser | 2010-08-03 | 1 | -1/+1
* VP8: move zeroing of luma DC block into the WHT | Jason Garrett-Glaser | 2010-08-02 | 2 | -2/+20
* Use word-writing instead of dword-writing (with two cached but otherwise | Ronald S. Bultje | 2010-07-31 | 2 | -105/+98
* Remove x86/mmx.h. It is not used anymore and has been deprecated for years. | Vitor Sessak | 2010-07-31 | 1 | -267/+0
* Convert deinterlacing MMX code to YASM | Vitor Sessak | 2010-07-31 | 3 | -0/+95
* Fix compilation in x86_64. I broke it with r24580. | Vitor Sessak | 2010-07-29 | 1 | -2/+2
* Translate libmpeg2 MMX IDCT to plain asm | Vitor Sessak | 2010-07-29 | 1 | -208/+237
* Use pmaddubsw for the mbedge_filter (>=ssse3), 6-10 cycles faster. | Ronald S. Bultje | 2010-07-26 | 1 | -2/+78
* VP8: Much faster SSE2 MC | Jason Garrett-Glaser | 2010-07-26 | 1 | -88/+78
* Enable no-loop memory/register saving for ssse3/sse4 also. | Ronald S. Bultje | 2010-07-26 | 1 | -2/+2
* Save a register (or regsize of stackspace for x86-32) for the no-loop | Ronald S. Bultje | 2010-07-26 | 1 | -16/+24
* Use nested ifs instead of &&, which appears to not work with %ifidn (i.e. this | Ronald S. Bultje | 2010-07-26 | 1 | -3/+9
* Split pextrw macro-spaghetti into several opt-specific macros, this will make | Ronald S. Bultje | 2010-07-26 | 1 | -30/+49
* Fix obvious bug in assignment. Somehow, the test vectors don't test this... | Ronald S. Bultje | 2010-07-25 | 1 | -1/+1
* Fix SPLATB_REG mess. Used to be a if/elseif/elseif/elseif spaghetti, so this | Ronald S. Bultje | 2010-07-24 | 1 | -33/+52
* Inline asm for VP56 arith coder | Eli Friedman | 2010-07-23 | 1 | -0/+54
* VP8: optimize DC-only chroma case in the same way as luma. | Jason Garrett-Glaser | 2010-07-23 | 2 | -10/+53
* VP8 asm: cosmetics (spacing) | Jason Garrett-Glaser | 2010-07-23 | 1 | -2/+2
* VP8: 30% faster idct_mb | Jason Garrett-Glaser | 2010-07-23 | 2 | -54/+132
* VP8: clear DCT blocks in iDCT instead of using clear_blocks. | Jason Garrett-Glaser | 2010-07-23 | 2 | -4/+24
* Use pextrw for SSE4 mbedge filter result writing, speedup 5-10cycles on | Ronald S. Bultje | 2010-07-22 | 2 | -5/+34
* Fix and enable horizontal >=SSE2 mbedge loopfilter. | Ronald S. Bultje | 2010-07-22 | 2 | -8/+8
* relicense h264 deblock sse2 to lgpl | Loren Merritt | 2010-07-22 | 3 | -15/+19
* sync yasm macros from x264 | Loren Merritt | 2010-07-21 | 1 | -12/+23
* Eliminate one instruction in VP8 dc_add_sse4 | Jason Garrett-Glaser | 2010-07-21 | 1 | -2/+1
* Various VP8 x86 deblocking speedups | Jason Garrett-Glaser | 2010-07-21 | 2 | -92/+107
* Make mmx VP8 WHT faster | Jason Garrett-Glaser | 2010-07-21 | 2 | -19/+24
* Add header declarations for mmx/sse constants missing them | David Conrad | 2010-07-21 | 1 | -0/+6
* Move ff_pw_* from vc1dsp_mmx.c to dsputil_mmx.c | David Conrad | 2010-07-21 | 2 | -7/+1
* VP8 MBedge loopfilter MMX/MMX2/SSE2 functions for both luma (width=16) | Ronald S. Bultje | 2010-07-20 | 4 | -2/+687
* Chroma (width=8) inner loopfilter MMX/MMX2/SSE2 for VP8 decoder. | Ronald S. Bultje | 2010-07-20 | 2 | -77/+150
* Revert r24339 (it causes fate failures on x86-64) - I'll figure out what's | Ronald S. Bultje | 2010-07-19 | 2 | -127/+32
* Remove FF_MM_SSE2/3 flags for CPUs where this is generally not faster than | Ronald S. Bultje | 2010-07-19 | 3 | -6/+25
* Implement chroma (width=8) inner loopfilter MMX/MMX2/SSE2 functions. | Ronald S. Bultje | 2010-07-19 | 2 | -32/+127