aboutsummaryrefslogtreecommitdiffstats
path: root/libavcodec/ppc
Commit message (Collapse)AuthorAgeFilesLines
* ppc: cosmetics: Consistently format CPU flag detection invocationsDiego Biurrun2013-08-297-28/+35
|
* ppc: Add missing AltiVec cpuflag detection invocationsDiego Biurrun2013-08-293-0/+12
|
* ppc: fdct: Remove vim editor settings commentDiego Biurrun2013-08-281-2/+0
|
* ppc: Add and use convenience macro to check for AltiVec availabilityDiego Biurrun2013-08-281-3/+2
|
* altivec: perform an explicit unaligned loadKostya Shishkov2013-08-161-3/+1
| | | | | | | | Implicit vector loads on POWER7 hardware can use the VSX instruction set instead of classic Altivec/VMX. Let's force a VMX load in this case. Signed-off-by: Martin Storsjö <martin@martin.st>
* Consistently use "cpu_flags" as variable/parameter name for CPU flagsDiego Biurrun2013-07-181-3/+1
|
* fmtconvert: Explicitly use int32_t instead of intChristophe Gisquet2013-07-171-1/+1
| | | | Signed-off-by: Martin Storsjö <martin@martin.st>
* fix scalarproduct_and_madd_int16_altivec() for orders > 16Kostya Shishkov2013-05-261-2/+2
| | | | the second and third sources were incremented only by half of the needed size
* ppc: Only compile AltiVec FFT assembly when AltiVec is enabledDiego Biurrun2013-05-021-3/+4
|
* ppc: Drop unnecessary ff_ name prefixes from static functionsDiego Biurrun2013-04-302-32/+47
|
* ppc: More consistent arch initializationDiego Biurrun2013-04-308-25/+63
|
* ppc: Move AltiVec utility headers out of AltiVec ifdefsDiego Biurrun2013-04-304-14/+13
| | | | | Now that the headers themselves have ifdef protection this is no longer necessary and more consistent with normal include handling.
* ppc: More consistent names for H.264 optimizations filesDiego Biurrun2013-04-304-4/+4
|
* mpegaudiosp: More consistent names for ppc/x86 optimization filesDiego Biurrun2013-04-302-1/+1
|
* ppc: hpeldsp: Include attributes.hMartin Storsjö2013-04-201-0/+1
| | | | | | This fixes building in configurations where altivec is disabled. Signed-off-by: Martin Storsjö <martin@martin.st>
* ppc: hpeldsp: Move half-pel assembly from dsputil to hpeldspRonald S. Bultje2013-04-193-421/+467
| | | | Signed-off-by: Martin Storsjö <martin@martin.st>
* vp3: Use full transpose for all IDCTsRonald S. Bultje2013-04-151-1/+0
| | | | | | | | | | | This way, the special IDCT permutations are no longer needed. This is similar to how H264 does it, and removes the dsputil dependency imposed by the scantable code. Also remove the unused type == 0 cases from the plain C version of the idct. Signed-off-by: Martin Storsjö <martin@martin.st>
* h264: Integrate clear_blocks calls with IDCTRonald S. Bultje2013-04-101-0/+3
| | | | | | | | | The non-intra-pcm branch in hl_decode_mb (simple, 8bpp) goes from 700 to 672 cycles, and the complete loop of decode_mb_cabac and hl_decode_mb (in the decode_slice loop) goes from 1759 to 1733 cycles on the clip tested (cathedral), i.e. almost 30 cycles per mb faster. Signed-off-by: Martin Storsjö <martin@martin.st>
* dsputil: convert remaining functions to use ptrdiff_t stridesLuca Barbato2013-03-121-16/+32
| | | | Signed-off-by: Luca Barbato <lu_zero@gentoo.org>
* Remove unnecessary dsputil.h #includesDiego Biurrun2013-02-267-7/+0
|
* dsputil: Move LOCAL_ALIGNED macros to libavutilDiego Biurrun2013-02-081-1/+1
|
* dsputil: Separate h264chromaDiego Biurrun2013-02-068-294/+356
|
* Add av_cold attributes to arch-specific init functionsDiego Biurrun2013-02-0511-11/+24
|
* Use ptrdiff_t instead of int for {avg, put}_pixels line_size parameter.Diego Biurrun2013-02-052-13/+13
| | | | | This avoids SIMD-optimized functions having to sign-extend their line size argument manually to be able to do pointer arithmetic.
* ppc: fmtconvert: Drop two unused variables.Diego Biurrun2013-02-011-2/+2
|
* dsputil: Separate h264 qpelMans Rullgard2013-01-245-284/+322
| | | | | | | | | | The sh4 optimizations are removed, because the code is 100% identical to the C code, so it is unlikely to provide any real practical benefit. Signed-off-by: Diego Biurrun <diego@biurrun.de> Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com> Signed-off-by: Luca Barbato <lu_zero@gentoo.org>
* Drop DCTELEM typedefDiego Biurrun2013-01-226-24/+24
| | | | | | It does not help as an abstraction and adds dsputil dependencies. Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>
* floatdsp: move vector_fmul_reverse from dsputil to avfloatdsp.Ronald S. Bultje2013-01-223-59/+0
| | | | | | Now, nellymoserenc and aacenc no longer depends on dsputil. Independent of this patch, wmaprodec also does not depend on dsputil, so I removed it from there also.
* floatdsp: move vector_fmul_add from dsputil to avfloatdsp.Ronald S. Bultje2013-01-221-25/+0
|
* vorbisdsp: change block_size type from int to intptr_t.Ronald S. Bultje2013-01-201-1/+1
| | | | This saves one instruction in the x86-64 assembly.
* ppc: vorbisdsp: Drop some unnecessary #includesDiego Biurrun2013-01-201-2/+0
| | | | Also fixes compilation with AltiVec disabled.
* ppc: Include string.h for memsetMartin Storsjö2013-01-201-0/+2
| | | | | | | This fixes build failures on ppc machines with a compiler that supports -Werror=implicit-function-declaration. Signed-off-by: Martin Storsjö <martin@martin.st>
* Move vorbis_inverse_coupling from dsputil to vorbisdspcontext.Ronald S. Bultje2013-01-193-25/+63
| | | | | Conveniently (together with Justin's earlier patches), this makes our vorbis decoder entirely independent of dsputil.
* vp3: integrate clear_blocks with idct of previous block.Ronald S. Bultje2013-01-191-0/+2
| | | | | | | | | This is identical to what e.g. vp8 does, and prevents the function call overhead (plus dependency on dsputil for this particular function). Arm asm updated by Janne Grunau <janne-libav@jannau.net>. Signed-off-by: Janne Grunau <janne-libav@jannau.net>
* lavc: Move vector_fmul_window to AVFloatDSPContextJustin Ruggles2013-01-161-35/+0
| | | | Signed-off-by: Luca Barbato <lu_zero@gentoo.org>
* lavc: introduce VideoDSPContextRonald S. Bultje2012-12-203-10/+36
| | | | | | | | Move some functions from dsputil. The idea is that videodsp contains functions that are useful for a large and varied set of video decoders. Currently, it contains emulated_edge_mc() and prefetch(). Signed-off-by: Luca Barbato <lu_zero@gentoo.org>
* ppc: replace pointer casting with AV_COPY32Mans Rullgard2012-11-121-1/+2
| | | | | | This removes warnings about strict aliasing violations. Signed-off-by: Mans Rullgard <mans@mansr.com>
* ppc: fix some unused variable warningsMans Rullgard2012-11-121-9/+6
| | | | | | | The third argument of OP_U8_ALTIVEC is evaluated at most once so there is no need for a potentially unused temporary variable. Signed-off-by: Mans Rullgard <mans@mansr.com>
* build: non-x86: Only compile mpegvideo optimizations when necessaryDiego Biurrun2012-10-091-1/+1
|
* ppc: fix Altivec build with old compilersMans Rullgard2012-10-081-5/+14
| | | | | | | | | The vec_splat() intrinsic requires a constant argument for the element number, and the code relies on the compiler unrolling the loop to provide this. Manually unrolling the loop avoids this reliance and works with all compilers. Signed-off-by: Mans Rullgard <mans@mansr.com>
* ppc: fmtconvert: kill VLA in float_to_int16_interleave_altivec()Mans Rullgard2012-10-051-7/+18
| | | | Signed-off-by: Mans Rullgard <mans@mansr.com>
* Add more missing includes after removing the implicit common.hMartin Storsjö2012-08-161-0/+1
| | | | Signed-off-by: Martin Storsjö <martin@martin.st>
* Add some more missing includes after removing the implicit common.hMartin Storsjö2012-08-151-0/+2
| | | | Signed-off-by: Martin Storsjö <martin@martin.st>
* Don't include common.h from avutil.hMartin Storsjö2012-08-154-0/+5
| | | | Signed-off-by: Martin Storsjö <martin@martin.st>
* dsputil: ppc: cosmetics: pretty-printJustin Ruggles2012-07-221-62/+64
|
* ppc: fix build with altivec disabledMans Rullgard2012-07-181-1/+3
| | | | Signed-off-by: Mans Rullgard <mans@mansr.com>
* vp3: move idct and loop filter pointers to new vp3dsp contextMans Rullgard2012-07-184-33/+22
| | | | | | | | This moves all VP3-specific function pointers from dsputil to a new vp3dsp context. There is no reason to ever use the VP3 IDCT where an MPEG2 IDCT is expected or vice versa. Signed-off-by: Mans Rullgard <mans@mansr.com>
* ppc: Rename H.264 optimization template file for consistency.Diego Biurrun2012-06-123-4/+4
|
* Add a float DSP framework to libavutilJustin Ruggles2012-06-081-16/+0
| | | | Move vector_fmul() from DSPContext to AVFloatDSPContext.
* PPC: Move types_altivec.h and util_altivec.h from libavcodec to libavutilJustin Ruggles2012-06-0815-188/+23
| | | | | This will allow for easier implementation of Altivec functions in libraries other than libavcodec.