aboutsummaryrefslogtreecommitdiffstats
path: root/libavcodec/arm/Makefile
Commit message (Collapse)AuthorAgeFilesLines
* arm: Add an option for making sure NEON registers aren't clobberedMartin Storsjö2014-01-111-0/+1
| | | | | | This is pretty much based on the same test for XMM registers. Signed-off-by: Martin Storsjö <martin@martin.st>
* vc1: arm: Add NEON assemblyMason Carter2013-12-201-0/+3
| | | | | | | | | | | | | For: ff_vc1_inv_trans_{8,4}x{8,4}_{dc_,}neon ff_put_pixels8x8_neon ff_put_vc1_mspel_mc{0,1,2,3}{0,1,2,3}_neon (except for 00) Based on ARM assembly code in libavcodec/arm by Rob Clark and Mans Rullgard. Signed-off-by: Martin Storsjö <martin@martin.st>
* arm: fmtconvert: Split armv6 fmtconvert code off from vfp codeDiego Biurrun2013-08-291-1/+3
|
* vp56: Mark VP6-only optimizations as such.Diego Biurrun2013-08-231-4/+2
| | | | | Most of our VP56 optimizations are VP6-only and will stay that way. So avoid compiling them for VP5-only builds.
* arm: Add assembly version of h264_find_start_code_candidateBen Avison2013-08-081-0/+1
| | | | | | | | | | | | | | Before After Mean StdDev Mean StdDev Change This function 508.8 23.4 185.4 9.0 +174.4% Overall 3068.5 31.7 2752.1 29.4 +11.5% In combination with the preceding patch: Before After Mean StdDev Mean StdDev Change Overall 2925.6 26.2 2752.1 29.4 +6.3% Signed-off-by: Martin Storsjö <martin@martin.st>
* arm: Add VFP-accelerated version of fft16Martin Storsjö2013-07-221-0/+1
| | | | | | | | | Before After Mean StdDev Mean StdDev Change This function 1389.3 4.2 967.8 35.1 +43.6% Overall 15577.5 83.2 15400.0 336.4 +1.2% Signed-off-by: Martin Storsjö <martin@martin.st>
* arm: Add VFP-accelerated version of dca_lfe_firMartin Storsjö2013-07-221-1/+2
| | | | | | | | | Before After Mean StdDev Mean StdDev Change This function 868.2 33.5 436.0 27.0 +99.1% Overall 15973.0 223.2 15577.5 83.2 +2.5% Signed-off-by: Martin Storsjö <martin@martin.st>
* arm: Add VFP-accelerated version of imdct_halfMartin Storsjö2013-07-221-0/+1
| | | | | | | | | Before After Mean StdDev Mean StdDev Change This function 2653.0 28.5 1108.8 51.4 +139.3% Overall 17049.5 408.2 15973.0 223.2 +6.7% Signed-off-by: Martin Storsjö <martin@martin.st>
* arm: Add VFP-accelerated version of synth_filter_floatBen Avison2013-07-221-0/+1
| | | | | | | | | Before After Mean StdDev Mean StdDev Change This function 9295.0 114.9 4853.2 83.5 +91.5% Overall 23699.8 397.6 19285.5 292.0 +22.9% Signed-off-by: Martin Storsjö <martin@martin.st>
* arm: Include hpeldsp_neon.o if h264qpel is enabledMartin Storsjö2013-05-301-1/+2
| | | | | | | | | A few of the h264qpel neon functions are shared with other hpeldsp functions in this file. This fixes standalone compilation of the h264 decoder on arm. Signed-off-by: Martin Storsjö <martin@martin.st>
* arm: Don't unconditionally build dsputil filesMartin Storsjö2013-05-301-14/+13
| | | | Signed-off-by: Martin Storsjö <martin@martin.st>
* arm: Only build the FFT init files if FFT is enabledMartin Storsjö2013-05-301-2/+2
| | | | | | This fixes build errors in cases where FFT is disabled. Signed-off-by: Martin Storsjö <martin@martin.st>
* build: cosmetics: Place unconditional before conditional OBJS linesDiego Biurrun2013-05-301-19/+19
| | | | Signed-off-by: Martin Storsjö <martin@martin.st>
* build: arm: cosmetics: Place all OBJS declarations in alphabetical orderDiego Biurrun2013-05-301-61/+36
| | | | Signed-off-by: Martin Storsjö <martin@martin.st>
* arm: hpeldsp: Move half-pel assembly from dsputil to hpeldspRonald S. Bultje2013-04-191-0/+8
| | | | Signed-off-by: Martin Storsjö <martin@martin.st>
* dsputil: Separate h264chromaDiego Biurrun2013-02-061-2/+2
|
* arm: Add mathops.h to ARCH_HEADERS listDiego Biurrun2013-01-241-0/+2
| | | | It is an arch-specific header not suitable for standalone compilation.
* dsputil: Separate h264 qpelMans Rullgard2013-01-241-0/+3
| | | | | | | | | | The sh4 optimizations are removed, because the code is 100% identical to the C code, so it is unlikely to provide any real practical benefit. Signed-off-by: Diego Biurrun <diego@biurrun.de> Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com> Signed-off-by: Luca Barbato <lu_zero@gentoo.org>
* floatdsp: move vector_fmul_reverse from dsputil to avfloatdsp.Ronald S. Bultje2013-01-221-3/+0
| | | | | | Now, nellymoserenc and aacenc no longer depends on dsputil. Independent of this patch, wmaprodec also does not depend on dsputil, so I removed it from there also.
* Move vorbis_inverse_coupling from dsputil to vorbisdspcontext.Ronald S. Bultje2013-01-191-0/+3
| | | | | Conveniently (together with Justin's earlier patches), this makes our vorbis decoder entirely independent of dsputil.
* lavc: introduce VideoDSPContextRonald S. Bultje2012-12-201-0/+5
| | | | | | | | Move some functions from dsputil. The idea is that videodsp contains functions that are useful for a large and varied set of video decoders. Currently, it contains emulated_edge_mc() and prefetch(). Signed-off-by: Luca Barbato <lu_zero@gentoo.org>
* arm: rename ARMVFP config symbol to VFPMans Rullgard2012-12-071-2/+2
| | | | | | | This is consistent with usual ARM nomenclature as well as with the VFPV3 and NEON symbols which both lack the ARM prefix. Signed-off-by: Mans Rullgard <mans@mansr.com>
* arm: call arm-specific rv34dsp init functions under if (ARCH_ARM)Jean-Baptiste Kempf2012-10-101-6/+6
| | | | | | | Assign NEON specific function pointers after runtime check via av_get_cpu_flags(). Signed-off-by: Janne Grunau <janne-libav@jannau.net>
* build: non-x86: Only compile mpegvideo optimizations when necessaryDiego Biurrun2012-10-091-4/+5
|
* flacdsp: arm optimised lpc filterMans Rullgard2012-09-151-0/+3
|
* vp3: move idct and loop filter pointers to new vp3dsp contextMans Rullgard2012-07-181-0/+1
| | | | | | | | This moves all VP3-specific function pointers from dsputil to a new vp3dsp context. There is no reason to ever use the VP3 IDCT where an MPEG2 IDCT is expected or vice versa. Signed-off-by: Mans Rullgard <mans@mansr.com>
* build: add CONFIG_VP3DSP, reduce repetition in OBJS listsMans Rullgard2012-07-181-3/+1
| | | | Signed-off-by: Mans Rullgard <mans@mansr.com>
* aacps: NEON optimisationsMans Rullgard2012-05-051-2/+4
| | | | Signed-off-by: Mans Rullgard <mans@mansr.com>
* vp8: arm: separate ARMv6 functions from NEONMans Rullgard2012-04-251-2/+5
| | | | | | This is a preparation for complete ARMv6 optimisations. Signed-off-by: Mans Rullgard <mans@mansr.com>
* build: Consistently handle conditional compilation for all optimization OBJS.Diego Biurrun2012-04-121-8/+5
|
* remove iwmmxt optimizationsJanne Grunau2012-03-121-3/+0
| | | | | | The were broken since August of 2010 without anyone noticing until three weeks ago. Nobody cares about it anymore and hopefully Marvell will support NEON like in the PXA978 from now on.
* aacsbr: ARM NEON optimised sbrdsp functionsMans Rullgard2012-01-281-0/+4
| | | | | | Overall speedup of HE-AAC decoding 2.3x on Cortex-A8, 1.2x on A9. Signed-off-by: Mans Rullgard <mans@mansr.com>
* rv40: NEON optimised weighted predictionJanne Grunau2011-12-061-0/+1
| | | | Signed-off-by: Mans Rullgard <mans@mansr.com>
* rv40: NEON optimised chroma MCJanne Grunau2011-12-061-0/+2
| | | | Signed-off-by: Mans Rullgard <mans@mansr.com>
* ARM: move NEON H264 chroma mc to a separate fileMans Rullgard2011-12-061-0/+1
| | | | | | This allows sharing code with the rv40 version of these functions. Signed-off-by: Mans Rullgard <mans@mansr.com>
* rv34: NEON optimised inverse transform functionsJanne Grunau2011-12-061-0/+6
| | | | Signed-off-by: Mans Rullgard <mans@mansr.com>
* ARM: optimised mpadsp_apply_window_fixedMans Rullgard2011-06-131-0/+3
| | | | Signed-off-by: Mans Rullgard <mans@mansr.com>
* ARM: ac3dsp: optimised update_bap_counts()Mans Rullgard2011-06-011-0/+1
| | | | Signed-off-by: Mans Rullgard <mans@mansr.com>
* ARM: unbreak buildMans Rullgard2011-05-281-1/+0
| | | | Signed-off-by: Mans Rullgard <mans@mansr.com>
* ARM: NEON fixed-point forward MDCTMans Rullgard2011-04-031-0/+1
| | | | Signed-off-by: Mans Rullgard <mans@mansr.com>
* ARM: NEON fixed-point FFTMans Rullgard2011-04-031-0/+2
| | | | Signed-off-by: Mans Rullgard <mans@mansr.com>
* ac3enc: ARM optimised ac3_compute_matissa_sizeMans Rullgard2011-04-011-1/+3
| | | | Signed-off-by: Mans Rullgard <mans@mansr.com>
* ac3: armv6 optimised bit_alloc_calc_bapMans Rullgard2011-04-011-0/+2
| | | | Signed-off-by: Mans Rullgard <mans@mansr.com>
* ac3enc: NEON optimised ac3_max_msb_abs_int16 and ac3_exponent_minMans Rullgard2011-03-241-0/+3
|
* VP8: ARM optimised decode_block_coeffs_internalMans Rullgard2011-02-111-0/+2
| | | | | | Approximately 5% faster on Cortex-A8. Signed-off-by: Mans Rullgard <mans@mansr.com>
* VP8: ARM NEON optimisations for dsp functionsMans Rullgard2011-02-071-0/+3
| | | | | | | This adds NEON optimised versions of all functions in VP8DSPContext. Based on initial work by Rob Clark. Signed-off-by: Mans Rullgard <mans@mansr.com>
* Separate format conversion DSP functions from DSPContext.Justin Ruggles2011-02-021-0/+5
| | | | | | | This will be beneficial for use with the audio conversion API without requiring it to depend on all of dsputil. Signed-off-by: Mans Rullgard <mans@mansr.com>
* Split h264dsp and h264pred in configure.Jason Garrett-Glaser2010-08-071-3/+4
| | | | | | | | | Many H.264 derivatives, like RV40 and VP8, use the H.264 prediction functions but not the weight/loopfilter functions. This should reduce the size of builds with one of these derivatives but without H.264 decoding itself. Originally committed as revision 24741 to svn://svn.ffmpeg.org/ffmpeg/trunk
* fix VP5/6 neon dependenciesAurelien Jacobs2010-07-101-2/+5
| | | | Originally committed as revision 24160 to svn://svn.ffmpeg.org/ffmpeg/trunk
* ARM: NEON optimised dct_unquantize_h263_{intra,inter}Måns Rullgård2010-05-291-0/+1
| | | | Originally committed as revision 23386 to svn://svn.ffmpeg.org/ffmpeg/trunk