git.sesse.net Git - x264/log
x264
16 years agomerge x86_32 and x86_64 asm, with macros to abstract calling convention and register...
Loren Merritt [Sun, 16 Mar 2008 19:54:58 +0000 (13:54 -0600)]
merge x86_32 and x86_64 asm, with macros to abstract calling convention and register names

16 years agogit compatible version script
Loren Merritt [Sun, 9 Mar 2008 11:58:55 +0000 (05:58 -0600)]
git compatible version script

16 years agocheck for broken versions of yasm
Loren Merritt [Mon, 3 Mar 2008 00:53:01 +0000 (17:53 -0700)]
check for broken versions of yasm

16 years agoincrease the alignment of the i8x8 edge cache, needed for sse2 intra prediction.
Loren Merritt [Mon, 3 Mar 2008 00:27:38 +0000 (17:27 -0700)]
increase the alignment of the i8x8 edge cache, needed for sse2 intra prediction.
patch by Alexander Strange.

16 years ago.gitignore
Loren Merritt [Sun, 2 Mar 2008 23:12:57 +0000 (16:12 -0700)]
.gitignore

16 years agopic macros now keep track of which register holds the GOT, so variable access doesn...
Loren Merritt [Sun, 2 Mar 2008 03:04:07 +0000 (03:04 +0000)]
pic macros now keep track of which register holds the GOT, so variable access doesn't have to care

git-svn-id: svn://svn.videolan.org/x264/trunk@745 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agoremove x86_64 predict_8x8_ddl_mmxext because sse2 is faster even on amd
Loren Merritt [Sun, 2 Mar 2008 02:27:45 +0000 (02:27 +0000)]
remove x86_64 predict_8x8_ddl_mmxext because sse2 is faster even on amd

git-svn-id: svn://svn.videolan.org/x264/trunk@744 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agocosmetics in dsp init
Loren Merritt [Sun, 2 Mar 2008 02:26:00 +0000 (02:26 +0000)]
cosmetics in dsp init

git-svn-id: svn://svn.videolan.org/x264/trunk@743 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agosse2 16x16 intra pred.
Loren Merritt [Sun, 2 Mar 2008 02:11:12 +0000 (02:11 +0000)]
sse2 16x16 intra pred.
port the remaining intra pred functions from x86_64 to x86_32.
patch by Fiona Glaser.

git-svn-id: svn://svn.videolan.org/x264/trunk@742 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agosome simplifications to mmx intra pred that should have been done way back when we...
Loren Merritt [Sat, 1 Mar 2008 13:47:05 +0000 (13:47 +0000)]
some simplifications to mmx intra pred that should have been done way back when we switched to constant fdec_stride.
and remove pic spills in functions that have a free caller-saved reg.
patch partly by Fiona Glaser.

git-svn-id: svn://svn.videolan.org/x264/trunk@741 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agofaster array_non_zero
Loren Merritt [Sat, 1 Mar 2008 07:30:34 +0000 (07:30 +0000)]
faster array_non_zero

git-svn-id: svn://svn.videolan.org/x264/trunk@740 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agox86_32 sse2 idct8
Loren Merritt [Sat, 1 Mar 2008 04:33:24 +0000 (04:33 +0000)]
x86_32 sse2 idct8
ported from ffmpeg by Fiona Glaser

git-svn-id: svn://svn.videolan.org/x264/trunk@739 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agocheckasm: relax the threshold for floating-point ssim
Loren Merritt [Sat, 1 Mar 2008 04:13:55 +0000 (04:13 +0000)]
checkasm: relax the threshold for floating-point ssim

git-svn-id: svn://svn.videolan.org/x264/trunk@738 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agocheckasm: test idct with the range of coefficients that can really be encountered...
Loren Merritt [Sat, 1 Mar 2008 04:07:44 +0000 (04:07 +0000)]
checkasm: test idct with the range of coefficients that can really be encountered, as opposed to random numbers which might overflow.

git-svn-id: svn://svn.videolan.org/x264/trunk@737 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agointra_rd_refine in B-frames
Loren Merritt [Mon, 28 Jan 2008 14:33:42 +0000 (14:33 +0000)]
intra_rd_refine in B-frames

git-svn-id: svn://svn.videolan.org/x264/trunk@736 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agoprint average of macroblock QPs instead of frame's nominal QP
Loren Merritt [Sun, 27 Jan 2008 16:29:54 +0000 (16:29 +0000)]
print average of macroblock QPs instead of frame's nominal QP

git-svn-id: svn://svn.videolan.org/x264/trunk@735 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agoupdate date
Loren Merritt [Sun, 27 Jan 2008 16:16:37 +0000 (16:16 +0000)]
update date

git-svn-id: svn://svn.videolan.org/x264/trunk@734 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agoremove colorspace conversion support, because it has no business in any codec
Loren Merritt [Sun, 27 Jan 2008 16:06:31 +0000 (16:06 +0000)]
remove colorspace conversion support, because it has no business in any codec

git-svn-id: svn://svn.videolan.org/x264/trunk@733 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agomisc fixes in checkasm
Loren Merritt [Sun, 27 Jan 2008 14:01:40 +0000 (14:01 +0000)]
misc fixes in checkasm

git-svn-id: svn://svn.videolan.org/x264/trunk@732 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agoremove a useless bit of me=umh (originally copied from JM, where it was used for...
Loren Merritt [Sun, 27 Jan 2008 13:39:09 +0000 (13:39 +0000)]
remove a useless bit of me=umh (originally copied from JM, where it was used for something)

git-svn-id: svn://svn.videolan.org/x264/trunk@731 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agofix a memleak in cqm
Loren Merritt [Sun, 27 Jan 2008 11:50:50 +0000 (11:50 +0000)]
fix a memleak in cqm

git-svn-id: svn://svn.videolan.org/x264/trunk@730 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agofix a memleak in mkv muxer
Loren Merritt [Sun, 27 Jan 2008 11:49:16 +0000 (11:49 +0000)]
fix a memleak in mkv muxer
patch by saintdev

git-svn-id: svn://svn.videolan.org/x264/trunk@729 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agosatd exhaustive motion search (--me tesa)
Loren Merritt [Sun, 27 Jan 2008 11:36:11 +0000 (11:36 +0000)]
satd exhaustive motion search (--me tesa)

git-svn-id: svn://svn.videolan.org/x264/trunk@728 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agofix cabac context for nonzero delta_qp of the 2nd mb of a frame in interlaced mode
Loren Merritt [Sun, 27 Jan 2008 11:09:52 +0000 (11:09 +0000)]
fix cabac context for nonzero delta_qp of the 2nd mb of a frame in interlaced mode

git-svn-id: svn://svn.videolan.org/x264/trunk@727 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agofix mapping of mvs to partitions in p4x4_chroma
Loren Merritt [Sun, 27 Jan 2008 10:32:36 +0000 (10:32 +0000)]
fix mapping of mvs to partitions in p4x4_chroma
patch by Noboru Asai

git-svn-id: svn://svn.videolan.org/x264/trunk@726 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agofix mvp for b16x8 and b8x16 L1 search
Loren Merritt [Sun, 27 Jan 2008 10:12:24 +0000 (10:12 +0000)]
fix mvp for b16x8 and b8x16 L1 search
patch by Wei-Yin Chen

git-svn-id: svn://svn.videolan.org/x264/trunk@725 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agoshave a couple cycles off cabac functions
Loren Merritt [Sun, 27 Jan 2008 10:05:20 +0000 (10:05 +0000)]
shave a couple cycles off cabac functions

git-svn-id: svn://svn.videolan.org/x264/trunk@724 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agofaster and smaller x264_macroblock_cache_mv etc
Loren Merritt [Sun, 27 Jan 2008 09:12:39 +0000 (09:12 +0000)]
faster and smaller x264_macroblock_cache_mv etc

git-svn-id: svn://svn.videolan.org/x264/trunk@723 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agoconfigure test for endianness
Loren Merritt [Sun, 27 Jan 2008 09:11:01 +0000 (09:11 +0000)]
configure test for endianness

git-svn-id: svn://svn.videolan.org/x264/trunk@722 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agochange the meaning of --ref: it now selects DPB size (including B-frames), rather...
Loren Merritt [Fri, 18 Jan 2008 00:42:38 +0000 (00:42 +0000)]
change the meaning of --ref: it now selects DPB size (including B-frames), rather than L0 size (which B-frames are added to)

git-svn-id: svn://svn.videolan.org/x264/trunk@721 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agoadd / fix support for FreeBSD, based on a patch by Igor Mozolevsky % igor A hybrid...
Guillaume Poirier [Mon, 14 Jan 2008 09:54:33 +0000 (09:54 +0000)]
add / fix support for FreeBSD, based on a patch by Igor Mozolevsky % igor A hybrid-lab P co P uk %

git-svn-id: svn://svn.videolan.org/x264/trunk@720 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agoshut up some valgrind warnings
Loren Merritt [Wed, 9 Jan 2008 11:25:09 +0000 (11:25 +0000)]
shut up some valgrind warnings

git-svn-id: svn://svn.videolan.org/x264/trunk@719 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agoslightly wrong memory allocation in r717, fixes a potential crash with merange>32
Loren Merritt [Tue, 8 Jan 2008 18:10:51 +0000 (18:10 +0000)]
slightly wrong memory allocation in r717, fixes a potential crash with merange>32

git-svn-id: svn://svn.videolan.org/x264/trunk@718 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agoconvert absolute difference of sums from mmx to sse2
Loren Merritt [Sun, 6 Jan 2008 08:15:04 +0000 (08:15 +0000)]
convert absolute difference of sums from mmx to sse2
convert mv bits cost and ads threshold from C to sse2
convert bytemask-to-list from C to scalar asm
1.6x faster me=esa (x86_64) or 1.3x faster (x86_32). (times consider only motion estimation. overall encode speedup may vary.)

git-svn-id: svn://svn.videolan.org/x264/trunk@717 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agoround esa range to a multiple of 4
Loren Merritt [Sun, 6 Jan 2008 08:06:36 +0000 (08:06 +0000)]
round esa range to a multiple of 4

git-svn-id: svn://svn.videolan.org/x264/trunk@716 df754926-b1dd-0310-bc7b-ec298dee348c
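
A minimal sketch of rounding a search range to a multiple of 4. The commit message does not say whether the range is rounded up or down; this example assumes rounding up for non-negative values, and the function name is hypothetical rather than taken from x264.

```c
/* Hypothetical helper, not x264's actual code.
 * Assumption: the range is non-negative and is rounded UP
 * to the next multiple of 4. */
static inline int round_up_to_multiple_of_4(int range)
{
    /* Adding 3 and clearing the two low bits rounds up to a multiple of 4. */
    return (range + 3) & ~3;
}
```

With this helper a range of 17 becomes 20, while a range that is already a multiple of 4, such as 16, is left unchanged.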

16 years agouse define _WIN32 instead of __WIN32__ or WIN32 defines.
Guillaume Poirier [Thu, 3 Jan 2008 22:24:38 +0000 (22:24 +0000)]
use define _WIN32 instead of __WIN32__ or WIN32 defines.
MSDN reference: http://msdn2.microsoft.com/en-us/library/b0084kay(VS.80).aspx
Patch by BugMaster %BugMaster A narod P ru%
Original thread:
date: Dec 27, 2007 3:18 AM
subject: [x264-devel] VS2008 compilation error (need of replacement __WIN32__ with _WIN32)

git-svn-id: svn://svn.videolan.org/x264/trunk@715 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agotweak x264_pixel_sad_x4_16x16_sse2 horizontal sum. 168 -> 166 cycles on core2.
Loren Merritt [Fri, 21 Dec 2007 01:57:14 +0000 (01:57 +0000)]
tweak x264_pixel_sad_x4_16x16_sse2 horizontal sum. 168 -> 166 cycles on core2.

git-svn-id: svn://svn.videolan.org/x264/trunk@714 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agofix a nondeterminism involving 8x8dct, rdo, and threads.
Loren Merritt [Thu, 20 Dec 2007 19:24:17 +0000 (19:24 +0000)]
fix a nondeterminism involving 8x8dct, rdo, and threads.

git-svn-id: svn://svn.videolan.org/x264/trunk@713 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agoalso test arch-specific x264_zigzag_* implementations in checkasm.c
Guillaume Poirier [Thu, 13 Dec 2007 15:43:41 +0000 (15:43 +0000)]
also test arch-specific x264_zigzag_* implementations in checkasm.c
Patch by Noboru Asai % noboru P asai A gmail P com%

git-svn-id: svn://svn.videolan.org/x264/trunk@712 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agoAdd AltiVec implementation of
Guillaume Poirier [Mon, 10 Dec 2007 22:09:13 +0000 (22:09 +0000)]
Add AltiVec implementation of
- x264_zigzag_scan_4x4_frame_altivec()
- x264_zigzag_scan_4x4ac_frame_altivec()
- x264_zigzag_scan_4x4_field_altivec()
- x264_zigzag_scan_4x4ac_field_altivec()
each around 1.3 to 1.8x faster than C version
Patch by Noboru Asai % noboru P asai A gmail P com%

git-svn-id: svn://svn.videolan.org/x264/trunk@711 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agoadds AltiVec implementation of predict_16x16_p()
Guillaume Poirier [Sun, 9 Dec 2007 15:50:52 +0000 (15:50 +0000)]
adds AltiVec implementation of predict_16x16_p()
over 4x faster than C version

git-svn-id: svn://svn.videolan.org/x264/trunk@710 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agorevert the x86_32 part of r708. elf shared libraries aren't important enough to be...
Loren Merritt [Tue, 4 Dec 2007 21:56:18 +0000 (21:56 +0000)]
revert the x86_32 part of r708. elf shared libraries aren't important enough to be worth the extra lines of code to check for nasm.

git-svn-id: svn://svn.videolan.org/x264/trunk@709 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agomark asm functions as hidden
Loren Merritt [Mon, 3 Dec 2007 01:17:23 +0000 (01:17 +0000)]
mark asm functions as hidden

git-svn-id: svn://svn.videolan.org/x264/trunk@708 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agocheck whether ld supports -Bsymbolic before using it
Loren Merritt [Mon, 3 Dec 2007 01:16:57 +0000 (01:16 +0000)]
check whether ld supports -Bsymbolic before using it

git-svn-id: svn://svn.videolan.org/x264/trunk@707 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agoreduce the data type used in some tables. 16KB smaller exe.
Loren Merritt [Sun, 2 Dec 2007 15:57:43 +0000 (15:57 +0000)]
reduce the data type used in some tables. 16KB smaller exe.

git-svn-id: svn://svn.videolan.org/x264/trunk@706 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agofaster removal of duplicate mv predictors
Loren Merritt [Sat, 1 Dec 2007 18:03:16 +0000 (18:03 +0000)]
faster removal of duplicate mv predictors

git-svn-id: svn://svn.videolan.org/x264/trunk@705 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agoavoid a division in x264_mb_predict_mv_ref16x16.
Loren Merritt [Sat, 1 Dec 2007 15:17:19 +0000 (15:17 +0000)]
avoid a division in x264_mb_predict_mv_ref16x16.
patch by Fiona Glaser.

git-svn-id: svn://svn.videolan.org/x264/trunk@704 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agoavoid a division in umh.
Loren Merritt [Sat, 1 Dec 2007 02:58:34 +0000 (02:58 +0000)]
avoid a division in umh.
patch by Fiona Glaser.

git-svn-id: svn://svn.videolan.org/x264/trunk@703 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agofix a memleak in h->mb.mvr
Loren Merritt [Mon, 26 Nov 2007 11:44:37 +0000 (11:44 +0000)]
fix a memleak in h->mb.mvr

git-svn-id: svn://svn.videolan.org/x264/trunk@702 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agofix compilation as a shared library on x86_64 (regression in r696)
Loren Merritt [Sun, 25 Nov 2007 12:38:19 +0000 (12:38 +0000)]
fix compilation as a shared library on x86_64 (regression in r696)

git-svn-id: svn://svn.videolan.org/x264/trunk@701 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agoadd support for x86_64 on Darwin9.0 (Mac OS X 10.5, aka Leopard)
Guillaume Poirier [Wed, 21 Nov 2007 18:30:49 +0000 (18:30 +0000)]
add support for x86_64 on Darwin9.0 (Mac OS X 10.5, aka Leopard)
Patch by Antoine Gerschenfeld %gerschen A clipper P ens P fr%

git-svn-id: svn://svn.videolan.org/x264/trunk@700 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agocover some more options in fprofile. (esa, bime, cqm, nr, no-dct-decimate, trellis2)
Loren Merritt [Wed, 21 Nov 2007 11:52:19 +0000 (11:52 +0000)]
cover some more options in fprofile. (esa, bime, cqm, nr, no-dct-decimate, trellis2)
previously, esa was slower with fprofile than without, since gcc thought it wasn't important. now esa benefits like anything else.

git-svn-id: svn://svn.videolan.org/x264/trunk@699 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agoAdd AltiVec implementation of x264_pixel_ssd_8x8, 3x faster than C version
Guillaume Poirier [Tue, 20 Nov 2007 18:22:03 +0000 (18:22 +0000)]
Add AltiVec implementation of x264_pixel_ssd_8x8, 3x faster than C version
Overall speed-up: 0.7% with  --bframes 3 --ref 5 -m 7 --b-rdo
Patch by Noboru Asai %noboru P asai A gmail P com%

git-svn-id: svn://svn.videolan.org/x264/trunk@698 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agolimit mvs to [-512,511.75] instead of [-512,512]
Loren Merritt [Tue, 20 Nov 2007 08:53:26 +0000 (08:53 +0000)]
limit mvs to [-512,511.75] instead of [-512,512]

git-svn-id: svn://svn.videolan.org/x264/trunk@697 df754926-b1dd-0310-bc7b-ec298dee348c
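
A minimal sketch of the new limit, assuming motion vectors are stored in quarter-pel units, so that [-512, 511.75] pixels corresponds to the integer range [-2048, 2047]. The constant and function names are hypothetical and are not x264's own.

```c
/* Assumption: motion vectors are stored in quarter-pel units,
 * so the pixel range [-512, 511.75] maps to [-2048, 2047]. */
enum {
    MV_MIN_QPEL = -512 * 4,     /* -512 pixels   */
    MV_MAX_QPEL = 512 * 4 - 1   /* 511.75 pixels */
};

/* Hypothetical helper, not x264's actual function:
 * clamp one motion vector component to the allowed range. */
static inline int clip_mv_qpel(int mv)
{
    if (mv < MV_MIN_QPEL) return MV_MIN_QPEL;
    if (mv > MV_MAX_QPEL) return MV_MAX_QPEL;
    return mv;
}
```

Under that assumption, the old upper bound of 512 pixels (2048 quarter-pels) is now clamped to 2047, which also fits in a 12-bit signed field.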

16 years agoavoid memory loads that span the border between two cachelines.
Loren Merritt [Tue, 20 Nov 2007 06:07:17 +0000 (06:07 +0000)]
avoid memory loads that span the border between two cachelines.
on core2 this makes x264_pixel_sad an average of 2x faster. other intel cpus gain various amounts. amd are unaffected.
overall speedup: 1-10%, depending on how much time is spent in fullpel motion estimation.

git-svn-id: svn://svn.videolan.org/x264/trunk@696 df754926-b1dd-0310-bc7b-ec298dee348c
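
A minimal sketch of how one might test whether a load crosses a cacheline border. It assumes 64-byte cachelines (the real size depends on the CPU), and the helper name is hypothetical. It only illustrates the condition being avoided, not how the commit's asm handles it.

```c
#include <stddef.h>
#include <stdint.h>

/* Assumption: 64-byte cachelines; the actual size varies by CPU. */
#define CACHELINE_SIZE 64

/* Hypothetical helper: returns 1 if loading `size` bytes starting at
 * `addr` touches two cachelines, 0 if it stays within one. */
static inline int load_spans_cacheline(uintptr_t addr, size_t size)
{
    /* Offset within the line plus the load size exceeding the line
     * size means the load spills into the next line. */
    return (addr & (CACHELINE_SIZE - 1)) + size > CACHELINE_SIZE;
}
```

For example, a 16-byte load at offset 56 within a line spans two lines, while the same load at offset 48 ends exactly on the border and does not.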

16 years agoadd cache info to cpu_detect. also print sse3.
Loren Merritt [Tue, 20 Nov 2007 05:57:29 +0000 (05:57 +0000)]
add cache info to cpu_detect. also print sse3.

git-svn-id: svn://svn.videolan.org/x264/trunk@695 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agocosmetics: reorder mc_luma/mc_chroma/get_ref arguments for consistency with other...
Loren Merritt [Mon, 19 Nov 2007 17:10:57 +0000 (17:10 +0000)]
cosmetics: reorder mc_luma/mc_chroma/get_ref arguments for consistency with other functions

git-svn-id: svn://svn.videolan.org/x264/trunk@694 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agoseparate pixel_avg into cases for mc and for bipred
Loren Merritt [Mon, 19 Nov 2007 17:08:07 +0000 (17:08 +0000)]
separate pixel_avg into cases for mc and for bipred

git-svn-id: svn://svn.videolan.org/x264/trunk@693 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agoadd AltiVec implementation of ssim_4x4x2_core, about 4x faster than C version.
Guillaume Poirier [Sun, 18 Nov 2007 23:58:18 +0000 (23:58 +0000)]
add AltiVec implementation of ssim_4x4x2_core, about 4x faster than C version.
Overall: 0.1-0.2% faster with default encoding settings
Patch by Noboru Asai %noboru P asai A gmail P com%

git-svn-id: svn://svn.videolan.org/x264/trunk@692 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agoAdd AltiVec implementation of x264_hpel_filter. Provides a 10-11% overall speed-up...
Guillaume Poirier [Sun, 18 Nov 2007 23:47:41 +0000 (23:47 +0000)]
Add AltiVec implementation of x264_hpel_filter. Provides a 10-11% overall speed-up with default encoding options
Patch by Noboru Asai %noboru P asai A gmail P com%

git-svn-id: svn://svn.videolan.org/x264/trunk@691 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agocosmetics in dsp function selection
Loren Merritt [Sun, 18 Nov 2007 01:45:44 +0000 (01:45 +0000)]
cosmetics in dsp function selection

git-svn-id: svn://svn.videolan.org/x264/trunk@690 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agoremove sad_pde. it's been unused ever since successive elimination replaced it.
Loren Merritt [Sat, 17 Nov 2007 10:21:46 +0000 (10:21 +0000)]
remove sad_pde. it's been unused ever since successive elimination replaced it.

git-svn-id: svn://svn.videolan.org/x264/trunk@689 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agocosmetics: use symbolic constants for frame padding radius
Loren Merritt [Fri, 16 Nov 2007 10:27:14 +0000 (10:27 +0000)]
cosmetics: use symbolic constants for frame padding radius

git-svn-id: svn://svn.videolan.org/x264/trunk@688 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agomove hpel_filter cpu detection to a function pointer like everything else
Loren Merritt [Fri, 16 Nov 2007 09:17:58 +0000 (09:17 +0000)]
move hpel_filter cpu detection to a function pointer like everything else

git-svn-id: svn://svn.videolan.org/x264/trunk@687 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agocosmetics: use separate variables for frame width and stride
Loren Merritt [Thu, 15 Nov 2007 10:50:37 +0000 (10:50 +0000)]
cosmetics: use separate variables for frame width and stride

git-svn-id: svn://svn.videolan.org/x264/trunk@686 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agoAdd AltiVec implementation of add4x4_idct, add8x8_idct, add16x16_idct, 3.2x faster...
Guillaume Poirier [Mon, 12 Nov 2007 20:36:33 +0000 (20:36 +0000)]
Add AltiVec implementation of add4x4_idct, add8x8_idct, add16x16_idct, 3.2x faster on average
1.05x faster overall with default encoding options
Patch by Noboru Asai % noboru DD asai AA gmail DD com %

git-svn-id: svn://svn.videolan.org/x264/trunk@685 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agoadd AltiVec implementation of dequant_4x4 and dequant_8x8, 2.8x faster than C,
Guillaume Poirier [Mon, 12 Nov 2007 20:28:30 +0000 (20:28 +0000)]
add AltiVec implementation of dequant_4x4 and dequant_8x8, 2.8x faster than C,
 1.01x faster than previous revision with default encoding options
Patch by Noboru Asai % noboru DD asai AA gmail DD com %

git-svn-id: svn://svn.videolan.org/x264/trunk@684 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agoAdd AltiVec implementation of quant_2x2_dc,
Guillaume Poirier [Mon, 12 Nov 2007 12:47:38 +0000 (12:47 +0000)]
Add AltiVec implementation of quant_2x2_dc,
fix Altivec implementation of quant_(4x4|8x8)(|_dc) wrt current C implementation
Patch by Noboru Asai % noboru DD asai AA gmail DD com %

git-svn-id: svn://svn.videolan.org/x264/trunk@683 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agofix a possible nondeterminism with me=umh + threads.
Loren Merritt [Thu, 1 Nov 2007 12:21:13 +0000 (12:21 +0000)]
fix a possible nondeterminism with me=umh + threads.

git-svn-id: svn://svn.videolan.org/x264/trunk@682 df754926-b1dd-0310-bc7b-ec298dee348c

16 years agouse hex instead of dia for rdo mv refinement. ~0.5% lower bitrate at subme=7.
Loren Merritt [Mon, 29 Oct 2007 14:48:46 +0000 (14:48 +0000)]
use hex instead of dia for rdo mv refinement. ~0.5% lower bitrate at subme=7.
patch by Fiona Glaser.

git-svn-id: svn://svn.videolan.org/x264/trunk@681 df754926-b1dd-0310-bc7b-ec298dee348c

17 years agoport sad_*_x3_sse2 to x86_64
Loren Merritt [Mon, 24 Sep 2007 13:37:44 +0000 (13:37 +0000)]
port sad_*_x3_sse2 to x86_64

git-svn-id: svn://svn.videolan.org/x264/trunk@680 df754926-b1dd-0310-bc7b-ec298dee348c

17 years agodon't overwrite pthread* namespace, because system headers might define those functio...
Loren Merritt [Mon, 24 Sep 2007 11:24:28 +0000 (11:24 +0000)]
don't overwrite pthread* namespace, because system headers might define those functions even if we don't want them

git-svn-id: svn://svn.videolan.org/x264/trunk@679 df754926-b1dd-0310-bc7b-ec298dee348c

17 years agofaster 4x4 sad
Loren Merritt [Fri, 21 Sep 2007 20:20:22 +0000 (20:20 +0000)]
faster 4x4 sad

git-svn-id: svn://svn.videolan.org/x264/trunk@678 df754926-b1dd-0310-bc7b-ec298dee348c

17 years agofix an arithmetic overflow in trellis at high qp.
Loren Merritt [Thu, 20 Sep 2007 08:10:45 +0000 (08:10 +0000)]
fix an arithmetic overflow in trellis at high qp.

git-svn-id: svn://svn.videolan.org/x264/trunk@677 df754926-b1dd-0310-bc7b-ec298dee348c

17 years agoimplement multithreaded me=esa
Loren Merritt [Sat, 15 Sep 2007 06:34:05 +0000 (06:34 +0000)]
implement multithreaded me=esa

git-svn-id: svn://svn.videolan.org/x264/trunk@676 df754926-b1dd-0310-bc7b-ec298dee348c

17 years agofix some integer overflows. now vbv size can exceed 2 Gbit.
Loren Merritt [Wed, 12 Sep 2007 05:42:23 +0000 (05:42 +0000)]
fix some integer overflows. now vbv size can exceed 2 Gbit.

git-svn-id: svn://svn.videolan.org/x264/trunk@675 df754926-b1dd-0310-bc7b-ec298dee348c
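
A minimal sketch of the kind of overflow involved, assuming the buffer size is given in kbit as a 32-bit int and that 1 kbit is 1000 bits. The helper is hypothetical and is not the code changed by this commit.

```c
#include <stdint.h>

/* Hypothetical helper: convert a buffer size in kbit to bits.
 * Assumption: 1 kbit = 1000 bits. */
static inline int64_t kbit_to_bits(int kbit)
{
    /* Widen to 64 bits BEFORE multiplying. With a 32-bit int,
     * `kbit * 1000` would overflow once the result passes about 2.1 Gbit. */
    return (int64_t)kbit * 1000;
}
```

With the widening cast, a size of 3000000 kbit yields 3000000000 bits, which does not fit in a signed 32-bit integer.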

17 years agoallow --vbv-init to take absolute values (in kbit), in addition to the previous fract...
Loren Merritt [Sun, 9 Sep 2007 03:17:20 +0000 (03:17 +0000)]
allow --vbv-init to take absolute values (in kbit), in addition to the previous fractions of vbv-bufsize.

git-svn-id: svn://svn.videolan.org/x264/trunk@674 df754926-b1dd-0310-bc7b-ec298dee348c

17 years agoremove a bashism
Loren Merritt [Fri, 7 Sep 2007 20:40:13 +0000 (20:40 +0000)]
remove a bashism

git-svn-id: svn://svn.videolan.org/x264/trunk@673 df754926-b1dd-0310-bc7b-ec298dee348c

17 years agoreorder headers so that largefile support is defined before the first copy of stdio
Loren Merritt [Sun, 2 Sep 2007 04:32:17 +0000 (04:32 +0000)]
reorder headers so that largefile support is defined before the first copy of stdio

git-svn-id: svn://svn.videolan.org/x264/trunk@672 df754926-b1dd-0310-bc7b-ec298dee348c

17 years agoregression in r669: broke saving of configure args if make has to re-run configure
Loren Merritt [Mon, 20 Aug 2007 16:44:42 +0000 (16:44 +0000)]
regression in r669: broke saving of configure args if make has to re-run configure

git-svn-id: svn://svn.videolan.org/x264/trunk@671 df754926-b1dd-0310-bc7b-ec298dee348c

17 years agoregression in r669: --enable-shared should imply --enable-pic on some archs.
Loren Merritt [Sat, 18 Aug 2007 01:13:22 +0000 (01:13 +0000)]
regression in r669: --enable-shared should imply --enable-pic on some archs.

git-svn-id: svn://svn.videolan.org/x264/trunk@670 df754926-b1dd-0310-bc7b-ec298dee348c

17 years ago* Add a --host flag to allow overriding config.guess; this is particularly
Loïc Minier [Sun, 12 Aug 2007 12:46:15 +0000 (12:46 +0000)]
* Add a --host flag to allow overriding config.guess; this is particularly
  useful with a 64-bit kernel running a 32-bit userland to build 32-bit
  apps.
* Normalize any host triplet into a quadruplet via config.sub.
* Move option parsing before any use of architecture information.

git-svn-id: svn://svn.videolan.org/x264/trunk@669 df754926-b1dd-0310-bc7b-ec298dee348c

17 years ago* Update config.guess.
Loïc Minier [Sun, 12 Aug 2007 12:36:23 +0000 (12:36 +0000)]
* Update config.guess.

git-svn-id: svn://svn.videolan.org/x264/trunk@668 df754926-b1dd-0310-bc7b-ec298dee348c

17 years agomingw doesn't have strtok_r
Loren Merritt [Tue, 17 Jul 2007 11:24:26 +0000 (11:24 +0000)]
mingw doesn't have strtok_r

git-svn-id: svn://svn.videolan.org/x264/trunk@667 df754926-b1dd-0310-bc7b-ec298dee348c
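
A minimal sketch of a portable stand-in for strtok_r, for platforms that lack it. The commit message does not say how x264 works around the missing function, so this is only one possible approach; the name fallback_strtok_r is hypothetical.

```c
#include <string.h>

/* Hypothetical fallback with the same contract as POSIX strtok_r:
 * pass the string on the first call and NULL afterwards; the scan
 * position is kept in *saveptr instead of hidden static state. */
static char *fallback_strtok_r(char *str, const char *delim, char **saveptr)
{
    char *token;

    if (!str)
        str = *saveptr;

    /* Skip any leading delimiters. */
    str += strspn(str, delim);
    if (*str == '\0') {
        *saveptr = str;
        return NULL;            /* no more tokens */
    }

    /* The token runs until the next delimiter or the end of the string. */
    token = str;
    str += strcspn(str, delim);
    if (*str != '\0')
        *str++ = '\0';          /* terminate the token, step past delimiter */

    *saveptr = str;
    return token;
}
```

Splitting "a,b,,c" on "," with this function yields "a", "b" and "c", then NULL; empty fields are skipped, as with the standard function.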

17 years agomove os/compiler specific defines to their own header
Loren Merritt [Tue, 17 Jul 2007 11:11:19 +0000 (11:11 +0000)]
move os/compiler specific defines to their own header

git-svn-id: svn://svn.videolan.org/x264/trunk@666 df754926-b1dd-0310-bc7b-ec298dee348c

17 years agoextend zones to support (some) encoding parameters in addition to ratecontrol.
Loren Merritt [Thu, 12 Jul 2007 23:48:23 +0000 (23:48 +0000)]
extend zones to support (some) encoding parameters in addition to ratecontrol.

git-svn-id: svn://svn.videolan.org/x264/trunk@665 df754926-b1dd-0310-bc7b-ec298dee348c

17 years agocosmetics
Loren Merritt [Fri, 6 Jul 2007 17:08:26 +0000 (17:08 +0000)]
cosmetics

git-svn-id: svn://svn.videolan.org/x264/trunk@664 df754926-b1dd-0310-bc7b-ec298dee348c

17 years agolimit vertical motion vectors to +/-512, since some decoders actually depend on that...
Loren Merritt [Thu, 28 Jun 2007 21:26:21 +0000 (21:26 +0000)]
limit vertical motion vectors to +/-512, since some decoders actually depend on that limit.

git-svn-id: svn://svn.videolan.org/x264/trunk@663 df754926-b1dd-0310-bc7b-ec298dee348c

17 years agoAdd vertical and horizontal luma deblocking accelerated with Altivec,
Guillaume Poirier [Fri, 22 Jun 2007 21:42:41 +0000 (21:42 +0000)]
Add vertical and horizontal luma deblocking accelerated with Altivec,
based on Graham Booker's code written for FFmpeg with slight modifications
to re-use x264's macros

git-svn-id: svn://svn.videolan.org/x264/trunk@662 df754926-b1dd-0310-bc7b-ec298dee348c

17 years agocosmetics in cpu detection
Loren Merritt [Sat, 16 Jun 2007 04:09:01 +0000 (04:09 +0000)]
cosmetics in cpu detection

git-svn-id: svn://svn.videolan.org/x264/trunk@661 df754926-b1dd-0310-bc7b-ec298dee348c

17 years agofix compilation without asm on x86_32 (r658 worked only on x86_64).
Loren Merritt [Sat, 16 Jun 2007 04:02:48 +0000 (04:02 +0000)]
fix compilation without asm on x86_32 (r658 worked only on x86_64).

git-svn-id: svn://svn.videolan.org/x264/trunk@660 df754926-b1dd-0310-bc7b-ec298dee348c

17 years agoexempt 1080p from the non-mod16 warning.
Loren Merritt [Sun, 10 Jun 2007 23:46:31 +0000 (23:46 +0000)]
exempt 1080p from the non-mod16 warning.

git-svn-id: svn://svn.videolan.org/x264/trunk@659 df754926-b1dd-0310-bc7b-ec298dee348c

17 years agoallow compiling without yasm/nasm on x86 and x86-64 platforms
Alex Izvorski [Tue, 5 Jun 2007 18:38:31 +0000 (18:38 +0000)]
allow compiling without yasm/nasm on x86 and x86-64 platforms

git-svn-id: svn://svn.videolan.org/x264/trunk@658 df754926-b1dd-0310-bc7b-ec298dee348c

17 years agoupdated MS VC8/VC7 build, patch by Gabriel Bouvigne
Alex Izvorski [Tue, 5 Jun 2007 18:32:13 +0000 (18:32 +0000)]
updated MS VC8/VC7 build, patch by Gabriel Bouvigne

git-svn-id: svn://svn.videolan.org/x264/trunk@657 df754926-b1dd-0310-bc7b-ec298dee348c

17 years agoreplace alloca with malloc everywhere. per manpage, use of alloca is discouraged...
Alex Izvorski [Sat, 26 May 2007 03:13:08 +0000 (03:13 +0000)]
replace alloca with malloc everywhere. per manpage, use of alloca is discouraged. this may have a minor effect on the speed of ssim and esa, but that appears too small to measure.

git-svn-id: svn://svn.videolan.org/x264/trunk@656 df754926-b1dd-0310-bc7b-ec298dee348c

17 years agorequire a ratecontrol method to be specified, it no longer defaults to cqp=26.
Loren Merritt [Wed, 2 May 2007 21:33:43 +0000 (21:33 +0000)]
require a ratecontrol method to be specified, it no longer defaults to cqp=26.

git-svn-id: svn://svn.videolan.org/x264/trunk@655 df754926-b1dd-0310-bc7b-ec298dee348c

17 years agofix nnz computation in cavlc+8x8dct+deblock. (regression in r607)
Loren Merritt [Mon, 23 Apr 2007 08:34:52 +0000 (08:34 +0000)]
fix nnz computation in cavlc+8x8dct+deblock. (regression in r607)

git-svn-id: svn://svn.videolan.org/x264/trunk@654 df754926-b1dd-0310-bc7b-ec298dee348c

17 years agofix the computation of bits used for vbv. (regression in r651)
Loren Merritt [Mon, 23 Apr 2007 07:09:30 +0000 (07:09 +0000)]
fix the computation of bits used for vbv. (regression in r651)

git-svn-id: svn://svn.videolan.org/x264/trunk@653 df754926-b1dd-0310-bc7b-ec298dee348c

17 years agoc89 compile fix
Loren Merritt [Sun, 22 Apr 2007 03:37:56 +0000 (03:37 +0000)]
c89 compile fix

git-svn-id: svn://svn.videolan.org/x264/trunk@652 df754926-b1dd-0310-bc7b-ec298dee348c

17 years agocabac: use bytestream instead of bitstream.
Loren Merritt [Sat, 21 Apr 2007 11:32:34 +0000 (11:32 +0000)]
cabac: use bytestream instead of bitstream.
35% faster cabac, 20% faster overall lossless, ~1% faster overall at normal bitrates.

git-svn-id: svn://svn.videolan.org/x264/trunk@651 df754926-b1dd-0310-bc7b-ec298dee348c